PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CCG007219.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family HD-ZIP
Protein Properties Length: 741aa    MW: 81814.9 Da    PI: 5.3554
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CCG007219.1genomeLZUView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox29.61.2e-0961912656
                 -HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox 26 saeereeLAkklgLterqVkvWFqNrRakek 56
                 +a++++eL  +lgL+ +q+k+WFqNrR+++k
  CCG007219.1 61 TANQIQELELWLGLESKQIKFWFQNRRTQMK 91
                 6789************************999 PP

2START161.27.7e-512414702205
                  HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.....SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECT CS
        START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv....dsgealrasgvvdmvlallveellddkeqWdetla....kaetleviss 87 
                  la++a++el+k+a+ e+p W ks     e +n +e++++f++ +     + +ea r+sgvv  +   lve+l+d++  W e+++    +a+t++ iss
  CCG007219.1 241 LALAAMDELIKMAQIESPIWIKSLdggkEVLNHEEYMRTFPRIGMkpsnFVTEATRESGVVLVNISALVETLMDVN-GWVEMFPsliaRAATTDIISS 337
                  6899**********************9999**********8866699999**************************.********************* PP

                  T......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE........EEEEEEE-E CS
        START  88 g......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh........skvtwvehv 170
                  g      galq   ae+q++sp vp R ++f+R ++ql++g+w++vdvS+d +q++ + +    +++lpSg++i+++ ng          +vtwveh 
  CCG007219.1 338 GmggtksGALQMRHAEFQLISPFVPvRQVTFIRLCKQLTEGVWAVVDVSIDANQENLNAQAPETCKRLPSGCIIQDMNNGCsklfapavREVTWVEHS 435
                  ********************************************************9966666679***************88888888889****** PP

                  E--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
        START 171 dlkgrlphwllrslvksglaegaktwvatlqrqce 205
                  +++++ +h+l+r++++sg+ +ga++w+atlqr+ e
  CCG007219.1 436 EYDESAVHQLYRPILGSGRGFGAQRWLATLQRYYE 470
                  *******************************9865 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466895.01E-94093IPR009057Homeodomain-like
CDDcd000862.37E-64993No hitNo description
PfamPF000464.0E-76191IPR001356Homeobox domain
PROSITE profilePS5007110.2366193IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.602.9E-76199IPR009057Homeodomain-like
PROSITE profilePS5084834.821231474IPR002913START domain
SuperFamilySSF559611.04E-25232470No hitNo description
CDDcd088751.02E-106235467No hitNo description
SMARTSM002342.3E-34240471IPR002913START domain
PfamPF018526.5E-43241469IPR002913START domain
SuperFamilySSF559611.08E-13502735No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 741 aa     Download sequence    Send to blast
MDGRGDMGLF GEHFDPCLVG RIKEDGYYES RSGSDNIEGA SGEDQDVGDD QRPRKKYNRH  60
TANQIQELEL WLGLESKQIK FWFQNRRTQM KTQLERHENV ILRQENDKLR LENELLKQNM  120
SDPICNNCGG PVVPGPVSYE QQQLRIENAR LTDELGRVCA LANKFLGRPL TSSANPIPPL  180
SSKSKLDLAV GINGYGNLGH TDNMLPMVLD NNRAIMMSLM KPIGNAVGKE VPHDRSIFVD  240
LALAAMDELI KMAQIESPIW IKSLDGGKEV LNHEEYMRTF PRIGMKPSNF VTEATRESGV  300
VLVNISALVE TLMDVNGWVE MFPSLIARAA TTDIISSGMG GTKSGALQMR HAEFQLISPF  360
VPVRQVTFIR LCKQLTEGVW AVVDVSIDAN QENLNAQAPE TCKRLPSGCI IQDMNNGCSK  420
LFAPAVREVT WVEHSEYDES AVHQLYRPIL GSGRGFGAQR WLATLQRYYE GMAMIMSPSI  480
LGEDQTVINL GGKKSMLKLA RRMVDNFCSG VCASSLHKWG NPVAGNVSED VRILTRKSIN  540
EPGEPDGIVL SAATSVWLPV SRQRLFDFLR DEKSRSHWDI LSNGGILQEI IQIPKGQGQG  600
QWNRVSLLRS TAVDADAVEN NMLILQETWN DVSGSLVVYA PVDLQSMSVV TSGGDSTYVA  660
LLPSGFVILP DNSFSNGEPS NSDGNPVKRD SDSNNGGGSF FTVGFQILAS NLPSAELTVE  720
SVETIHNLIS CTMHRIRTAF N
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011012369.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
RefseqXP_011012370.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLB9I4A90.0B9I4A9_POPTR; Uncharacterized protein
STRINGPOPTR_0012s13390.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]