PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CCG010605.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family HD-ZIP
Protein Properties Length: 821aa    MW: 89229.3 Da    PI: 6.0332
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CCG010605.1genomeLZUView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.47.9e-21116171156
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  +++ +++t++q++eLe+lF+++++p++++r eL+++l L++rqVk+WFqNrR+++k
  CCG010605.1 116 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMK 171
                  688999***********************************************999 PP

2START196.21.4e-613265481203
                  HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEE CS
        START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevi 85 
                  ela++a++elvk+a+ +ep+W +s     e++n +e+l+++++  +     + +ea+r++g+v+ ++  lve+l+d++ +W e+++    + +t++vi
  CCG010605.1 326 ELALAAMDELVKMAQTDEPLWIRSFdggrEILNHEEYLRTITPCIGmkpsgFVSEASRETGMVIINSLALVETLMDSN-RWAEMFPcviaRTSTTDVI 422
                  5899*************************************99988999*9***************************.******************* PP

                  CTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--..-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SS CS
        START  86 ssg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe.sssvvRaellpSgiliepksnghskvtwvehvdlkgr 175
                   +g      g lqlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvSvd  ++ +   + +v +++lpSg+++++++ng+skvtw+eh++++++
  CCG010605.1 423 ANGmggtrnGSLQLMHAELQVLSPLVPvREVNFLRFCKQHAEGVWAVVDVSVDTIRETSGaPPTFVNCRRLPSGCVVQDMPNGYSKVTWIEHAEYDES 520
                  ******************************************************999999999*********************************** PP

                  XXHHHHHHHHHHHHHHHHHHHHHHTXXX CS
        START 176 lphwllrslvksglaegaktwvatlqrq 203
                  + h+l+r+l++sg+ +ga++w atlqrq
  CCG010605.1 521 QTHQLYRPLISSGMGFGAQRWIATLQRQ 548
                  **************************98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.07E-2099173IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.0E-21103173IPR009057Homeodomain-like
PROSITE profilePS5007117.216113173IPR001356Homeobox domain
SMARTSM003891.7E-17114177IPR001356Homeobox domain
CDDcd000862.03E-18115173No hitNo description
PfamPF000462.2E-18116171IPR001356Homeobox domain
PROSITE patternPS000270148171IPR017970Homeobox, conserved site
PROSITE profilePS5084842.246317554IPR002913START domain
SuperFamilySSF559612.67E-33319548No hitNo description
CDDcd088752.29E-121321549No hitNo description
PfamPF018522.8E-53326548IPR002913START domain
SMARTSM002342.4E-44326551IPR002913START domain
SuperFamilySSF559613.39E-23577747No hitNo description
SuperFamilySSF559613.39E-23774814No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 821 aa     Download sequence    Send to blast
MSFGGFLENT SPGGGGARIV ADIPYNNNNM PTGAIVQPRL VSPSITKSMF NSPGLSLALQ  60
QPNIDGQGDI TRMSENFETS VGRRSREEEH ESRSGSDNMD GASGDDQDAA DNPPRKKRYH  120
RHTPQQIQEL EALFKECPHP DEKQRLELSR RLCLETRQVK FWFQNRRTQM KTQLERHENS  180
LLRQENDKLR AENMSIRDAM RNPMCSNCGG PAIIGDISLE EQHLRIENAR LKDELDRVCA  240
LAGKFLGRPI SSLASSLGPP MPNSSLELGV GSNGFAGLST VATTLPLGPD FVGGISGALP  300
VLAQTRPATT GVTGIGRSLE RSMFLELALA AMDELVKMAQ TDEPLWIRSF DGGREILNHE  360
EYLRTITPCI GMKPSGFVSE ASRETGMVII NSLALVETLM DSNRWAEMFP CVIARTSTTD  420
VIANGMGGTR NGSLQLMHAE LQVLSPLVPV REVNFLRFCK QHAEGVWAVV DVSVDTIRET  480
SGAPPTFVNC RRLPSGCVVQ DMPNGYSKVT WIEHAEYDES QTHQLYRPLI SSGMGFGAQR  540
WIATLQRQSD FAKRTVVDME GVDVAITASG RRSMLKLAQR MTANFCAGVC ASTVHKWNKL  600
NAGNVDEDVR VMTRKSVDDP GEPPGIVLSA ATSVWLPVSP QRLFDFLRDE RLRSEWDILS  660
NGGPMQEMAH IAKGQDHGNC VSLLRASAMN SNQSSMLILQ ETCIDAAGSL VVYAPVDIPA  720
MHVVMNGGDS AYVALLPSGF AIVPDGPGSR GPPTTNGGPT ANNNSNGCGP DRVSGSLLTV  780
AFQILVNSLP TAKLTVESVE TVNNLISCTV QKIKAALQCE S
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011035097.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLB9IC550.0B9IC55_POPTR; Uncharacterized protein
STRINGPOPTR_0014s07130.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]