PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CCG005953.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Salicaceae; Saliceae; Populus
Family HD-ZIP
Protein Properties Length: 748aa    MW: 82381.6 Da    PI: 5.444
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CCG005953.1genomeLZUView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.11e-2053108156
                  TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
     Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                  r+k  ++t++q++eLe++F+++++p++++r eL+++lgL+ +q+k+WFqNrR+++k
  CCG005953.1  53 RKKYNRHTANQIQELESFFKECPHPDEKQRSELSRRLGLESKQIKFWFQNRRTQMK 108
                  79999************************************************999 PP

2START181.25.8e-572584802205
                  HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEEC CS
        START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevis 86 
                  la++a++el+k a++e+p W ks     e +n +e++++f++  +     + +ea r+sgvv +++ +lve+l+d++  W e+++    +a+t++ +s
  CCG005953.1 258 LALAAMDELIKIAQVESPIWIKSLdggkEVLNHEEYMRTFPPCIGmkpsnFVIEATRESGVVLANSLDLVETLMDVN-GWVEMFPsliaRAATIDIVS 354
                  6899**********************9999************999999999**************************.******************** PP

                  TT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXX CS
        START  87 sg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlp 177
                  sg      galq + ae+q++sp vp R + f+R ++ql +g+w+++dvSvd +q++ + +  v +++lpSg++i+++ ng +kvtwveh +++++ +
  CCG005953.1 355 SGmggtksGALQMIHAEFQVISPFVPvRQVKFLRLCKQLAEGVWAVADVSVDGNQENLNAQTPVTCRRLPSGCIIQDMNNGCCKVTWVEHSEYDESAV 452
                  *********************************************************999999999******************************** PP

                  HHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
        START 178 hwllrslvksglaegaktwvatlqrqce 205
                  h l+r++++sg+ +ga++w a+lqr+ e
  CCG005953.1 453 HRLYRHILNSGMGFGAQRWIAALQRHYE 480
                  *************************977 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466891.04E-1939110IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.602.0E-2140110IPR009057Homeodomain-like
PROSITE profilePS5007118.25350110IPR001356Homeobox domain
SMARTSM003891.5E-1852114IPR001356Homeobox domain
PfamPF000462.8E-1853108IPR001356Homeobox domain
CDDcd000869.61E-1953110No hitNo description
PROSITE patternPS00027085108IPR017970Homeobox, conserved site
PROSITE profilePS5084839.624248484IPR002913START domain
SuperFamilySSF559615.86E-32249480No hitNo description
CDDcd088751.66E-114252480No hitNo description
SMARTSM002341.6E-40257481IPR002913START domain
PfamPF018522.6E-49258480IPR002913START domain
Gene3DG3DSA:3.30.530.201.2E-4319478IPR023393START-like domain
SuperFamilySSF559619.77E-16509744No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 748 aa     Download sequence    Send to blast
MDSHGDMGLL GEHFDPSLVG KMREDGYESR SGSDNIEGAS GEDQDAGDYQ RPRKKYNRHT  60
ANQIQELESF FKECPHPDEK QRSELSRRLG LESKQIKFWF QNRRTQMKTQ LERHENAILR  120
QENDKLRAEN ELLKQNMSDP ICNNCGGPVV PVPVSYEQQQ LRIENARLKD ELGRVCALAN  180
KFLGRPLTSS ASPVPPFGSN TKFDLAVGRN GYGNLGHTDN TLPMGLDNNG GVMMPLIKPI  240
GNAVGNEVPF DRSMFVDLAL AAMDELIKIA QVESPIWIKS LDGGKEVLNH EEYMRTFPPC  300
IGMKPSNFVI EATRESGVVL ANSLDLVETL MDVNGWVEMF PSLIARAATI DIVSSGMGGT  360
KSGALQMIHA EFQVISPFVP VRQVKFLRLC KQLAEGVWAV ADVSVDGNQE NLNAQTPVTC  420
RRLPSGCIIQ DMNNGCCKVT WVEHSEYDES AVHRLYRHIL NSGMGFGAQR WIAALQRHYE  480
CMAMLLSPTI LGEDQTVINL GGKKSMLKLA RRMVDSFCSG VCASTLHNWA NLVVESVSED  540
VRILTRKIID EPGEPDGIVL SVSTSVWLPV SQQRLFDFLR DEQSRSQWDI LSNGGILQEM  600
VQIPKGQGHW NTVSVLRSTA VDANASDNML ILQETWNDVS GSLVVYAPVD VQSVSVVMNG  660
GDSTYVALLP SGFVILPGNS FSNGEPNNCN GNPAKRDCDG NSGGGSFLTV GFQILASNLP  720
SAKLTVESVK TVQNLISCTM QRIKTAFN
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011048682.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A2K1XMT60.0A0A2K1XMT6_POPTR; Uncharacterized protein
STRINGPOPTR_0015s13340.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]