PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.03G016600.1.p
Common NameGLYMA_03G016600, LOC100818372
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 837aa    MW: 91372.3 Da    PI: 6.3528
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.03G016600.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox65.38.1e-21134189156
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          +++ +++t++q++eLe+lF+++++p++++r eL+++l L++rqVk+WFqNrR+++k
  Glyma.03G016600.1.p 134 KKRYHRHTPQQIQELEALFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMK 189
                          688999***********************************************999 PP

2START211.33.5e-663375611206
                          HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.... CS
                START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla.... 77 
                          ela++a++elvk+a+ +ep+W++ +    e++n +e++++f++s +     + +ea+r+ g+v+ ++  lve+l+d++ +W e+++    
  Glyma.03G016600.1.p 337 ELALAAMDELVKMAQTGEPLWMRNVeggrEILNHEEYVRNFTPSIGlrpngFVSEASRENGMVIINSLALVETLMDSN-RWAEMFPciia 425
                          5899****************************************99********************************.*********** PP

                          EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTC CS
                START  78 kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksng 160
                          + +t+evissg      galqlm aelq+lsplvp R++ f+R+++q+ +g w++vdvS+ds ++ +  + +v +++lpSg+++++++ng
  Glyma.03G016600.1.p 426 RTSTTEVISSGingtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAEGLWAVVDVSIDSIRESSGAPTFVNGRRLPSGCVVQDMPNG 515
                          *****************************************************************999********************** PP

                          EEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 161 hskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          +skvtwveh++++++++h+l+r+l++sg+ +ga++wvatlqrqce+
  Glyma.03G016600.1.p 516 YSKVTWVEHAEYEESQVHQLYRPLLSSGMGFGAQRWVATLQRQCEC 561
                          ********************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466899.61E-21117191IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.609.3E-22121191IPR009057Homeodomain-like
PROSITE profilePS5007117.216131191IPR001356Homeobox domain
SMARTSM003891.7E-17132195IPR001356Homeobox domain
CDDcd000862.09E-18133191No hitNo description
PfamPF000462.2E-18134189IPR001356Homeobox domain
PROSITE patternPS000270166189IPR017970Homeobox, conserved site
PROSITE profilePS5084845.039328564IPR002913START domain
SuperFamilySSF559613.98E-33330561No hitNo description
CDDcd088755.62E-126332560No hitNo description
SMARTSM002349.2E-51337561IPR002913START domain
PfamPF018526.2E-58337561IPR002913START domain
Gene3DG3DSA:3.30.530.203.5E-4441530IPR023393START-like domain
SuperFamilySSF559613.39E-22588760No hitNo description
SuperFamilySSF559613.39E-22789829No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 837 aa     Download sequence    Send to blast
MSFGGFLDDK SGSGGARINN FSDIPYNNNN VTNTTTTNNN NNDRMPFGAI SQPRLVTTTP  60
TLAKSMFNSP GLSLALQQTS IDGQEDVNRM AENSFEPNGL RRSREDEHES RSGSDNMDGG  120
SGDEHDAADN PPRKKRYHRH TPQQIQELEA LFKECPHPDE KQRLELSRRL CLETRQVKFW  180
FQNRRTQMKT QLERHENTLL RQENDKLRAE NMSIRDAMRN PMCSNCGGLA IIGEISLEEQ  240
HLRIENARLK DELDRVCALA GKFLGRPVSS LPSLELGMGG NGFAGMPAAT LPLAQDFAMG  300
MSVSMNNNAL AMVSPPTSTR PAAAGFDRSV ERSMFLELAL AAMDELVKMA QTGEPLWMRN  360
VEGGREILNH EEYVRNFTPS IGLRPNGFVS EASRENGMVI INSLALVETL MDSNRWAEMF  420
PCIIARTSTT EVISSGINGT RNGALQLMHA ELQVLSPLVP VREVNFLRFC KQHAEGLWAV  480
VDVSIDSIRE SSGAPTFVNG RRLPSGCVVQ DMPNGYSKVT WVEHAEYEES QVHQLYRPLL  540
SSGMGFGAQR WVATLQRQCE CLAILMSSAA PSRDHSAITA GGRRSMVKLA QRMTNNFCAG  600
VCASTVHKWN KLNAAANVDE DVRVMTRKSV DDPGEPPGIV LSAATSVWLP VSPHRLFDFL  660
RDERLRSEWD ILSNGGPMQE MAHIAKGQDH GNAVSLLRAS AINSNQSSML ILQETCIDAA  720
GSLVVYAPVD IPAMHVVMNG GDSAYVALLP SGFAIVPDGP GSRGPHQNGP TSSTTTTTNG  780
GDNGVTRVSG SLLTVAFQIL VNSLPTAKLT VESVETVNNL ISCTVQKIKA ALHCES*
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.03G016600.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006576359.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X1
RefseqXP_028224092.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0R0KKG10.0A0A0R0KKG1_SOYBN; Uncharacterized protein
STRINGGLYMA03G01860.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Representative plantOGRP14515136
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]