PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.18G225800.1.p
Common NameGLYMA_18G225800, LOC100787537
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 823aa    MW: 89779.7 Da    PI: 6.2157
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.18G225800.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.14.7e-21125180156
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          +++ +++t++q++eLe+lF+++++p++++r eL+++l+L++rqVk+WFqNrR+++k
  Glyma.18G225800.1.p 125 KKRYHRHTPQQIQELESLFKECPHPDEKQRLELSRRLNLETRQVKFWFQNRRTQMK 180
                          688999***********************************************999 PP

2START203.68e-643395631206
                          HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S.... CS
                START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla.... 77 
                          ela++a++elvk+a+ +ep+W +s     e++n +e+++++++  +     + +ea+r +g+v+ ++  lve+l+d++ +W+e+++    
  Glyma.18G225800.1.p 339 ELALAAMDELVKMAQTGEPLWIRSLeggrEILNHEEYTRTITPCIGlrpngFVTEASRQTGMVIINSLALVETLMDSN-RWSEMFPcmia 427
                          5899*************************************999989***9***************************.*********** PP

                          EEEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTC CS
                START  78 kaetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksng 160
                          + +t evis+g      galqlm aelq+lsplvp R++ f+R+++q+ +g w++vdvS+d  ++ +  + +v +++lpSg+++++++ng
  Glyma.18G225800.1.p 428 RTSTAEVISNGingtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAEGLWAVVDVSIDTIRETSGAPTFVNCRRLPSGCVVQDMPNG 517
                          *****************************************************************999********************** PP

                          EEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 161 hskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          +skvtwveh++++++++h+l+r+l++sg+ +ga++wv tlqrqce+
  Glyma.18G225800.1.p 518 YSKVTWVEHAEYDESQIHQLFRPLLSSGMGFGAQRWVTTLQRQCEC 563
                          ********************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.88E-21108182IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.609.0E-22115182IPR009057Homeodomain-like
PROSITE profilePS5007117.783122182IPR001356Homeobox domain
SMARTSM003892.4E-19123186IPR001356Homeobox domain
CDDcd000861.52E-19124182No hitNo description
PfamPF000461.1E-18125180IPR001356Homeobox domain
PROSITE patternPS000270157180IPR017970Homeobox, conserved site
PROSITE profilePS5084844.843330566IPR002913START domain
SuperFamilySSF559612.86E-34332563No hitNo description
CDDcd088753.30E-124334562No hitNo description
PfamPF018524.3E-55339563IPR002913START domain
SMARTSM002343.9E-46339563IPR002913START domain
SuperFamilySSF559617.03E-23591815No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 823 aa     Download sequence    Send to blast
MSFGGFLETK QSDGGGGRIV VSDIPYNSNN GSNHSNDIMP SGAISLPRLA TPTLAKSMFN  60
SPGLSLALQS DIDGQGDMNR LMPENFEQNG LRRSREEEHE SRSGSDNMDG GSGDDFDAAD  120
NPPRKKRYHR HTPQQIQELE SLFKECPHPD EKQRLELSRR LNLETRQVKF WFQNRRTQMK  180
TQLERHENSL LRQENDKLRA ENMSMREAMR NPICSNCGGP AMIGEISLEE QHLRIENARL  240
KDELDRVCAL AGKFLGRPVS SLTSSIGPPM PNSSLELGVG SNGFGQGLST VPSTMPDFGV  300
GISSPLAMVS PSSTRPTTTA LVTPSGFDNR SIERSIVLEL ALAAMDELVK MAQTGEPLWI  360
RSLEGGREIL NHEEYTRTIT PCIGLRPNGF VTEASRQTGM VIINSLALVE TLMDSNRWSE  420
MFPCMIARTS TAEVISNGIN GTRNGALQLM HAELQVLSPL VPVREVNFLR FCKQHAEGLW  480
AVVDVSIDTI RETSGAPTFV NCRRLPSGCV VQDMPNGYSK VTWVEHAEYD ESQIHQLFRP  540
LLSSGMGFGA QRWVTTLQRQ CECLAILMSS AAPSREHSAI SSGGRRSMLK LAHRMTNNFC  600
SGVCASTVHK WNKLNAGNVG EDVRVMTRKS VDDPGEPPGI VLSAATSVWL PVSSQRLFDF  660
LRDERLRSEW DILSNGGPMQ EMAHIAKGQD HANCVSLLRA SAINANQSSM LILQETCTDA  720
SGSLVVYAPV DIPAMHVVMN GGDSAYVALL PSGFAIVPDG SGEEQGGASQ QRAASGCLLT  780
VAFQILVNSL PTAKLTVESV ETVNNLISCT VQKIKSALHC ES*
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.458980.0leaf
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.18G225800.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankKT0311660.0KT031166.1 Glycine max clone HN_CCL_121 homeodomain/HOMEOBOX transcription factor (Glyma09g40130.1) mRNA, partial cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_003552359.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2
RefseqXP_006602766.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2
RefseqXP_014626485.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLK7MU390.0K7MU39_SOYBN; Uncharacterized protein
STRINGGLYMA18G45970.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Representative plantOGRP14515136
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]