PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.20G156500.1.p
Common NameGLYMA_20G156500, LOC100785102
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 778aa    MW: 85118.9 Da    PI: 6.1507
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.20G156500.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.81.3e-2179134156
                          TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          +++ +++t++q++eLe++F+++++p++++r +L+k+lgL+ +qVk+WFqNrR+++k
  Glyma.20G156500.1.p  79 KKRYHRHTPHQIQELEAFFKECPHPDEKQRLDLSKRLGLENKQVKFWFQNRRTQMK 134
                          688999***********************************************999 PP

2START179.12.6e-562895122206
                          HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....E CS
                START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....k 78 
                          la++a++el+k+ +ae+p+W ks     e+ n +e+ + f++  +     + +ea r++g+v+ ++  lve+l+d + +W e+++    +
  Glyma.20G156500.1.p 289 LALAAMEELLKMTQAESPLWIKSLdgekEIFNHEEYARLFSPCIGpkpagYVTEATRETGIVIINSLALVETLMDAN-RWAEMFPsmiaR 377
                          6899********************99988999********99998********************************.*******99999 PP

                          EEEEEEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCE CS
                START  79 aetlevissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksngh 161
                          a  l+vis+g      galq+m ae q+lsplvp R + f+R+++q+ +g+w++vdvS++   +  + ++++ +++lpSg+++++++ng+
  Glyma.20G156500.1.p 378 AINLDVISNGmggtrnGALQVMHAEVQLLSPLVPvRQVRFIRFCKQHAEGVWAVVDVSIEIGHDAANAQPSISCRRLPSGCIVQDMPNGY 467
                          999*********************************************************9999999999******************** PP

                          EEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
                START 162 skvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                          skvtw+eh++++++++h+l+r+l++sg+ +ga +w atlqrqce+
  Glyma.20G156500.1.p 468 SKVTWLEHWEYDENVVHQLYRPLLSSGVGFGAHRWIATLQRQCEC 512
                          *******************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.2E-2263130IPR009057Homeodomain-like
SuperFamilySSF466891.96E-2065136IPR009057Homeodomain-like
PROSITE profilePS5007117.65376136IPR001356Homeobox domain
SMARTSM003891.1E-1977140IPR001356Homeobox domain
CDDcd000862.43E-1978136No hitNo description
PfamPF000463.1E-1979134IPR001356Homeobox domain
PROSITE patternPS000270111134IPR017970Homeobox, conserved site
PROSITE profilePS5084841.143279515IPR002913START domain
SuperFamilySSF559612.56E-31281512No hitNo description
CDDcd088752.64E-117283511No hitNo description
SMARTSM002344.3E-36288512IPR002913START domain
PfamPF018525.5E-47289512IPR002913START domain
SuperFamilySSF559614.4E-21540771No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 778 aa     Download sequence    Send to blast
MIFISQSIPS SMLNSSSFSY STQRKMEGPS EIGLIGENFD AGLMGRMRDD EYESRSGSDN  60
FEGASGDDQD GGDDQPQRKK RYHRHTPHQI QELEAFFKEC PHPDEKQRLD LSKRLGLENK  120
QVKFWFQNRR TQMKTQLERH ENIMLRQEND KLRAENSLIK EAMSNPVCNN CGGPAIPGQI  180
SFEEHQIRIE NARLKDELNR ICVLANKFLG KPISSLTSPM ALTTSNSGLE LGIGRNGIGG  240
SSTLGTPLPM GLDLGDGVLG TQPAMPGVRS ALGLMGNEVQ LERSMLIDLA LAAMEELLKM  300
TQAESPLWIK SLDGEKEIFN HEEYARLFSP CIGPKPAGYV TEATRETGIV IINSLALVET  360
LMDANRWAEM FPSMIARAIN LDVISNGMGG TRNGALQVMH AEVQLLSPLV PVRQVRFIRF  420
CKQHAEGVWA VVDVSIEIGH DAANAQPSIS CRRLPSGCIV QDMPNGYSKV TWLEHWEYDE  480
NVVHQLYRPL LSSGVGFGAH RWIATLQRQC ECLAILMSSS ISSDSHTALS QAGRRSMLKL  540
AQRMTSNFCS GVCASSARKW DSLHIGTLGD DMKVMTRKNV DDPGEPPGIV LSAATSVWMP  600
VSRQRLFDFL RDERLRSEWD ILSNGGPMQE MVHIAKGQGH GNCVSLLRAN AVNANDSSML  660
ILQETWMDAS CSVVVYAPVD VQSLNVVMSG GDSAYVALLP SGFAILPDGH CNDNGCNGSL  720
QKGRGSDDGS GGSLLTVGFQ ILVNSLPTAK LTVESVDTVN NLISCTIQKI KAALRVA*
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in roots, stems, leaves and floral buds. {ECO:0000269|PubMed:10402424}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.20G156500.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAP0150410.0AP015041.1 Vigna angularis var. angularis DNA, chromosome 8, almost complete sequence, cultivar: Shumari.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006606096.10.0homeobox-leucine zipper protein ANTHOCYANINLESS 2 isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLI1NGR20.0I1NGR2_SOYBN; Uncharacterized protein
STRINGGLYMA20G29580.20.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Representative plantOGRP14515136
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]