PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.09G241800.1.p
Common NameGLYMA_09G2418001, LOC100804456
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family HD-ZIP
Protein Properties Length: 230aa    MW: 25443.4 Da    PI: 8.3781
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.09G241800.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox61.11.7e-1968122256
                          T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
             Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                          rk+ ++tkeq  +Lee F+++++++ ++++ LA++l+L+ rqV vWFqNrRa+ k
  Glyma.09G241800.1.p  68 RKKLRLTKEQSMVLEETFKEHSTLNPKRKQALAEELNLKPRQVEVWFQNRRARTK 122
                          78889************************************************98 PP

2HD-ZIP_I/II127.84.5e-4168157191
          HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLree 90 
                          +kk+rl+keq+ +LEe+F+e+++L+p+rK++la+eL+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr+y++l+een+rL+kev+eLr +
  Glyma.09G241800.1.p  68 RKKLRLTKEQSMVLEETFKEHSTLNPKRKQALAEELNLKPRQVEVWFQNRRARTKLKQTEVDCEYLKRCYENLTEENRRLHKEVQELR-A 156
                          69*************************************************************************************9.5 PP

          HD-ZIP_I/II  91 l 91 
                          l
  Glyma.09G241800.1.p 157 L 157
                          5 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF046185.2E-6236IPR006712HD-ZIP protein, N-terminal
Gene3DG3DSA:1.10.10.603.0E-2051118IPR009057Homeodomain-like
SuperFamilySSF466892.05E-1955125IPR009057Homeodomain-like
PROSITE profilePS5007117.89664124IPR001356Homeobox domain
SMARTSM003891.5E-1666128IPR001356Homeobox domain
CDDcd000867.24E-1768125No hitNo description
PfamPF000466.5E-1768122IPR001356Homeobox domain
PROSITE patternPS00027099122IPR017970Homeobox, conserved site
PfamPF021836.9E-12124158IPR003106Leucine zipper, homeobox-associated
SMARTSM003404.0E-26124167IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 230 aa     Download sequence    Send to blast
MIRGIDVNSA AECDGVSSPN SAVSSVSGGD GKQSERDDDN NAAAVAGERT SCSRGSDDDD  60
GGGSDASRKK LRLTKEQSMV LEETFKEHST LNPKRKQALA EELNLKPRQV EVWFQNRRAR  120
TKLKQTEVDC EYLKRCYENL TEENRRLHKE VQELRALKLS PQMYMHMNPP TTLTICPSCE  180
RTHSFASSST ATIHSAVAAT SSNRKLFGTN IRLPVSFNTR PFEGPIPRP*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
16672SRKKLRL
2116124RRARTKLKQ
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.566390.0somatic embryo
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapGlyma.09G241800.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAC2358920.0AC235892.2 Glycine max clone GM_WBc0089E01, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_003534468.11e-171homeobox-leucine zipper protein HAT3
RefseqXP_006587771.11e-171homeobox-leucine zipper protein HAT3
SwissprotP466025e-82HAT3_ARATH; Homeobox-leucine zipper protein HAT3
TrEMBLI1L6151e-170I1L615_SOYBN; Uncharacterized protein
STRINGGLYMA09G37680.11e-171(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF123934105
Representative plantOGRP19616156
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16780.12e-69homeobox protein 2
Publications ? help Back to Top
  1. Ding Y, et al.
    Four distinct types of dehydration stress memory genes in Arabidopsis thaliana.
    BMC Plant Biol., 2013. 13: p. 229
    [PMID:24377444]
  2. Zou LJ, et al.
    Role of Transcription Factor HAT1 in Modulating Arabidopsis thaliana Response to Cucumber mosaic virus.
    Plant Cell Physiol., 2016. 57(9): p. 1879-89
    [PMID:27328697]
  3. Caggiano MP, et al.
    Cell type boundaries organize plant development.
    Elife, 2018.
    [PMID:28895530]