PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.001G447400.1.p
Common NameSb01g042030, SORBIDRAFT_01g042030
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 300aa    MW: 31691.6 Da    PI: 8.1606
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.001G447400.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox56.54.8e-18129183256
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           rk+ ++tkeq   Le+ F+++++++ +++  LAk+l+L  rqV vWFqNrRa+ k
  Sobic.001G447400.1.p 129 RKKLRLTKEQSALLEDRFKEHSTLNPKQKVALAKQLNLRPRQVEVWFQNRRARTK 183
                           788899***********************************************98 PP

2HD-ZIP_I/II124.93.8e-40129218191
           HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLre 89 
                           +kk+rl+keq++lLE+ F+e+++L+p++Kv+la++L+l+prqv+vWFqnrRARtk+kq+E+d+e Lkr++++l+een+rL++e +eLr 
  Sobic.001G447400.1.p 129 RKKLRLTKEQSALLEDRFKEHSTLNPKQKVALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCELLKRCCESLTEENRRLQRELQELR- 216
                           69*************************************************************************************9. PP

           HD-ZIP_I/II  90 el 91 
                           +l
  Sobic.001G447400.1.p 217 AL 218
                           55 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.98E-19122186IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.609.1E-18125188IPR009057Homeodomain-like
PROSITE profilePS5007117.232125185IPR001356Homeobox domain
SMARTSM003897.8E-16127189IPR001356Homeobox domain
CDDcd000863.07E-15129186No hitNo description
PfamPF000462.1E-15129183IPR001356Homeobox domain
PROSITE patternPS000270160183IPR017970Homeobox, conserved site
PfamPF021838.2E-11185219IPR003106Leucine zipper, homeobox-associated
SMARTSM003401.2E-22185228IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 300 aa     Download sequence    Send to blast
MAQEDVHHLD DAGLALGLSL GGGGASDAVR HGTSSSRLSM EAVARLPSPR PQLEPSLTLS  60
MPDEATATAT GSGGGGGGAA HSVSSLSVAG VKRERVDDAE GERASSTTAA AAARAVSAGA  120
EDDDDGSTRK KLRLTKEQSA LLEDRFKEHS TLNPKQKVAL AKQLNLRPRQ VEVWFQNRRA  180
RTKLKQTEVD CELLKRCCES LTEENRRLQR ELQELRALKF APLHPQAAQA PPSSAAQAAG  240
VPAPPQPFYM QMQLPAATLS LCPSCERLAG PAAAAKAEPD RPKAATHHFF NPFTHSAAC*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1127133TRKKLRL
2177185RRARTKLKQ
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, stems, leaf sheaths and blades and panicles. {ECO:0000269|PubMed:17999151}.
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, stems, leaf sheaths and blades and panicles. {ECO:0000269|PubMed:17999151}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
UniProtProbable transcription factor. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.001G447400.1.p
Regulation -- Description ? help Back to Top
Source Description
UniProtINDUCTION: In leaves by drought stress. {ECO:0000269|PubMed:17999151}.
UniProtINDUCTION: In leaves by drought stress. {ECO:0000269|PubMed:17999151}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0357540.0BT035754.2 Zea mays full-length cDNA clone ZM_BFb0081O05 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002465609.10.0homeobox-leucine zipper protein HOX19 isoform X2
SwissprotA2XE761e-107HOX19_ORYSI; Homeobox-leucine zipper protein HOX19
SwissprotQ8GRL41e-107HOX19_ORYSJ; Homeobox-leucine zipper protein HOX19
TrEMBLC5WRR40.0C5WRR4_SORBI; Uncharacterized protein
STRINGSb01g042030.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP20553698
Representative plantOGRP19616156
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G06710.13e-50homeobox from Arabidopsis thaliana