PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.010G030200.1.p
Common Name Sb10g002750, SORBIDRAFT_10g002750
Organism Sorghum bicolor
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 319 aa    MW: 33412.5 Da    pI: 9.8441
Description HD-ZIP family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.010G030200.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     59.8   4.3e-19  127    181  2          56
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           rk+ +++k+q  +Lee+F+ +++++ +++  LA++lgL  rqV vWFqNrRa+ k
  Sobic.010G030200.1.p 127 RKKLRLSKDQAAVLEECFKTHSTLNPKQKLALATRLGLRPRQVEVWFQNRRARTK 181
                           678899***********************************************98 PP

2    HD-ZIP_I/II  123.4  1e-39    127    217  1          92
           HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLre 89 
                           +kk+rlsk+q+++LEe+F+++++L+p++K +la++Lgl+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l++en+rLeke ++Lr 
  Sobic.010G030200.1.p 127 RKKLRLSKDQAAVLEECFKTHSTLNPKQKLALATRLGLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRWCERLADENKRLEKELADLR- 214
                           69*************************************************************************************9. PP

           HD-ZIP_I/II  90 elk 92 
                           +lk
  Sobic.010G030200.1.p 215 ALK 217
                           555 PP
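
The alignment blocks above follow the usual HMMER layout: the model consensus on top, a match line in which a letter marks an identical residue and `+` a conservative substitution, the query sequence, and a posterior-probability (PP) line. As a rough illustration, the sketch below recounts the identical positions for the Homeobox block. The function name `count_identities` is hypothetical, and the two strings are copied by hand from that block.

```python
# Aligned strings copied from the Homeobox block (HMM consensus and query).
CONSENSUS = "rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
QUERY = "RKKLRLSKDQAAVLEECFKTHSTLNPKQKLALATRLGLRPRQVEVWFQNRRARTK"


def count_identities(consensus: str, query: str) -> int:
    """Count aligned columns where query and consensus carry the same residue.

    The comparison ignores case, and gap characters ('-' or '.') never count.
    """
    if len(consensus) != len(query):
        raise ValueError("aligned strings must have the same length")
    return sum(
        c.upper() == q.upper() and c not in "-."
        for c, q in zip(consensus, query)
    )


n = count_identities(CONSENSUS, QUERY)
print(f"{n}/{len(QUERY)} aligned positions are identical to the consensus")
```

For the Homeobox block this gives 25 of 55 positions, which agrees with the number of letters on the match line.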

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  2.3E-17   108    181  IPR009057    Homeodomain-like
PROSITE profile  PS50071           16.422    123    183  IPR001356    Homeobox domain
SMART            SM00389           6.1E-16   125    187  IPR001356    Homeobox domain
SuperFamily      SSF46689          5.85E-18  125    192  IPR009057    Homeodomain-like
Pfam             PF00046           1.8E-16   127    181  IPR001356    Homeobox domain
CDD              cd00086           1.94E-14  127    184  No hit       No description
PRINTS           PR00031           5.5E-5    154    163  IPR000047    Helix-turn-helix motif
PROSITE pattern  PS00027           0         158    181  IPR017970    Homeobox, conserved site
PRINTS           PR00031           5.5E-5    163    179  IPR000047    Helix-turn-helix motif
SMART            SM00340           2.1E-20   183    226  IPR003106    Leucine zipper, homeobox-associated
Pfam             PF02183           1.5E-9    183    217  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 319 aa
MMPRTSLDLG LSLGLSLTSQ GSLSSSTTTA GSSSPWAAAL SSVVADVAKA RDAGAHMLYH  60
ASASAAALDR AAMRASTSPD SAAALSSGGS GDNTTGTKRE RETELERTGS GGVRSDEEDG  120
VDGAGGRKKL RLSKDQAAVL EECFKTHSTL NPKQKLALAT RLGLRPRQVE VWFQNRRART  180
KLKQTEVDCE YLKRWCERLA DENKRLEKEL ADLRALKAAP SPAAAQPASP AATLTMCPSC  240
RRVAATATAA ASPTKHHHHQ QQQCHPKPSS SLTPAAAAAA GGAGSVVPSH CQFFPAAVDR  300
TSQSTWSTAA PLVTRELF*
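
The Start and End columns of the Signature Domain table appear to be 1-based and inclusive, since residues 127 to 181 of the sequence above reproduce the query line of the Homeobox alignment. A minimal Python sketch of that lookup follows; `clean_sequence` and `slice_region` are hypothetical helper names, not part of any PlantTFDB tool.

```python
import re

# Sequence block as displayed on the page: groups of ten residues,
# a running position counter at the end of each line, and '*' for the stop.
RAW = """
MMPRTSLDLG LSLGLSLTSQ GSLSSSTTTA GSSSPWAAAL SSVVADVAKA RDAGAHMLYH  60
ASASAAALDR AAMRASTSPD SAAALSSGGS GDNTTGTKRE RETELERTGS GGVRSDEEDG  120
VDGAGGRKKL RLSKDQAAVL EECFKTHSTL NPKQKLALAT RLGLRPRQVE VWFQNRRART  180
KLKQTEVDCE YLKRWCERLA DENKRLEKEL ADLRALKAAP SPAAAQPASP AATLTMCPSC  240
RRVAATATAA ASPTKHHHHQ QQQCHPKPSS SLTPAAAAAA GGAGSVVPSH CQFFPAAVDR  300
TSQSTWSTAA PLVTRELF*
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace, position counters and the trailing stop symbol."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW)
homeobox = slice_region(seq, 127, 181)   # Homeobox row of the domain table
hd_zip = slice_region(seq, 127, 217)     # HD-ZIP_I/II row of the domain table
print(homeobox)
print(hd_zip)
```

The same two helpers can be pointed at any Start/End pair in the Protein Features table, under the same assumption about the coordinate convention.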
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    175    183  RRARTKLKQ
Expression -- UniGene
UniGene ID  E-value  Expressed in
Sbi.1150    0.0      pollen
Expression -- Description
Source   Description
UniProt  TISSUE SPECIFICITY: Expressed in seedlings, roots, leaves, nodes, internodes, flowers and embryo. {ECO:0000269|PubMed:10732669, ECO:0000269|PubMed:17999151}.
Functional Description
Source   Description
UniProt  Probable transcription factor that binds to the DNA sequence 5'-CAAT[GC]ATTG-3'. {ECO:0000269|PubMed:10732669}.
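
The binding site in the UniProt annotation can be read as a regular expression in which `[GC]` allows either base at the central position. The sketch below scans a DNA string for that pattern; the function name `find_sites` and the example fragment are invented for illustration and do not come from the database. Because the reverse complement of CAATGATTG is CAATCATTG, a single-strand scan also covers sites on the opposite strand.

```python
import re

# Site from the UniProt annotation: 5'-CAAT[GC]ATTG-3'.
# The lookahead lets overlapping occurrences be reported as well.
SITE = re.compile(r"(?=(CAAT[GC]ATTG))")


def find_sites(dna: str) -> list[tuple[int, str]]:
    """Return (1-based start, matched 9-mer) for every occurrence."""
    dna = dna.upper()
    return [(m.start() + 1, m.group(1)) for m in SITE.finditer(dna)]


# Hypothetical promoter fragment, made up for this example only.
example = "ttgaCAATGATTGccatCAATCATTGaa"
print(find_sites(example))
# [(5, 'CAATGATTG'), (18, 'CAATCATTG')]
```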
Cis-element
Source       Link
PlantRegMap  Sobic.010G030200.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  FP093288  1e-137   FP093288.1 Phyllostachys edulis cDNA clone: bphylf053m07, full insert sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_002436446.1  0.0      homeobox-leucine zipper protein HOX2
Swissprot  Q5VPE3          1e-90    HOX2_ORYSJ; Homeobox-leucine zipper protein HOX2
Swissprot  Q84U86          1e-90    HOX2_ORYSI; Homeobox-leucine zipper protein HOX2
TrEMBL     C5Z3S7          0.0      C5Z3S7_SORBI; Uncharacterized protein
STRING     Sb10g002750.1   0.0      (Sorghum bicolor)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Monocots  OGMP38423376
Representative plant  OGRP19616156
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G60390.1  1e-50    homeobox-leucine zipper protein 3
Publications
  1. Kikuchi S, et al.
    Collection, mapping, and annotation of over 28,000 cDNA clones from japonica rice.
    Science, 2003. 301(5631): p. 376-9
    [PMID:12869764]