PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.007G152300.1.p
Common NameSb07g022620, SORBIDRAFT_07g022620
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family HD-ZIP
Protein Properties Length: 383aa    MW: 39140.7 Da    PI: 8.0773
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.007G152300.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox57.32.6e-18196250256
                           T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
              Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                           rk+ +++keq   Lee F+++++++ ++++ LAk+l+L  rqV vWFqNrRa+ k
  Sobic.007G152300.1.p 196 RKKLRLSKEQSAFLEESFKEHSTLNPKQKAALAKQLNLRPRQVEVWFQNRRARTK 250
                           788899***********************************************98 PP

2HD-ZIP_I/II127.74.9e-41196286192
           HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLre 89 
                           +kk+rlskeq+++LEesF+e+++L+p++K++la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+ke +eLr 
  Sobic.007G152300.1.p 196 RKKLRLSKEQSAFLEESFKEHSTLNPKQKAALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCETLTEENRRLHKELAELR- 283
                           69*************************************************************************************9. PP

           HD-ZIP_I/II  90 elk 92 
                           +lk
  Sobic.007G152300.1.p 284 ALK 286
                           555 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.602.3E-18180246IPR009057Homeodomain-like
SuperFamilySSF466892.48E-18184253IPR009057Homeodomain-like
PROSITE profilePS5007117.248192252IPR001356Homeobox domain
SMARTSM003894.9E-16194256IPR001356Homeobox domain
CDDcd000861.24E-16196253No hitNo description
PfamPF000469.8E-16196250IPR001356Homeobox domain
PROSITE patternPS000270227250IPR017970Homeobox, conserved site
SMARTSM003404.4E-25252295IPR003106Leucine zipper, homeobox-associated
PfamPF021831.3E-10252286IPR003106Leucine zipper, homeobox-associated
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 383 aa     Download sequence    Send to blast
MELRLSLGEA AVPDAGRAAV PELGLGLGVG IGASAAAGSG RREEGGTGNR AAPGTGTGTG  60
TRWWAAPATP EPAAVRLSLV SSSLGLQWPP SDAGICHAGR AEAPAARGFD VNRPAPSSSA  120
VAASALLALE DDEDDPGAAA ALSSSPNDSA GSFPLDLGGG PHAHHAEGGA AAQAAGGGGE  180
RSSSRASDED DGASARKKLR LSKEQSAFLE ESFKEHSTLN PKQKAALAKQ LNLRPRQVEV  240
WFQNRRARTK LKQTEVDCEY LKRCCETLTE ENRRLHKELA ELRALKAAPP FFMRLPATTL  300
SMCPSCERVA SGPNPAASTS APVSLSSSSP PATATATAVA APVVRGEHRP PSSFAALFAA  360
TRSFTLASQP RPPAPAPASN CL*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1244252RRARTKLKQ
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sbi.40241e-99callus| embryo
Expression -- Description ? help Back to Top
Source Description
UniprotTISSUE SPECIFICITY: Expressed in seedlings, roots, stems, leaf sheaths and blades and panicles. {ECO:0000269|PubMed:17999151}.
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor. {ECO:0000250}.
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.007G152300.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK2411961e-174AK241196.1 Oryza sativa Japonica Group cDNA, clone: J065121H01, full insert sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021320313.10.0homeobox-leucine zipper protein HOX27
SwissprotQ6YPD01e-118HOX27_ORYSJ; Homeobox-leucine zipper protein HOX27
TrEMBLA0A1B6PHX60.0A0A1B6PHX6_SORBI; Uncharacterized protein
STRINGSb07g022620.11e-176(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP52093363
Representative plantOGRP19616156
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G06710.16e-58homeobox from Arabidopsis thaliana