PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.005G230100.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family GRAS
Protein Properties Length: 694aa    MW: 77016.9 Da    PI: 5.683
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.005G230100.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS298.51.7e-912996822373
                  GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPi 90 
                             lLl+cA+av++g+  +a++lL++++ ++sp+gd++qRla +f++AL arla+++++ly++l +++ts     e ++a++++  vs +
  Sobic.005G230100.1.p 299 HALLLQCAHAVATGNRLQATELLCKIKRHSSPTGDATQRLAYCFARALDARLAGTGNQLYRSLMAKHTS---AMEFIKAYQMYLAVSCF 384
                           579*************************************************************99999...99**************9 PP

                  GRAS  91 lkfshltaNqaIleavege.....ervHiiDfd.isqGlQWpaLlqaLasRp.....egppslRiTgvgspesg..skeeleetgerLa 166
                             +++   N +I +av+g+     +++Hi+D++   +G++Wp+Ll    +++     +gpp++RiT vg p++g   + ++eetg+rL+
  Sobic.005G230100.1.p 385 KMMAFKFSNLTICKAVAGStsrrkKKLHIVDYGeHCYGFHWPTLLGFWGTKTwdedeAGPPEVRITFVGLPQPGfrPAARIEETGRRLS 473
                           999999999999999999888878899*****83569************9999888889**************999************* PP

                  GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvs...leserdevLklvkslsPkvvvvveqeadhnses 252
                            fA+++g+pf+f+  +a ++e++  ++L+++p+E+l+Vn  ++  r++de  +   ++s+rd +L  +++++P+v++++ +++ hn++ 
  Sobic.005G230100.1.p 474 TFARQCGIPFRFRC-IAAKWETVCADDLDLEPDEVLVVNGLFHFGRMMDEGIDdiySPSPRDVLLGNIQKMRPHVFILCVENSLHNAPY 561
                           **************.799******************************987651114569***************************** PP

                  GRAS 253 FlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklll 341
                           Fl rf eal yys++fd+++a +pr++++r  vE+ l+g ++ n+vaceg++r+er et+++W++r ++aG++++pl+ +++k ++  +
  Sobic.005G230100.1.p 562 FLGRFQEALFYYSSMFDMMDAAAPRNNDQRLLVEQDLFGGRVLNAVACEGFDRVERPETYKQWQARNDRAGLRQLPLDPDIVKAVSDKV 650
                           ***************************************************************************************** PP

                  GRAS 342 rkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                           r  + + + v  +++++++gWk+r L+++S W
  Sobic.005G230100.1.p 651 RDNYHRDFVVYVDQKWILQGWKGRILYAMSTW 682
                           ****999************************* PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098552.444272663IPR005202Transcription factor GRAS
PfamPF035145.7E-89299682IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 694 aa     Download sequence    Send to blast
MAAEPEDIPD EEPFSPSIFL NLSPTPAPHR DDDHQDPANH QASQTSTEIL SVANTISGGV  60
GTSGFTLSPC FSDTTATVLD GVTCWPYDPV ELSQKLLYST TRTASLHVQD ASASVGFWQN  120
GGKLKTATTA PASAGDDVGE HDALFSGGGA GETSRVTMDM LNQAFLKGME EANKFLPTNN  180
TLLTHFHLET ISESGDRHGI AAGQKRHNHQ EDNDSLEAEA GRKSKVVAPE PEETGEMVDR  240
FVLVGYQSLL DKMMDMSIAV DSEAEHKART KQGKKKPTIT MAASSSSKEE EEEVVVDLHA  300
LLLQCAHAVA TGNRLQATEL LCKIKRHSSP TGDATQRLAY CFARALDARL AGTGNQLYRS  360
LMAKHTSAME FIKAYQMYLA VSCFKMMAFK FSNLTICKAV AGSTSRRKKK LHIVDYGEHC  420
YGFHWPTLLG FWGTKTWDED EAGPPEVRIT FVGLPQPGFR PAARIEETGR RLSTFARQCG  480
IPFRFRCIAA KWETVCADDL DLEPDEVLVV NGLFHFGRMM DEGIDDIYSP SPRDVLLGNI  540
QKMRPHVFIL CVENSLHNAP YFLGRFQEAL FYYSSMFDMM DAAAPRNNDQ RLLVEQDLFG  600
GRVLNAVACE GFDRVERPET YKQWQARNDR AGLRQLPLDP DIVKAVSDKV RDNYHRDFVV  660
YVDQKWILQG WKGRILYAMS TWVANEDAIS NLS*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A6e-363026828374GRAS family transcription factor containing p
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1405410RRKKKL
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002450068.10.0hypothetical protein SORBIDRAFT_05g027783, partial
TrEMBLC5Y8M00.0C5Y8M0_SORBI; Putative uncharacterized protein Sb05g027783 (Fragment)
STRINGSb05g027783.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP26615129
MonocotsOGMP23131297
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G07530.11e-119SCARECROW-like 14