PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.005G230100.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family GRAS
Protein Properties Length: 694aa    MW: 77016.9 Da    PI: 5.683
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.005G230100.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS298.51.7e-912996822373
                  GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPi 90 
                             lLl+cA+av++g+  +a++lL++++ ++sp+gd++qRla +f++AL arla+++++ly++l +++ts     e ++a++++  vs +
  Sobic.005G230100.1.p 299 HALLLQCAHAVATGNRLQATELLCKIKRHSSPTGDATQRLAYCFARALDARLAGTGNQLYRSLMAKHTS---AMEFIKAYQMYLAVSCF 384
                           579*************************************************************99999...99**************9 PP

                  GRAS  91 lkfshltaNqaIleavege.....ervHiiDfd.isqGlQWpaLlqaLasRp.....egppslRiTgvgspesg..skeeleetgerLa 166
                             +++   N +I +av+g+     +++Hi+D++   +G++Wp+Ll    +++     +gpp++RiT vg p++g   + ++eetg+rL+
  Sobic.005G230100.1.p 385 KMMAFKFSNLTICKAVAGStsrrkKKLHIVDYGeHCYGFHWPTLLGFWGTKTwdedeAGPPEVRITFVGLPQPGfrPAARIEETGRRLS 473
                           999999999999999999888878899*****83569************9999888889**************999************* PP

                  GRAS 167 kfAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvs...leserdevLklvkslsPkvvvvveqeadhnses 252
                            fA+++g+pf+f+  +a ++e++  ++L+++p+E+l+Vn  ++  r++de  +   ++s+rd +L  +++++P+v++++ +++ hn++ 
  Sobic.005G230100.1.p 474 TFARQCGIPFRFRC-IAAKWETVCADDLDLEPDEVLVVNGLFHFGRMMDEGIDdiySPSPRDVLLGNIQKMRPHVFILCVENSLHNAPY 561
                           **************.799******************************987651114569***************************** PP

                  GRAS 253 FlerflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklll 341
                           Fl rf eal yys++fd+++a +pr++++r  vE+ l+g ++ n+vaceg++r+er et+++W++r ++aG++++pl+ +++k ++  +
  Sobic.005G230100.1.p 562 FLGRFQEALFYYSSMFDMMDAAAPRNNDQRLLVEQDLFGGRVLNAVACEGFDRVERPETYKQWQARNDRAGLRQLPLDPDIVKAVSDKV 650
                           ***************************************************************************************** PP

                  GRAS 342 rkvksdgyrveeesgslvlgWkdrpLvsvSaW 373
                           r  + + + v  +++++++gWk+r L+++S W
  Sobic.005G230100.1.p 651 RDNYHRDFVVYVDQKWILQGWKGRILYAMSTW 682
                           ****999************************* PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098552.444272663IPR005202Transcription factor GRAS
PfamPF035145.7E-89299682IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 694 aa     Download sequence    Send to blast
MAAEPEDIPD EEPFSPSIFL NLSPTPAPHR DDDHQDPANH QASQTSTEIL SVANTISGGV  60
GTSGFTLSPC FSDTTATVLD GVTCWPYDPV ELSQKLLYST TRTASLHVQD ASASVGFWQN  120
GGKLKTATTA PASAGDDVGE HDALFSGGGA GETSRVTMDM LNQAFLKGME EANKFLPTNN  180
TLLTHFHLET ISESGDRHGI AAGQKRHNHQ EDNDSLEAEA GRKSKVVAPE PEETGEMVDR  240
FVLVGYQSLL DKMMDMSIAV DSEAEHKART KQGKKKPTIT MAASSSSKEE EEEVVVDLHA  300
LLLQCAHAVA TGNRLQATEL LCKIKRHSSP TGDATQRLAY CFARALDARL AGTGNQLYRS  360
LMAKHTSAME FIKAYQMYLA VSCFKMMAFK FSNLTICKAV AGSTSRRKKK LHIVDYGEHC  420
YGFHWPTLLG FWGTKTWDED EAGPPEVRIT FVGLPQPGFR PAARIEETGR RLSTFARQCG  480
IPFRFRCIAA KWETVCADDL DLEPDEVLVV NGLFHFGRMM DEGIDDIYSP SPRDVLLGNI  540
QKMRPHVFIL CVENSLHNAP YFLGRFQEAL FYYSSMFDMM DAAAPRNNDQ RLLVEQDLFG  600
GRVLNAVACE GFDRVERPET YKQWQARNDR AGLRQLPLDP DIVKAVSDKV RDNYHRDFVV  660
YVDQKWILQG WKGRILYAMS TWVANEDAIS NLS*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A1e-3730468225378Protein SCARECROW
5b3h_A1e-3730468224377Protein SCARECROW
5b3h_D1e-3730468224377Protein SCARECROW
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1405410RRKKKL
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.005G230100.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021316797.10.0scarecrow-like protein 34
TrEMBLA0A1B6PU740.0A0A1B6PU74_SORBI; Uncharacterized protein
STRINGPavir.Hb00042.1.p0.0(Panicum virgatum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP23131297
Representative plantOGRP26615129
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G07530.11e-119SCARECROW-like 14