PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.005G230600.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family GRAS
Protein Properties Length: 620aa    MW: 70574.6 Da    PI: 6.6987
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.005G230600.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS298.81.4e-912426113373
                  GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                           +lL++cA+avs+++++ a++lL++++++as +gd++qRla++f+++L+arl++++ +l + l+ ++      +e l+a++l+  ++ + 
  Sobic.005G230600.1.p 242 TLLINCAQAVSASNFRTAHELLKQIKQHASATGDATQRLAQCFAKGLEARLMGTGRQLWQLLTLEQPL---AIEYLKAYNLYMATCSFN 327
                           79*********************************************************999888887...9***************** PP

                  GRAS  92 kfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfef 178
                           +++ +    +I  a+ g++++Hi+D++  +G+QW+ Ll+ +a+R++gpp++RiT++++ ++    +e +  tg+rL k A+e+gvpf+f
  Sobic.005G230600.1.p 328 RVALFFNVMTIEHAMVGKSKLHIVDYGPHHGFQWAGLLRWMANREGGPPEVRITAISRLQPRscPSEGTDDTGRRLDKCAREFGVPFKF 416
                           **********************************************************9999999************************ PP

                  GRAS 179 nvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsles..erdevLklvkslsPkvvvvveqeadhnsesFlerflealeyys 265
                           ++ ++ ++e++++++L+++ +E+l+V   ++   l +e+   +    rd+vL+ +++++P+v++    +++h s+sFl+rf eal  ys
  Sobic.005G230600.1.p 417 HA-ITAKWETISIDDLKTEADEVLVVVDLFSFSILREENIYFDGlsSRDTVLNNIRKMRPDVFIQGIMNCSH-STSFLTRFREALFSYS 503
                           **.6999*******************999988888777776665668*************************.899************* PP

                  GRAS 266 alfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveee 354
                           alfd+l+a++pr+s+ r ++E+ +lg ++ n+vacega+ ++r e++++W+ r ++aG++++pl+ +++k  k  + k + + + + e+
  Sobic.005G230600.1.p 504 ALFDMLDATIPRDSKLRPVLEQNMLGHSVLNLVACEGADVVNRPEKYRRWQVRNQRAGLRQLPLKPNIVKVLKDKVMKDHHKDFFISED 592
                           *************************************************************************9999998788****** PP

                  GRAS 355 sgslvlgWkdrpLvsvSaW 373
                            ++l++gW++r  ++ S W
  Sobic.005G230600.1.p 593 GQWLLQGWMGRIFYAHSTW 611
                           ******************* PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098552.199214592IPR005202Transcription factor GRAS
PfamPF035144.8E-89242611IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 620 aa     Download sequence    Send to blast
MATTPEEFFM EGLMNLSPPS PSVLLDLPQM TNDVGQDSLC PDEIVLSYVS SVLMEDQSED  60
KLLCQDTDHP SLLQVQKPFS QILSSPSFST NSDNTVNRDN IEGARNLFQD CSGDQCTLRS  120
SLSIGALAAG SVLKGMEEAS RFLSEDNVFR KDQQLNQMTR ESGNSRVFKK RYNRDEDGEV  180
GRAYKVFMMM EELEEMFDKM MLRGYETCIE EMKKLRITKA DEAKNKKKGY NKRRSNVVDL  240
YTLLINCAQA VSASNFRTAH ELLKQIKQHA SATGDATQRL AQCFAKGLEA RLMGTGRQLW  300
QLLTLEQPLA IEYLKAYNLY MATCSFNRVA LFFNVMTIEH AMVGKSKLHI VDYGPHHGFQ  360
WAGLLRWMAN REGGPPEVRI TAISRLQPRS CPSEGTDDTG RRLDKCAREF GVPFKFHAIT  420
AKWETISIDD LKTEADEVLV VVDLFSFSIL REENIYFDGL SSRDTVLNNI RKMRPDVFIQ  480
GIMNCSHSTS FLTRFREALF SYSALFDMLD ATIPRDSKLR PVLEQNMLGH SVLNLVACEG  540
ADVVNRPEKY RRWQVRNQRA GLRQLPLKPN IVKVLKDKVM KDHHKDFFIS EDGQWLLQGW  600
MGRIFYAHST WVAEDTISE*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A7e-3824657025337Protein SCARECROW
5b3h_A6e-3824657024336Protein SCARECROW
5b3h_D6e-3824657024336Protein SCARECROW
Search in ModeBase
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.005G230600.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021317864.10.0scarecrow-like protein 9
TrEMBLA0A1B6PU830.0A0A1B6PU83_SORBI; Uncharacterized protein
STRINGSb05g027795.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP23131297
Representative plantOGRP1080039
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G29060.11e-110GRAS family protein