PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sobic.004G291000.2.p
Organism Sorghum bicolor
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family GRAS
Protein Properties Length: 648 aa    MW: 67019.2 Da    pI: 7.0089
Description GRAS family protein
Gene Model
Gene Model ID         Type    Source  Coding Sequence
Sobic.004G291000.2.p  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    GRAS    147    1.8e-45  357    560  3          207
                  GRAS   3 elLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPil 91 
                           + L  +A+a ++g+   a+++Larl+++  p g+p+ R a+y++eAL   la+      +   +  ts+ +++ +laa+k+fs+ sP+l
  Sobic.004G291000.2.p 357 DELAVAAKAAEAGNSIGAREILARLNHQLPPLGKPFLRSASYLKEALLLALAE-----GHHGGCHLTSPLDVALKLAAYKTFSDHSPVL 440
                           67999************************************************.....666677788888899**************** PP

                  GRAS  92 kfshltaNqaIleavege..ervHiiDfdisqGlQWpaLlqaLasRp....egppslRiTgvgspesgskeeleetgerLakfAeelgv 174
                           +f+++ta qa+l++++g+  +++H+iDfd+  G+QW+++lq+La R+    +  p +++T+++s++s++  el+  ++++a+fA+ lg+
  Sobic.004G291000.2.p 441 QFTNFTATQALLDEIAGNtsSCIHVIDFDLAVGGQWASFLQELAHRRgaggAALPFVKLTAFVSAASHHPLELRLARDNIAQFAADLGI 529
                           *****************98899************************977555589********************************** PP

                  GRAS 175 pfefnvlvakrledleleeLrvkp.gEalaVnlv 207
                           pfef+++   + +++++ eL   + +E++aV l 
  Sobic.004G291000.2.p 530 PFEFSAI---SADTINPTELISATgDEVVAVVLP 560
                           ******8...778899999877777999999875 PP

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS50985   29.087   329    647  IPR005202    Transcription factor GRAS
Pfam             PF03514   6.1E-43  357    560  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0030154  Biological Process  cell differentiation
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 648 aa
MRAALFGAQR SGAADLVGGG RTLLWPEEGK AKAQLEPRSV PDCTRSPSPS NSTSTLSSYM  60
GGGGAADSTG GVAVVSGSSA AAAADAAKWG ASGKHGGGGK EDWAGGCALP PIPTGLDMGV  120
IGGGDSWDAM LGSAAAAGQD QTFLNWIIGA AGDLDQSGPP LPVHQQPLLD NVGFGFPAAD  180
PLGFSLDPHL GGVASDMSSP GAVSHAPNSG GGGGNKASSA FGLFSPESAS LQPPPPPVLF  240
HEGIDTKPPL LGAQPPGRLH QYQHQPTSAT TFFMPIPSFP NHNQQSPLVQ PPPKRHQSIG  300
DDLYLARNRL LPPPAGQGHA FPPLNGPALF QLQPSPPPPH GAMKTTAAEA AQQQLLDELA  360
VAAKAAEAGN SIGAREILAR LNHQLPPLGK PFLRSASYLK EALLLALAEG HHGGCHLTSP  420
LDVALKLAAY KTFSDHSPVL QFTNFTATQA LLDEIAGNTS SCIHVIDFDL AVGGQWASFL  480
QELAHRRGAG GAALPFVKLT AFVSAASHHP LELRLARDNI AQFAADLGIP FEFSAISADT  540
INPTELISAT GDEVVAVVLP AGCSARAPPL PAILRLACSS LIRLMLLALM LILPARLRSS  600
LFSQELRIQC LGGPRWTSQL HGEVHSQLLG SRLFRPAVWL RRRLTAS*
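The Signature Domain coordinates (357 to 560) can be mapped back onto this sequence. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, which is consistent with the alignment rows shown for the GRAS domain, and the helper names `clean_sequence` and `slice_region` are hypothetical, not part of any PlantTFDB tool.

```python
import re

# Sequence block as displayed in the record, including the position
# counters at the end of each row and the trailing '*' marker.
RAW = """
MRAALFGAQR SGAADLVGGG RTLLWPEEGK AKAQLEPRSV PDCTRSPSPS NSTSTLSSYM  60
GGGGAADSTG GVAVVSGSSA AAAADAAKWG ASGKHGGGGK EDWAGGCALP PIPTGLDMGV  120
IGGGDSWDAM LGSAAAAGQD QTFLNWIIGA AGDLDQSGPP LPVHQQPLLD NVGFGFPAAD  180
PLGFSLDPHL GGVASDMSSP GAVSHAPNSG GGGGNKASSA FGLFSPESAS LQPPPPPVLF  240
HEGIDTKPPL LGAQPPGRLH QYQHQPTSAT TFFMPIPSFP NHNQQSPLVQ PPPKRHQSIG  300
DDLYLARNRL LPPPAGQGHA FPPLNGPALF QLQPSPPPPH GAMKTTAAEA AQQQLLDELA  360
VAAKAAEAGN SIGAREILAR LNHQLPPLGK PFLRSASYLK EALLLALAEG HHGGCHLTSP  420
LDVALKLAAY KTFSDHSPVL QFTNFTATQA LLDEIAGNTS SCIHVIDFDL AVGGQWASFL  480
QELAHRRGAG GAALPFVKLT AFVSAASHHP LELRLARDNI AQFAADLGIP FEFSAISADT  540
INPTELISAT GDEVVAVVLP AGCSARAPPL PAILRLACSS LIRLMLLALM LILPARLRSS  600
LFSQELRIQC LGGPRWTSQL HGEVHSQLLG SRLFRPAVWL RRRLTAS*
"""


def clean_sequence(raw: str) -> str:
    """Keep only residue letters: drops counters, whitespace, and '*'."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW)
gras_domain = slice_region(protein, 357, 560)

print(len(gras_domain))                     # number of residues in the domain
print(gras_domain[:10], gras_domain[-10:])  # compare with the alignment rows
```

The first and last residues of the slice can be checked by eye against the query rows of the GRAS alignment, which begin at position 357 and end at position 560.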
3D Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  3e-11   362          557        26         217      Protein SCARECROW
Expression -- UniGene
UniGene ID  E-value  Expressed in
Sbi.20431   0.0      embryo
Cis-element
Source       Link
PlantRegMap  Sobic.004G291000.2.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  BT063504  0.0      BT063504.1 Zea mays full-length cDNA clone ZM_BFc0070G06 mRNA, complete cds.
GenBank  KJ727939  0.0      KJ727939.1 Zea mays clone pUT6034 GRAS transcription factor (GRAS80) mRNA, partial cds.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002454517.1  0.0      scarecrow-like protein 27
TrEMBL  A0A1Z5RPJ4      0.0      A0A1Z5RPJ4_SORBI; Uncharacterized protein
STRING  Sb04g032590.1   0.0      (Sorghum bicolor)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Monocots              OGMP20203796
Representative plant  OGRP17501242
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00150.1  1e-31    GRAS family protein