PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.010G003700.1.p
Common NameSb10g000520, SORBIDRAFT_10g000520
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family GRAS
Protein Properties Length: 514aa    MW: 54697.3 Da    PI: 5.4041
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.010G003700.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS224.35.9e-691535122374
                  GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgd.......pmqRlaayfteALaarlarsvselykalppsetseknsseelaalkl 83 
                           v+ Lle+A+  ++gd + a+++L+rl++++ p+ +       p+ R+aa++  AL + l+     l     +s +++ +++ ++aa+++
  Sobic.010G003700.1.p 153 VDELLEAARRADAGDSAGAREILTRLNHRRLPSMSlpghphpPLLRAAAHLRDALLRLLVA--LPLP--HGSSVSTPLDVALKVAAHRA 237
                           678**************************77776666778889*****************9..2222..2333333445899999**** PP

                  GRAS  84 fsevsPilkfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRp.....egppslRiTgvgspesgskeeleetgerLak 167
                           + ++sP+++f+ +t  qa+l+ +  ++rvH++D+d++ G+ W++L+q+La +      ++pp +++T+++sp s +  el+ t+e L++
  Sobic.010G003700.1.p 238 LADASPTVQFASFTSTQALLDVLGAARRVHVVDLDVGFGGRWAPLMQELALQWrrapvSPPPCFKVTALVSPGSAHPLELHLTHEGLTR 326
                           **************************************************5433368899***************************** PP

                  GRAS 168 fAeelgvpfefnvlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadh.nsesFle 255
                           fA+elg++fefn+++ +  + l + eL+v pgEa+aV+l +   + l          +++L++vk+l+P +vv+v+    h ++ ++ +
  Sobic.010G003700.1.p 327 FAAELGISFEFNAVAFDPSDPLPPTELSVAPGEAVAVHLPIGSGTPL----------PTTLRVVKQLRPAIVVCVDDHGCHrGDLPLSH 405
                           *****************************************666655..........56****************9986666******* PP

                  GRAS 256 rflealeyysalfdsleaklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkv 344
                           + l+ ++  +a+++sl+a  ++ +++ + +E+ +l+++++ ++             +  W+ +l +aGF   pls+ a++qa++l++++
  Sobic.010G003700.1.p 406 HALNVVRSTAAFLESLDAA-GAPADAVARLEQYVLRPRVERLLLGDR---------MPPWQTMLASAGFA--PLSNAAEAQAECLVSRT 482
                           ****************887.677888999************998654.........56***********7..589************** PP

                  GRAS 345 ksdgyrveeesgslvlgWkdrpLvsvSaWr 374
                           +  g++ve+++++l l W++ +Lv+vSaWr
  Sobic.010G003700.1.p 483 PTPGFHVEKRQAALALRWQESELVMVSAWR 512
                           *****************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098531.735126492IPR005202Transcription factor GRAS
PfamPF035142.0E-66153512IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0030154Biological Processcell differentiation
GO:0005634Cellular Componentnucleus
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 514 aa     Download sequence    Send to blast
MSELQDDGAG GNGQDETAPA AAFASPAKRL RSRSPASEPT SVLYNRSPSP PTSSSLASSS  60
APEPPPISAE DWEWEAVLDM AAPPAAARSQ DTTSFLRWIM DADAQVDAFD PFLPPPPCQE  120
TAAVEPFLHP QLQLPLPLPV AQEDLDLEPG VAVDELLEAA RRADAGDSAG AREILTRLNH  180
RRLPSMSLPG HPHPPLLRAA AHLRDALLRL LVALPLPHGS SVSTPLDVAL KVAAHRALAD  240
ASPTVQFASF TSTQALLDVL GAARRVHVVD LDVGFGGRWA PLMQELALQW RRAPVSPPPC  300
FKVTALVSPG SAHPLELHLT HEGLTRFAAE LGISFEFNAV AFDPSDPLPP TELSVAPGEA  360
VAVHLPIGSG TPLPTTLRVV KQLRPAIVVC VDDHGCHRGD LPLSHHALNV VRSTAAFLES  420
LDAAGAPADA VARLEQYVLR PRVERLLLGD RMPPWQTMLA SAGFAPLSNA AEAQAECLVS  480
RTPTPGFHVE KRQAALALRW QESELVMVSA WRC*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A3e-2523551280375GRAS family transcription factor containing protein, expressed
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Sbi.26080.0callus| shoot
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.010G003700.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankBT0394660.0BT039466.1 Zea mays full-length cDNA clone ZM_BFc0014N06 mRNA, complete cds.
GenBankEU9743700.0EU974370.1 Zea mays clone 447391 scarecrow-like 6 mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002436328.10.0scarecrow-like protein 6
TrEMBLC5Z2460.0C5Z246_SORBI; Uncharacterized protein
STRINGSb10g000520.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP145372224
Representative plantOGRP1407444
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00150.15e-72GRAS family protein