PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen05g031600.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family GRAS
Protein Properties    Length: 746 aa    MW: 84371.6 Da    pI: 6.2352
Description GRAS family protein
Gene Model
Gene Model ID    Type    Source    Coding Sequence
Sopen05g031600.1    genome    spenn    View CDS
Signature Domain
No.    Domain    Score    E-value    Start    End    HMM Start    HMM End
1    GRAS    382.4    5.2e-117    373    743    1    374
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       l++lL+ cA+av+ g+++ a++lL++++e +sp gd mqRla+yf+ +L+ar+a+s++++ykal +++ s    ++ l+a++l+  ++P+ ++
  Sopen05g031600.1 373 LRTLLTLCAQAVAVGNQRTANELLKQIRESSSPMGDGMQRLAHYFADGLEARMAGSGTHIYKALITRPVS---AADILKAYHLLLAACPFRTM 462
                       5789***************************************************************999...9******************* PP

              GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvak 184
                       s +  N++I++ +e++++vHiiD++i+ G+QWp L+q LasRp+gpp+lRiTg++ p++g   +e++eetg+rLa++Ae+++vpfefn+ +a+
  Sopen05g031600.1 463 SSFFSNKTIMNLAEKASTVHIIDIGIMWGFQWPGLIQRLASRPGGPPKLRITGIDFPNPGfrPAERVEETGRRLANYAESFKVPFEFNA-IAQ 554
                       ***********************************************************9*****************************.7** PP

              GRAS 185 rledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpr 277
                       ++e+++le+L++++gE+l+Vn+ ++  +llde+v ++s+rd +L+l+++l+P+v++    +  +n++ F  rf eal +ys+lfd+le+ +pr
  Sopen05g031600.1 555 KWETVKLEDLKINKGEVLVVNCLYRFRNLLDETVVVNSPRDVFLNLIRRLNPDVFIQGTVNGGYNAPFFISRFREALFHYSSLFDMLETIIPR 647
                       ********************************************************************************************* PP

              GRAS 278 eseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsv 370
                       e +er+ vE+ +lg+e++n++acegaer+er et+++W+ r+ +aGF+++pl+e+++       + ++++ + ++ +s++l++gWk+r  v++
  Sopen05g031600.1 648 EVHERMLVEKNILGQEAMNAIACEGAERIERPETYKQWQVRILKAGFRQLPLDEEIMRMTTERFKVYDKN-FIIDVDSEWLLQGWKGRIAVAL 739
                       **************************************************************99999955.********************** PP

              GRAS 371 SaWr 374
                       S W+
  Sopen05g031600.1 740 STWK 743
                       ***8 PP

Protein Features
Database    Entry ID    E-value    Start    End    InterPro ID    Description
PROSITE profile    PS50985    67.725    347    724    IPR005202    Transcription factor GRAS
Pfam    PF03514    1.8E-114    373    743    IPR005202    Transcription factor GRAS
Sequence
Protein Sequence    Length: 746 aa
MGQYHEAGSA VKLEDEDCSF FAYPNLINNL RVNDYFHDDY DPNLINNLRV SDNFVNRNVD  60
ISPFQSDVER NTLVPSTADD FHEDYDFSDG VLKYINQMLM EEDIEEKTCM FQESAALQAA  120
ERSFYEVIGE KYPSSTNHEK SSTLGQIERY AMGHYSGNDG RDGLLCPNWI LDLGEDDVSL  180
VPDDVALDST SRSSNSLSGT VPDVPVDSPV STLRIPDIFS DGESVMQFKK GVEEASKFLP  240
TGNSLLASVR YHVVGKELRY KERKDAVVKV DKYGEKQYTE RSRGKKNTFH EDVVDLTEGR  300
NNKQSAVFSE STVRSEMFDR VLLCSAGKNE SALREALQAI SRQNASKNGP SKGSNGKKLQ  360
RKKKGGKRDV VDLRTLLTLC AQAVAVGNQR TANELLKQIR ESSSPMGDGM QRLAHYFADG  420
LEARMAGSGT HIYKALITRP VSAADILKAY HLLLAACPFR TMSSFFSNKT IMNLAEKAST  480
VHIIDIGIMW GFQWPGLIQR LASRPGGPPK LRITGIDFPN PGFRPAERVE ETGRRLANYA  540
ESFKVPFEFN AIAQKWETVK LEDLKINKGE VLVVNCLYRF RNLLDETVVV NSPRDVFLNL  600
IRRLNPDVFI QGTVNGGYNA PFFISRFREA LFHYSSLFDM LETIIPREVH ERMLVEKNIL  660
GQEAMNAIAC EGAERIERPE TYKQWQVRIL KAGFRQLPLD EEIMRMTTER FKVYDKNFII  720
DVDSEWLLQG WKGRIAVALS TWKAAY
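
The sequence block above mixes residues with layout (spaces every 10 residues, a running position number at the end of each full line). As a minimal sketch, and not part of the PlantTFDB record itself, the following Python strips that layout and cross-checks the record's own figures: the stated length of 746 aa and the GRAS domain boundaries 373 to 743 from the Signature Domain table. The names `SEQUENCE_BLOCK`, `parse_sequence_block`, and `domain_slice` are hypothetical helpers introduced only for this illustration.

```python
import re

# Sequence block copied verbatim from the record; spaces and trailing
# position numbers are display layout, not part of the protein.
SEQUENCE_BLOCK = """
MGQYHEAGSA VKLEDEDCSF FAYPNLINNL RVNDYFHDDY DPNLINNLRV SDNFVNRNVD  60
ISPFQSDVER NTLVPSTADD FHEDYDFSDG VLKYINQMLM EEDIEEKTCM FQESAALQAA  120
ERSFYEVIGE KYPSSTNHEK SSTLGQIERY AMGHYSGNDG RDGLLCPNWI LDLGEDDVSL  180
VPDDVALDST SRSSNSLSGT VPDVPVDSPV STLRIPDIFS DGESVMQFKK GVEEASKFLP  240
TGNSLLASVR YHVVGKELRY KERKDAVVKV DKYGEKQYTE RSRGKKNTFH EDVVDLTEGR  300
NNKQSAVFSE STVRSEMFDR VLLCSAGKNE SALREALQAI SRQNASKNGP SKGSNGKKLQ  360
RKKKGGKRDV VDLRTLLTLC AQAVAVGNQR TANELLKQIR ESSSPMGDGM QRLAHYFADG  420
LEARMAGSGT HIYKALITRP VSAADILKAY HLLLAACPFR TMSSFFSNKT IMNLAEKAST  480
VHIIDIGIMW GFQWPGLIQR LASRPGGPPK LRITGIDFPN PGFRPAERVE ETGRRLANYA  540
ESFKVPFEFN AIAQKWETVK LEDLKINKGE VLVVNCLYRF RNLLDETVVV NSPRDVFLNL  600
IRRLNPDVFI QGTVNGGYNA PFFISRFREA LFHYSSLFDM LETIIPREVH ERMLVEKNIL  660
GQEAMNAIAC EGAERIERPE TYKQWQVRIL KAGFRQLPLD EEIMRMTTER FKVYDKNFII  720
DVDSEWLLQG WKGRIAVALS TWKAAY
"""


def parse_sequence_block(block: str) -> str:
    """Keep only residue letters, dropping spaces, newlines, and position numbers."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates as in the tables."""
    return seq[start - 1:end]


protein = parse_sequence_block(SEQUENCE_BLOCK)
gras_domain = domain_slice(protein, 373, 743)

# Report total length, domain length, and the first and last residues of the domain,
# which can be compared by eye with the alignment in the Signature Domain section.
print(len(protein), len(gras_domain), gras_domain[:5], gras_domain[-4:])
```

The domain table uses 1-based inclusive coordinates, so the slice subtracts one from the start only; the printed domain ends can be compared with the first and last query residues shown in the GRAS alignment.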
3D Structure
Structure
PDB ID    E-value    Query Start    Query End    Hit Start    Hit End    Description
5b3g_A    7e-53    380    742    26    378    Protein SCARECROW
5b3h_A    6e-53    380    742    25    377    Protein SCARECROW
5b3h_D    6e-53    380    742    25    377    Protein SCARECROW
Functional Description
Source    Description
UniProt    Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source    Upstream Regulator    Target Gene
PlantRegMap    Retrieve    -
Annotation -- Nucleotide
Source    Hit ID    E-value    Description
GenBank    HG975444    0.0    HG975444.1 Solanum pennellii chromosome ch05, complete genome.
Annotation -- Protein
Source    Hit ID    E-value    Description
Refseq    XP_015075133.1    0.0    scarecrow-like protein 30
Swissprot    Q9XE58    0.0    SCL14_ARATH; Scarecrow-like protein 14
TrEMBL    A0A3Q7GJU1    0.0    A0A3Q7GJU1_SOLLC; Uncharacterized protein
STRING    Solyc05g053090.1.1    0.0    (Solanum lycopersicum)
Orthologous Group
Lineage    Orthologous Group ID    Taxa Number    Gene Number
AsteridsOGEA27624191
Best hit in Arabidopsis thaliana
Hit ID    E-value    Description
AT2G37650.1    0.0    GRAS family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]