PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sopen04g005810.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family GRAS
Protein Properties Length: 731aa    MW: 82265.9 Da    PI: 5.5117
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sopen04g005810.1genomespennView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS3872.2e-1183597282374
              GRAS   2 velLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkfs 94 
                       ++ L+  A+av+++d + a+++L+++++++sp+gd mqRla+yf+++L+ar+a+s++++yk l + +ts    ++ l+a++lf  ++P+ k+s
  Sopen04g005810.1 359 RTILTLGAQAVAADDRRTANEFLKQIRQNSSPTGDGMQRLAHYFANGLEARMAGSGTQIYKDLISMPTS---AADILKAYQLFLAACPFRKLS 448
                       5678888********************************************************999998...9******************** PP

              GRAS  95 hltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvakr 185
                       ++  N++I++ +e++++vHiiDf+i++G+QWp+++q L+sRp+gpp+lRiTg++ p++g   +e++eetg+rLa++Ae+++vpfef + +a++
  Sopen04g005810.1 449 NFFSNKTIMNVAETASTVHIIDFGIMYGFQWPCFIQRLSSRPGGPPKLRITGIDFPNPGfrPAERVEETGRRLADYAESFNVPFEFIA-IAQK 540
                       **********************************************************9*****************************.7*** PP

              GRAS 186 ledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpre 278
                       +e++++e+L+++++E+laVn++++  +llde+v ++s+rd vL+l+++l+P+v +    +  +n++ F +rf eal +ys++fd+lea++pre
  Sopen04g005810.1 541 WETIKVEDLKIQKDEVLAVNCMYRFRNLLDETVVVNSPRDIVLNLIRKLNPDVYIQGIVNGAYNAPFFITRFREALFHYSSVFDMLEANIPRE 633
                       ********************************************************************************************* PP

              GRAS 279 seerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsvS 371
                         er  vE+ ++gre++nvvace aer+er et+++W+ r ++aGF+++pl+e++   ak  ++ +++d + ++ + ++l++gWk+r ++++S
  Sopen04g005810.1 634 IPERLLVEKLIFGREAMNVVACEAAERIERPETYKQWQVRNTRAGFRQLPLNEEILRMAKDRVKAYHKD-FVIDVDGHWLLQGWKGRIMYAAS 725
                       *******************************************************************77.*********************** PP

              GRAS 372 aWr 374
                        W+
  Sopen04g005810.1 726 TWK 728
                       **8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098567.859332709IPR005202Transcription factor GRAS
PfamPF035147.6E-116359728IPR005202Transcription factor GRAS
Sequence ? help Back to Top
Protein Sequence    Length: 731 aa     Download sequence    Send to blast
MVMDSRNYKG LYDATSGIQL KDEDDDKSFF QDLNLINHLR VSDALVERNL EPGDFVPSAM  60
DNSHEDYDFS DVVLKYISQL LMEENIEEKT CMFQESAALQ AAERSFYEVI GEKYPLSPIL  120
DLGQDGRRGV DCSSNNYYSC GSDVTDGLLC PNWNPDLGDT DASHTQQFVV DSGTSQSSLS  180
SPSSSGTVTD AHVDSPVNSI QIPDIFSDSE SIMQFKKGVE EASKFLPTGN SLLLDVKYNV  240
VVKEDNENGK YAVEKMEDRG KQKSPEASRG KKNIHHDDVD VMEARSNKQS AVFYESAVRS  300
DLFDKVLLCS GGKNESALRE SWQVVSSKHA PENVLPKGSN GRKSRGKKQG GKRDAVDLRT  360
ILTLGAQAVA ADDRRTANEF LKQIRQNSSP TGDGMQRLAH YFANGLEARM AGSGTQIYKD  420
LISMPTSAAD ILKAYQLFLA ACPFRKLSNF FSNKTIMNVA ETASTVHIID FGIMYGFQWP  480
CFIQRLSSRP GGPPKLRITG IDFPNPGFRP AERVEETGRR LADYAESFNV PFEFIAIAQK  540
WETIKVEDLK IQKDEVLAVN CMYRFRNLLD ETVVVNSPRD IVLNLIRKLN PDVYIQGIVN  600
GAYNAPFFIT RFREALFHYS SVFDMLEANI PREIPERLLV EKLIFGREAM NVVACEAAER  660
IERPETYKQW QVRNTRAGFR QLPLNEEILR MAKDRVKAYH KDFVIDVDGH WLLQGWKGRI  720
MYAASTWKGA L
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5b3g_A2e-543427273378Protein SCARECROW
5b3h_A2e-5436672726377Protein SCARECROW
5b3h_D2e-5436672726377Protein SCARECROW
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754430.0HG975443.1 Solanum pennellii chromosome ch04, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015073886.10.0scarecrow-like protein 14
SwissprotQ9XE580.0SCL14_ARATH; Scarecrow-like protein 14
TrEMBLA0A3Q7G0P30.0A0A3Q7G0P3_SOLLC; Uncharacterized protein
STRINGPGSC0003DMT4000162580.0(Solanum tuberosum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA27624191
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G07530.10.0SCARECROW-like 14
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]