PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG041813t1
Common Name TCM_041813
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 667 aa    MW: 75260.5 Da    pI: 5.28
Description GRAS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG041813t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    364.5  1.5e-111  291    661  1          374
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvselykalppsetseknsseelaalklfsevsPilkf 93 
                       l++lL++cAeav s+d+++a+ +L ++++++sp g++ qR a+yf++AL+arla+++se y+al +++ +    + ++ a kl+  ++P++k+
  Thecc1EG041813t1 291 LRTLLIQCAEAVGSNDFRNANDFLMQIRNHSSPFGNASQRKAHYFAKALEARLAGTGSEEYAALVSKRIP----TASVEACKLLISACPFMKV 379
                       6789***********************************************************9999997....5677899************ PP

              GRAS  94 shltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRpegppslRiTgvgspesg..skeeleetgerLakfAeelgvpfefnvlvak 184
                       s++ + ++I++ +++++r+Hii f++ +Gl+W +L+q L++Rp++pp+lRiTg++ p++g  s+ ++ee g+ La+ ++++++pfe+n  +++
  Thecc1EG041813t1 380 SNFFTTEMIMKLAKKATRIHIIHFGVPYGLKWSSLIQRLSTRPGNPPTLRITGIDLPQPGveSAGRVEEFGHFLANCCKQFNIPFEYNG-ITQ 471
                       ***********************************************************9999**************************.799 PP

              GRAS 185 rledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsleaklpr 277
                       ++e+++le+L++ ++E+++Vn+ ++l +++de ++l+s+rd+vL+lv++++P+++++   +  +n++ F+ rf eal+y+s++fd+le+ +p+
  Thecc1EG041813t1 472 KWESIQLEDLKIAKDEVVVVNCLYRLRHIVDEMADLSSPRDTVLNLVRKINPNIFIHGIVNGAYNAPFFVSRFREALYYFSTMFDMLEEIAPS 564
                       ********************************************************************************************* PP

              GRAS 278 eseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvksdgyrveeesgslvlgWkdrpLvsv 370
                       e++er+++E+ + g+ei nvvaceg+er+er et+++W+ r  +aG++++pl++++ ++ak+ ++ ++ + + v+e+++++++gWk+r L+++
  Thecc1EG041813t1 565 EDQERMVLEENMYGKEILNVVACEGSERIERPETYKQWQVRNLRAGLRQLPLKQEILNSAKAHVKLHYHKDFLVDEDKNWILQGWKGRILFAL 657
                       ********************************************************************888********************** PP

              GRAS 371 SaWr 374
                       S+Wr
  Thecc1EG041813t1 658 SFWR 661
                       ***8 PP
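
The Start and End columns of the domain table can be read as 1-based, inclusive positions on the protein, which is consistent with the alignment starting at residue 291 with LRTLLIQCAE. The following is a minimal sketch of slicing such a region in Python, not part of the database itself: the helper name `extract_region` and its `offset` parameter are hypothetical, and the input is residues 241-300 as listed in the Sequence section of this record.

```python
def extract_region(sequence: str, start: int, end: int, offset: int = 0) -> str:
    """Slice a 1-based, inclusive [start, end] region out of `sequence`.

    `offset` is the number of residues of the full protein that precede
    sequence[0]; it is 0 when the whole protein is supplied.
    (Hypothetical helper, for illustration only.)
    """
    if not (offset < start <= end <= offset + len(sequence)):
        raise ValueError("region falls outside the supplied sequence")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return sequence[start - 1 - offset : end - offset]


# Residues 241-300 of Thecc1EG041813t1, copied from the Sequence section.
chunk = (
    "FKNDVSAAAT" "ADIAWRNEAR" "MIMNACGEPK"
    "RNSAGTKKRR" "RSVNKRNLVD" "LRTLLIQCAE"
)

# First ten residues of the GRAS domain hit (Start = 291).
domain_head = extract_region(chunk, 291, 300, offset=240)
print(domain_head)  # LRTLLIQCAE
```

With the full 1-based protein sequence in hand, `extract_region(protein, 291, 661)` would return the whole region reported for the GRAS hit.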

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   62.042    265    641  IPR005202    Transcription factor GRAS
Pfam             PF03514   5.2E-109  291    661  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 667 aa
MNPSSYSTVA DPADESNFPN AFYITTPIFT EDANDNTFLL QDSTVGAAES SFADILAEKY  60
PLSTTLPFPI CSDLDWSSCI TVANNLVQSD FICDLARFEV SSMLSPQVEH AFQDYPQPVS  120
QSLFPDNSFG EAVDVPVIDP VNKSLVPDLV NQDLCCESLY DCQFLEGGVK ETQKLIAYGD  180
HMSAEEESPV AAERETSEEM DHSFCKTRKV ETRLQICGEG EIGREHGRVV CPEQFEQPDM  240
FKNDVSAAAT ADIAWRNEAR MIMNACGEPK RNSAGTKKRR RSVNKRNLVD LRTLLIQCAE  300
AVGSNDFRNA NDFLMQIRNH SSPFGNASQR KAHYFAKALE ARLAGTGSEE YAALVSKRIP  360
TASVEACKLL ISACPFMKVS NFFTTEMIMK LAKKATRIHI IHFGVPYGLK WSSLIQRLST  420
RPGNPPTLRI TGIDLPQPGV ESAGRVEEFG HFLANCCKQF NIPFEYNGIT QKWESIQLED  480
LKIAKDEVVV VNCLYRLRHI VDEMADLSSP RDTVLNLVRK INPNIFIHGI VNGAYNAPFF  540
VSRFREALYY FSTMFDMLEE IAPSEDQERM VLEENMYGKE ILNVVACEGS ERIERPETYK  600
QWQVRNLRAG LRQLPLKQEI LNSAKAHVKL HYHKDFLVDE DKNWILQGWK GRILFALSFW  660
RPSKEP*
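
The sequence block above is laid out in groups of ten residues with a running position counter at the end of each line. A minimal Python sketch for recovering the bare residue string from that layout follows. The function name `parse_sequence_block` is hypothetical, the sample is only the first and last lines of the record (an excerpt for brevity), and treating the trailing `*` as a terminator symbol to be dropped is an assumption.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Return the bare residue string from a position-numbered sequence block.

    Everything that is not a letter is dropped: the spaces between groups,
    the position counter at the end of each line, and a trailing '*'
    (assumed here to be a terminator symbol rather than a residue).
    """
    parts = []
    for line in block.splitlines():
        parts.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(parts).upper()


# Excerpt only: the first and last lines of the sequence block in this record.
sample = (
    "MNPSSYSTVA DPADESNFPN AFYITTPIFT EDANDNTFLL QDSTVGAAES SFADILAEKY  60\n"
    "RPSKEP*"
)

seq = parse_sequence_block(sample)
print(len(seq))   # 66 residues in this two-line excerpt
print(seq[-6:])   # RPSKEP
```

Applied to the full block, the same function yields a string suitable for writing out as FASTA or passing to a sequence-analysis library.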
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  8e-47    297          662        25         380      Protein SCARECROW
5b3h_A  8e-47    297          662        24         379      Protein SCARECROW
5b3h_D  8e-47    297          662        24         379      Protein SCARECROW
Functional Description
Source   Description
UniProt  Probable transcription factor involved in plant development. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_017982430.1  0.0      PREDICTED: scarecrow-like protein 9
Swissprot  Q9XE58          1e-150   SCL14_ARATH; Scarecrow-like protein 14
TrEMBL     A0A061GX40      0.0      A0A061GX40_THECC; GRAS family transcription factor, putative
STRING     EOY33998        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G07530.1  1e-153   SCARECROW-like 14
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]