PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG014574t1
Common Name TCM_014574
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 520aa    MW: 57510.8 Da    PI: 6.4608
Description GRAS family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG014574t1  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value   Start  End  HMM Start  HMM End
1    GRAS    373    3.8e-114  147    518  1          374
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlarsvs.elykal.ppsetseknsseelaalklfsevsPil 91 
                       lv+lL++cAeav+ +d++ a+alL++l+ +a   g+++qR+a++f+++La rla  ++ +++  + p+ +  + +s+++  al+l +e++P +
  Thecc1EG014574t1 147 LVQLLIACAEAVACRDKSHASALLSELRANALVFGSSFQRVASCFVQGLADRLALVQPlGTVGLVaPVMNIMDISSDKKEEALRLVYEICPHI 239
                       689**************************************************9944413333332333444444899999************ PP

              GRAS  92 kfshltaNqaIleavegeervHiiDfdis....qGlQWpaLlqaLasRpegpp.slRiTgvgspesgskeeleetgerLakfAeelgvpfefn 179
                       +f+h++aN+ Ilea+ege+ vH++D++++    +G QW  L+q+La+R++++p +lRiT+vg     s ++ + +g+ L+ +A+ lg+++ef+
  Thecc1EG014574t1 240 QFGHFVANSSILEAFEGESFVHVVDLGMTlglpHGHQWRHLIQSLANRAGKAPsRLRITAVGL----SDHRFHIIGQELEAYAKDLGMNLEFS 328
                       *************************98873333788*************877769********....99************************ PP

              GRAS 180 vlvakrledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvvveqeadhnsesFlerflealeyysalfdsle 272
                       v v ++le+l++e+++v  gE+l+Vn++lqlh +++es    +   +vL+++ +lsPkv+v+veq+++hn++ Fl rf+eal+yysa+fdsl+
  Thecc1EG014574t1 329 V-VKSNLENLRPEDIKVFDGEVLVVNSILQLHCVVKESRGALN---SVLQMIHELSPKVLVLVEQDSSHNGPFFLGRFMEALHYYSAIFDSLD 417
                       9.799******************************88877777...8********************************************** PP

              GRAS 273 aklpreseerikvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkd 364
                       a lp+ +  r+k+E++++++ei+n+v+ceg  r+erhe++ +Wr+r+++aGF+++pl+  +++qak  l k k  +gy+v e++g+lvlgWk+
  Thecc1EG014574t1 418 AMLPKYDTRRAKMEQFYFAEEIKNIVSCEGPGRVERHERVDQWRRRMSRAGFQAAPLR--MMTQAKQWLGKNKvCEGYTVVEDKGCLVLGWKS 508
                       ********************************************************96..89***********9******************* PP

              GRAS 365 rpLvsvSaWr 374
                       +p+v++S+W+
  Thecc1EG014574t1 509 KPIVAASCWK 518
                       *********8 PP
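
The query coordinates printed beside each alignment block can be sanity-checked: the number of non-gap residues in a query row should equal End - Start + 1. The following is a minimal Python sketch of that check, assuming gaps are written as `-` or `.` as in the blocks above. The function names are illustrative and are not part of PlantTFDB.

```python
def ungapped_length(row: str) -> int:
    """Count residues in an alignment row, ignoring '-' and '.' gap characters."""
    return sum(1 for ch in row if ch not in "-.")


def coordinates_consistent(start: int, row: str, end: int) -> bool:
    """Return True when the row's residue count matches its start/end labels."""
    return ungapped_length(row) == end - start + 1


# Final query block of this record: residues 509 to 518.
print(coordinates_consistent(509, "KPIVAASCWK", 518))
```

The same check can be run on the longer blocks by pasting each query row as the `row` argument.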

Protein Features
Database         Entry ID  E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50985   58.739    121    499  IPR005202    Transcription factor GRAS
Pfam             PF03514   1.3E-111  147    518  IPR005202    Transcription factor GRAS
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
Sequence
Protein Sequence    Length: 520 aa
MGHGFLANEF QKTDRIDEVI GLDLELSAMA FCYQPFMPIM GDNACGWSLP FSGEIRDTKR  60
LRRTISIPES IGSSGSLSSG GNSDSSLSRS GSTSSLNSFS RLHFRDHVLT YNQRYLAAEA  120
VEEAAAAMIS SEESGGEEDE TADGMRLVQL LIACAEAVAC RDKSHASALL SELRANALVF  180
GSSFQRVASC FVQGLADRLA LVQPLGTVGL VAPVMNIMDI SSDKKEEALR LVYEICPHIQ  240
FGHFVANSSI LEAFEGESFV HVVDLGMTLG LPHGHQWRHL IQSLANRAGK APSRLRITAV  300
GLSDHRFHII GQELEAYAKD LGMNLEFSVV KSNLENLRPE DIKVFDGEVL VVNSILQLHC  360
VVKESRGALN SVLQMIHELS PKVLVLVEQD SSHNGPFFLG RFMEALHYYS AIFDSLDAML  420
PKYDTRRAKM EQFYFAEEIK NIVSCEGPGR VERHERVDQW RRRMSRAGFQ AAPLRMMTQA  480
KQWLGKNKVC EGYTVVEDKG CLVLGWKSKP IVAASCWKC*
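
The sequence block above interleaves residues with spacing and running position counts, and ends with `*` for the stop codon. The following is a minimal Python sketch for recovering a plain residue string from such a block. `parse_numbered_sequence` is a hypothetical helper name, and the sample uses only the first and last lines of this record.

```python
import re


def parse_numbered_sequence(block: str) -> str:
    """Strip whitespace, position counts, and a trailing '*' from a sequence block."""
    residues = re.sub(r"[\s\d]", "", block)  # drop spacing and the running counts
    return residues.rstrip("*")              # '*' marks the stop codon, not a residue


# First and last lines of the record above (middle lines omitted for brevity).
sample = """MGHGFLANEF QKTDRIDEVI GLDLELSAMA FCYQPFMPIM GDNACGWSLP FSGEIRDTKR  60
KQWLGKNKVC EGYTVVEDKG CLVLGWKSKP IVAASCWKC*"""

sequence = parse_numbered_sequence(sample)
print(sequence[:10], len(sequence))
```

Applied to the full block, the resulting residue count can be compared against the listed length, keeping in mind that the trailing `*` may or may not be included in that figure.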
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5b3g_A  5e-60   154          517        26         378      Protein SCARECROW
5b3h_A  4e-60   154          517        25         377      Protein SCARECROW
5b3h_D  4e-60   154          517        25         377      Protein SCARECROW
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007037896.2  0.0      PREDICTED: DELLA protein RGL1
TrEMBL  A0A061FZS1      0.0      A0A061FZS1_THECC; GRAS family transcription factor
STRING  EOY22397        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM16018814
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G66350.1  4e-77    RGA-like 1
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]