PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG004818t1
Common NameTCM_004818
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family GRAS
Protein Properties Length: 566aa    MW: 63796.7 Da    PI: 6.5275
Description GRAS family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG004818t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1GRAS161.38e-502224581241
              GRAS   1 lvelLlecAeavssgdlelaqalLarlselaspdgdpmqRlaayfteALaarlar.svselykalppsetseknsseelaalkl.fsevsPil 91 
                       lv+lLl++Ae v  +++e+a++lL+r + +as++++p+qR++  f+eAL++r+ + ++  +++ l+++ ++e ++   ++++++  ++  P++
  Thecc1EG004818t1 222 LVHLLLAAAEKVGYEQFERASRLLSRCEWIASERANPVQRVVYNFAEALRERIDKgTGRIISNELEAKVKNEIDHGLGTNVTSVsVHQYIPFV 314
                       68*****************************************************67777777788777776665444555555699****** PP

              GRAS  92 kfshltaNqaIleavegeervHiiDfdisqGlQWpaLlqaLasRp.egppslRiTgvgspesgskeeleetgerLakfAeelgvpfefnvlva 183
                       ++ ++   qaI e v++++++H+iD++i  G+QW  L+qaLa+R+      l+iT+v+    + ++++ee g+rL+++A++l++pf+f+v ++
  Thecc1EG004818t1 315 QVMQFSGIQAIIENVASASKIHVIDLQIRSGVQWTGLMQALAEREqCRVELLKITAVEL---VGNQKIEESGKRLESVAASLNLPFSFRVTYV 404
                       *********************************************444569*******9...789**************************** PP

              GRAS 184 krledleleeLrvkpgEalaVnlvlqlhrlldesvsleserdevLklvkslsPkvvvv 241
                       ++++d++ e +++ ++E+laV + l l +l+ ++ +le+    +++++k+l+P ++v+
  Thecc1EG004818t1 405 DDMKDIKEELFKIGSDESLAVYCPLVLRTLVWRPSCLEN----LMRVIKNLNPVIMVL 458
                       *******************************99999999....*************98 PP

2GRAS42.88.2e-14461551284374
              GRAS 284 kvErellgreivnvvacegaerrerhetlekWrerleeaGFkpvplsekaakqaklllrkvk.sdgyrveeesgslvlgWkdrpLvsvSaWr 374
                       k E + + + i+++va+eg+er++r  ++e W++ + ++ + ++ +s++   qa+l++++++     ++    +slv gWk+ p+ svSaW+
  Thecc1EG004818t1 461 KTESV-FYNGIRSIVAMEGDERIARSVKMEVWSAFFARFRMVELGFSDSSLYQANLVIKRFPcASSCTLGRIGKSLVVGWKGTPVHSVSAWK 551
                       55655.9999****************************************************9999*************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5098525.918196565IPR005202Transcription factor GRAS
PfamPF035142.8E-47222458IPR005202Transcription factor GRAS
PfamPF035142.8E-11461551IPR005202Transcription factor GRAS
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
Sequence ? help Back to Top
Protein Sequence    Length: 566 aa     Download sequence    Send to blast
MTSFHNSKMA SGLFSFSTPF DFSGIQGSYN LEDYEKEAAV IKGKEDHLFG VQEIGEDRDY  60
DPLFPEYGLY DQENVAKKPV DQGEQQQQLG QATKLDYLDE FDFSSAFSTT LAFQEFPRQE  120
NIQELVKPIK EIPSSLYLSS LELLSSYGNS FKNLKMGKSS GTGNETNARG HKKLSTEEIM  180
RVAGARYVQF SDQRYDDFSM VMHPFGHALS GLSADETKDV ELVHLLLAAA EKVGYEQFER  240
ASRLLSRCEW IASERANPVQ RVVYNFAEAL RERIDKGTGR IISNELEAKV KNEIDHGLGT  300
NVTSVSVHQY IPFVQVMQFS GIQAIIENVA SASKIHVIDL QIRSGVQWTG LMQALAEREQ  360
CRVELLKITA VELVGNQKIE ESGKRLESVA ASLNLPFSFR VTYVDDMKDI KEELFKIGSD  420
ESLAVYCPLV LRTLVWRPSC LENLMRVIKN LNPVIMVLRT KTESVFYNGI RSIVAMEGDE  480
RIARSVKMEV WSAFFARFRM VELGFSDSSL YQANLVIKRF PCASSCTLGR IGKSLVVGWK  540
GTPVHSVSAW KFSRDRGRAF AKYRF*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5hyz_A1e-1925255133375GRAS family transcription factor containing protein, expressed
Search in ModeBase
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017977950.10.0PREDICTED: DELLA protein RGL1
TrEMBLA0A061DR510.0A0A061DR51_THECC; GRAS family transcription factor, putative
STRINGEOX952690.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16533913
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G66350.17e-30RGA-like 1
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]