PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG038379t5
Common Name TCM_038379
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family CPP
Protein Properties Length: 668 aa    MW: 72495.2 Da    pI: 5.5711
Description CPP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Thecc1EG038379t5  genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     48.6   1.6e-15  530    568  3          41
               TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                       +k+CnCkk+kClk+YC+Cfaag +C++ C+C++C+N+ e
  Thecc1EG038379t5 530 CKRCNCKKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPE 568
                       89**********************************876 PP

2    TCR     49.2   1.1e-15  616    654  1          39
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                       ++k+gCnCk+s ClkkYCeC++a++ Cs  C+Ce+CkN 
  Thecc1EG038379t5 616 RHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 654
                       589***********************************6 PP
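
As a sanity check on the two hits above, the query coordinates, the HMM coordinates and the aligned query residues should all describe the same number of positions when the alignment contains no gaps. The following is a minimal Python sketch of that check. The `DomainHit` class and its method names are hypothetical helpers introduced here for illustration, and the values are copied by hand from the table rows and alignment rows of this record.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One signature-domain hit (hypothetical container, not a PlantTFDB API)."""
    number: int
    seq_start: int   # 1-based start on the query protein
    seq_end: int     # 1-based inclusive end on the query protein
    hmm_start: int   # 1-based start on the HMM
    hmm_end: int     # 1-based inclusive end on the HMM
    aligned: str     # query residues copied from the alignment row

    def seq_span(self) -> int:
        # Inclusive coordinates, so the span is end - start + 1.
        return self.seq_end - self.seq_start + 1

    def hmm_span(self) -> int:
        return self.hmm_end - self.hmm_start + 1

    def is_consistent(self) -> bool:
        # Ignore gap characters, then compare the residue count with the
        # span implied by the query coordinates.
        residues = self.aligned.replace("-", "")
        return len(residues) == self.seq_span()


# Values transcribed from the two TCR hits in this record.
HITS = [
    DomainHit(1, 530, 568, 3, 41, "CKRCNCKKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPE"),
    DomainHit(2, 616, 654, 1, 39, "RHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV"),
]

for hit in HITS:
    print(hit.number, hit.seq_span(), hit.hmm_span(), hit.is_consistent())
```

Both hits cover 39 query positions and 39 HMM positions, which matches the ungapped alignment rows shown above.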

Protein Features
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.6E-16  528    569  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   34.793   529    656  IPR005172    CRC domain
Pfam             PF03638   1.6E-11  531    566  IPR005172    CRC domain
SMART            SM01114   1.9E-17  616    657  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   2.1E-11  618    653  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 668 aa
MDSPEPSKAP ISSSSAAASI SASSPVQESP FSNYISSLSP IKHDKVPHVA QGFLGLNSPP  60
LVFTSPHINT LRRPQSSSVE VSQNGEGDKK NIDGPGSLER SVSELQQGLI TDIKKEDDTK  120
DSVSVQPSSS SGCVDEYLAD PVEADCANSE YFINLNCKES KNAFQSSVNG LLETKNLKFA  180
GKNDVGREID AAQLLSGQSE EGLERKLTSH VKPVKIEDEQ HAGQVKSDEC PEFGSDMFDL  240
SSQGKECKNL DAQKVVEDHE DRCDGFLQLL PGSLQRVQEY EDFAENFEGV AEVTVDSMTN  300
DLEASEHQRG MSRRCLQFGD AQPEATANCS SSSLANDMIT SRSVATTSET EGLGLSHVDL  360
SVISRKRQLV NLSQLAINMI PQHYGEKSSL TVSKPSGIGL HLNSIVNAIP MGRGGTASMK  420
LAVDSMGIQG IKSASVMSCQ SMENMQSCSD AFEKVLAAPQ DGTLEAKACV IPGSAASESL  480
CTMESIDCQT TLHRKRELSS EHGDSNEMFN QQSPKKKRKK SSNSTDGEGC KRCNCKKTKC  540
LKLYCDCFAA GIYCADPCSC QGCFNRPEYE DTVLETRQQI ESRNPLAFAP KIVQPVTEFP  600
VTSREDGNWK TPSSARHKRG CNCKRSMCLK KYCECYQANV GCSIGCRCEG CKNVFGKKEG  660
EFCKKYH*
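
The numbered layout above (blocks of 10 residues, with the position of the last residue at the end of each line) can be turned back into a contiguous string in order to look up the domain coordinates reported in this record. The following is a minimal Python sketch under stated assumptions: `parse_block` and `slice_1based` are hypothetical helper names, only the three lines covering positions 481 to 660 are embedded, and lines without a trailing position number (such as the final line of the full sequence) are skipped rather than handled.

```python
# Three lines copied from the sequence listing in this record (positions 481-660).
BLOCK = """\
CTMESIDCQT TLHRKRELSS EHGDSNEMFN QQSPKKKRKK SSNSTDGEGC KRCNCKKTKC  540
LKLYCDCFAA GIYCADPCSC QGCFNRPEYE DTVLETRQQI ESRNPLAFAP KIVQPVTEFP  600
VTSREDGNWK TPSSARHKRG CNCKRSMCLK KYCECYQANV GCSIGCRCEG CKNVFGKKEG  660
"""


def parse_block(block):
    """Return (first_position, residues) for a run of numbered sequence lines."""
    first = None
    chunks = []
    for line in block.splitlines():
        tokens = line.split()
        # Expect residue groups followed by the position of the last residue.
        if len(tokens) < 2 or not tokens[-1].isdigit():
            continue
        chunk = "".join(tokens[:-1])
        end = int(tokens[-1])
        if first is None:
            # Position of the first residue on the first parsed line.
            first = end - len(chunk) + 1
        chunks.append(chunk)
    if first is None:
        raise ValueError("no numbered sequence lines found")
    return first, "".join(chunks)


def slice_1based(first, residues, start, end):
    """Return residues start..end (1-based, inclusive) from a parsed block."""
    last = first + len(residues) - 1
    if start > end or start < first or end > last:
        raise ValueError("requested range is outside the parsed block")
    return residues[start - first : end - first + 1]


first, residues = parse_block(BLOCK)
domain_1 = slice_1based(first, residues, 530, 568)
domain_2 = slice_1based(first, residues, 616, 654)
print(first, domain_1, domain_2)
```

The two slices can be compared by eye with the query rows of the signature-domain alignments in this record.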
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  4e-18    531          654        12         121      Protein lin-54 homolog
5fd3_B  4e-18    531          654        12         121      Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    515    519  KKRKK
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007013826.2  0.0      PREDICTED: CRC domain-containing protein TSO1
Refseq  XP_007013827.2  0.0      PREDICTED: CRC domain-containing protein TSO1
Refseq  XP_017983279.1  0.0      PREDICTED: CRC domain-containing protein TSO1
TrEMBL  A0A061GWE4      0.0      A0A061GWE4_THECC; Tesmin/TSO1-like CXC domain-containing protein, putative isoform 5
STRING  EOY31445        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  3e-62    TESMIN/TSO1-like CXC 2
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]