PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG038379t6
Common Name TCM_038379
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family CPP
Protein Properties Length: 669aa    MW: 72836.6 Da    PI: 5.0659
Description CPP family protein
Gene Model
Gene Model ID       Type      Source    Coding Sequence
Thecc1EG038379t6    genome    CGD       View CDS
Signature Domain
Signature Domain
No.   Domain   Score   E-value   Start   End   HMM Start   HMM End
1     TCR      48.6    1.6e-15   530     568   3           41
               TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                       +k+CnCkk+kClk+YC+Cfaag +C++ C+C++C+N+ e
  Thecc1EG038379t6 530 CKRCNCKKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPE 568
                       89**********************************876 PP

2     TCR      29.6    1.4e-09   616     637   1           22
               TCR   1 kekkgCnCkkskClkkYCeCfa 22 
                       ++k+gCnCk+s ClkkYCeC++
  Thecc1EG038379t6 616 RHKRGCNCKRSMCLKKYCECYQ 637
                       589******************8 PP

Protein Features
3D Structure
Database          Entry ID   E-value   Start   End   InterPro ID   Description
SMART             SM01114    1.6E-16   528     569   IPR033467     Tesmin/TSO1-like CXC domain
PROSITE profile   PS51634    26.111    529     665   IPR005172     CRC domain
Pfam              PF03638    1.6E-11   531     566   IPR005172     CRC domain
SMART             SM01114    6.4E-4    616     646   IPR033467     Tesmin/TSO1-like CXC domain
Pfam              PF03638    8.4E-6    618     637   IPR005172     CRC domain
Sequence
Protein Sequence    Length: 669 aa
MDSPEPSKAP ISSSSAAASI SASSPVQESP FSNYISSLSP IKHDKVPHVA QGFLGLNSPP  60
LVFTSPHINT LRRPQSSSVE VSQNGEGDKK NIDGPGSLER SVSELQQGLI TDIKKEDDTK  120
DSVSVQPSSS SGCVDEYLAD PVEADCANSE YFINLNCKES KNAFQSSVNG LLETKNLKFA  180
GKNDVGREID AAQLLSGQSE EGLERKLTSH VKPVKIEDEQ HAGQVKSDEC PEFGSDMFDL  240
SSQGKECKNL DAQKVVEDHE DRCDGFLQLL PGSLQRVQEY EDFAENFEGV AEVTVDSMTN  300
DLEASEHQRG MSRRCLQFGD AQPEATANCS SSSLANDMIT SRSVATTSET EGLGLSHVDL  360
SVISRKRQLV NLSQLAINMI PQHYGEKSSL TVSKPSGIGL HLNSIVNAIP MGRGGTASMK  420
LAVDSMGIQG IKSASVMSCQ SMENMQSCSD AFEKVLAAPQ DGTLEAKACV IPGSAASESL  480
CTMESIDCQT TLHRKRELSS EHGDSNEMFN QQSPKKKRKK SSNSTDGEGC KRCNCKKTKC  540
LKLYCDCFAA GIYCADPCSC QGCFNRPEYE DTVLETRQQI ESRNPLAFAP KIVQPVTEFP  600
VTSREDGNWK TPSSARHKRG CNCKRSMCLK KYCECYQVWY LLLPESSVTL TMSSLLVYCC  660
WSYESSLY*
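
The coordinates in the Signature Domain table can be checked against the sequence block above. The following is a minimal sketch in Python, assuming the Start and End columns are 1-based and inclusive; the helper names `clean_sequence` and `region` are illustrative and are not part of PlantTFDB.

```python
import re

# Sequence block copied from this record, including the position
# counters at the end of each line and the trailing "*" symbol.
RAW_SEQUENCE = """
MDSPEPSKAP ISSSSAAASI SASSPVQESP FSNYISSLSP IKHDKVPHVA QGFLGLNSPP  60
LVFTSPHINT LRRPQSSSVE VSQNGEGDKK NIDGPGSLER SVSELQQGLI TDIKKEDDTK  120
DSVSVQPSSS SGCVDEYLAD PVEADCANSE YFINLNCKES KNAFQSSVNG LLETKNLKFA  180
GKNDVGREID AAQLLSGQSE EGLERKLTSH VKPVKIEDEQ HAGQVKSDEC PEFGSDMFDL  240
SSQGKECKNL DAQKVVEDHE DRCDGFLQLL PGSLQRVQEY EDFAENFEGV AEVTVDSMTN  300
DLEASEHQRG MSRRCLQFGD AQPEATANCS SSSLANDMIT SRSVATTSET EGLGLSHVDL  360
SVISRKRQLV NLSQLAINMI PQHYGEKSSL TVSKPSGIGL HLNSIVNAIP MGRGGTASMK  420
LAVDSMGIQG IKSASVMSCQ SMENMQSCSD AFEKVLAAPQ DGTLEAKACV IPGSAASESL  480
CTMESIDCQT TLHRKRELSS EHGDSNEMFN QQSPKKKRKK SSNSTDGEGC KRCNCKKTKC  540
LKLYCDCFAA GIYCADPCSC QGCFNRPEYE DTVLETRQQI ESRNPLAFAP KIVQPVTEFP  600
VTSREDGNWK TPSSARHKRG CNCKRSMCLK KYCECYQVWY LLLPESSVTL TMSSLLVYCC  660
WSYESSLY*
"""


def clean_sequence(raw: str) -> str:
    """Drop position counters, whitespace and the '*' symbol, keeping letters only."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


protein = clean_sequence(RAW_SEQUENCE)
tcr_1 = region(protein, 530, 568)  # Signature Domain row 1
tcr_2 = region(protein, 616, 637)  # Signature Domain row 2

print(len(protein))
print(tcr_1)
print(tcr_2)
```

Under that assumption, the two slices reproduce the query strings shown in the Signature Domain alignments. The cleaned string contains 668 letters, which suggests that the stated length of 669 also counts the terminal `*`.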
3D Structure
Structure
PDB ID   Evalue   Query Start   Query End   Hit Start   Hit End   Description
5fd3_A   2e-12    531           641         12          108       Protein lin-54 homolog
5fd3_B   2e-12    531           641         12          108       Protein lin-54 homolog
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     515     519   KKRKK
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Protein
Source   Hit ID           E-value   Description
Refseq   XP_007013826.2   0.0       PREDICTED: CRC domain-containing protein TSO1
Refseq   XP_007013827.2   0.0       PREDICTED: CRC domain-containing protein TSO1
Refseq   XP_017983279.1   0.0       PREDICTED: CRC domain-containing protein TSO1
TrEMBL   A0A061GPV5       0.0       A0A061GPV5_THECC; Tesmin/TSO1-like CXC domain-containing protein, putative isoform 6
STRING   EOY31445         0.0       (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT3G22780.1   3e-48     Tesmin/TSO1-like CXC domain-containing protein
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]