PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG020677t1
Common NameTCM_020677
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family CPP
Protein Properties Length: 782aa    MW: 85321.5 Da    PI: 5.822
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG020677t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR512.8e-16485523240
               TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                        +k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Thecc1EG020677t1 485 ACKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 523
                       589**********************************96 PP

2TCR50.44.3e-16570608139
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                       ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  Thecc1EG020677t1 570 RHKRGCNCKKSSCLKKYCECYQGGVGCSINCRCEGCKNA 608
                       589***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.5E-18484525IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163437.522485610IPR005172CRC domain
PfamPF036381.1E-11487522IPR005172CRC domain
SMARTSM011142.9E-17570611IPR033467Tesmin/TSO1-like CXC domain
PfamPF036381.4E-11572608IPR005172CRC domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0009934Biological Processregulation of meristem structural organization
GO:0048444Biological Processfloral organ morphogenesis
GO:0051302Biological Processregulation of cell division
GO:0005634Cellular Componentnucleus
Sequence ? help Back to Top
Protein Sequence    Length: 782 aa     Download sequence    Send to blast
MDTPEKTQIS SSLSKFEDSP VFNYINSLSP IKPVKSVHLT QTFNPLSFAS LPSIFTSPHL  60
ISHKESRFLK RHSYTDTSKP ELSSGEGTKV STNEEAGVEA GQLCGSSTEL QENFDPGVSL  120
GEASLELPNE ASRFAIELPR TLKYDCGSPN CDPAPCVIET NCVSESNCAS VSIVPFVQEA  180
SEKGLSDGGV EVAGVCQIEQ KRENIGCDWE NLISDTADLL IFNSPNGSEA FRDVIQKSLD  240
PDTRFCATLI SRFPQNDINE VSETTIDSDK HKDPSLQTGE AVELKEITHA HGNFENARLT  300
NCMSGSLTDN VETGMCAPFS FKPGSNLHRG LRRRCLDFEM LAARRKNLVD GSNTSSSVDN  360
QFVPSKPGND SSRRILPGIG LHLNALATTS RDNKNIKHET LSSGTQKLSF PSSTTSILLP  420
TAGQEAVHES LTSVSTERET DPVENGVQLA EDASQASAYL VNEEFNQNSP KKKRRRLEQA  480
GETEACKRCN CKKSKCLKLY CECFAAGVYC IEPCSCQDCF NKPIHEDTVL ATRKQIESRN  540
PLAFAPKVIR SSDSIPEVGD DSTKTPASAR HKRGCNCKKS SCLKKYCECY QGGVGCSINC  600
RCEGCKNAFG RKDGSAIVET EEEPEEEETD PCDKNGVEKN LEKTDILDNE EQNPVSALPT  660
TPLQLCRSLV QLPFSSKSKP PRSFIAIGSS STLYNGQRYG KPNIIRPQNI VEKHFQTVTE  720
DEMPEILRGN CSPGTGIKTS SPNSKRISPP QCELGSTPGR RSGRKLILQS IPSFPSLTPQ  780
H*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A5e-1748760712120Protein lin-54 homolog
5fd3_B5e-1748760712120Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1469476PKKKRRRL
2470475KKKRRR
3472476KRRRL
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00624PBMTransfer from PK22848.1Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007034831.20.0PREDICTED: protein tesmin/TSO1-like CXC 2
TrEMBLA0A061EN220.0A0A061EN22_THECC; Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1
STRINGEOY057570.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM14672891
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-175TESMIN/TSO1-like CXC 2
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]