PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG038379t2
Common NameTCM_038379
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family CPP
Protein Properties Length: 705aa    MW: 76365.8 Da    PI: 6.807
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG038379t2genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR48.61.7e-15294332341
               TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkee 41 
                       +k+CnCkk+kClk+YC+Cfaag +C++ C+C++C+N+ e
  Thecc1EG038379t2 294 CKRCNCKKTKCLKLYCDCFAAGIYCADPCSCQGCFNRPE 332
                       89**********************************876 PP

2TCR49.11.1e-15380418139
               TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                       ++k+gCnCk+s ClkkYCeC++a++ Cs  C+Ce+CkN 
  Thecc1EG038379t2 380 RHKRGCNCKRSMCLKKYCECYQANVGCSIGCRCEGCKNV 418
                       589***********************************6 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.6E-16292333IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163434.793293420IPR005172CRC domain
PfamPF036381.7E-11295330IPR005172CRC domain
SMARTSM011141.9E-17380421IPR033467Tesmin/TSO1-like CXC domain
PfamPF036382.2E-11382417IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 705 aa     Download sequence    Send to blast
MFDLSSQGKE CKNLDAQKVV EDHEDRCDGF LQLLPGSLQR VQEYEDFAEN FEGVAEVTVD  60
SMTNDLEASE HQRGMSRRCL QFGDAQPEAT ANCSSSSLAN DMITSRSVAT TSETEGLGLS  120
HVDLSVISRK RQLVNLSQLA INMIPQHYGE KSSLTVSKPS GIGLHLNSIV NAIPMGRGGT  180
ASMKLAVDSM GIQGIKSASV MSCQSMENMQ SCSDAFEKVL AAPQDGTLEA KACVIPGSAA  240
SESLCTMESI DCQTTLHRKR ELSSEHGDSN EMFNQQSPKK KRKKSSNSTD GEGCKRCNCK  300
KTKCLKLYCD CFAAGIYCAD PCSCQGCFNR PEYEDTVLET RQQIESRNPL AFAPKIVQPV  360
TEFPVTSRED GNWKTPSSAR HKRGCNCKRS MCLKKYCECY QANVGCSIGC RCEGCKNVFG  420
KKEDYCVTEE IVNRGGGEIS ESTVAAKKDF LNSDLCDPHY LTPLTPSFQC SDHGKNAPKS  480
RLLSRRCLPS PESDLTVLAK SPRSPRTSDS NDMLLETSKE NLDVGSYCEG INYNNADVLG  540
DGCHHTPLPN HPSIILGSTS SKARELTSLS RFPLGPRSGC LSSGGSLRWR SSPITPMSSL  600
DGTKNLQGLD SDGLSDILED DTPEILKDTS TPNKSVKTSS PNGKRVSPPH NLLQLGSSSS  660
GPLRSGRKFI LKAVPSFPPL TPCIDLKGSS NQNRSSCQEN SSND*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A4e-1829541812121Protein lin-54 homolog
5fd3_B4e-1829541812121Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1279283KKRKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007013826.20.0PREDICTED: CRC domain-containing protein TSO1
RefseqXP_007013827.20.0PREDICTED: CRC domain-containing protein TSO1
RefseqXP_017983279.10.0PREDICTED: CRC domain-containing protein TSO1
TrEMBLA0A061GNJ00.0A0A061GNJ0_THECC; Tesmin/TSO1-like CXC domain-containing protein, putative isoform 1
STRINGEOY314450.0(Theobroma cacao)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.14e-61TESMIN/TSO1-like CXC 2
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]