PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG021834t1
Common NameTCM_021834
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family TALE
Protein Properties Length: 650aa    MW: 70389.6 Da    PI: 7.0788
Description TALE family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG021834t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox28.82e-094474841954
                       HHH..SSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHH CS
          Homeobox  19 Fek..nrypsaeereeLAkklgLterqVkvWFqNrRak 54 
                       Fe   ++yp+ +++  LAk++gL+ +qV++WF N R +
  Thecc1EG021834t1 447 FEHflHPYPTDTDKLMLAKQTGLSRSQVSNWFINARVR 484
                       554459******************************88 PP

2BELL111.26e-36316384472
              BELL   4 elqkkkakLlslleeVdkrYkqyveqlqtvissFeavaglgsakpYtslAlkaiSrhFrcLkdaiaeqi 72 
                       e ++kk++L+s+l+eV+krYkqy++q+q+v++sFe vaglg+a+pY++lAlka+S+hFrcLk ai++q+
  Thecc1EG021834t1 316 ESRRKKSRLISMLDEVYKRYKQYYQQMQAVVASFEYVAGLGNAAPYANLALKAMSKHFRCLKSAITNQL 384
                       7899**************************************************************997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM005741.6E-51251382IPR006563POX domain
PfamPF075268.5E-40256381IPR006563POX domain
PROSITE profilePS5007112.098425488IPR001356Homeobox domain
SMARTSM003893.4E-10428492IPR001356Homeobox domain
CDDcd000864.41E-13428489No hitNo description
SuperFamilySSF466894.71E-17430495IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.2E-27431492IPR009057Homeodomain-like
PfamPF059201.2E-18445484IPR008422Homeobox KN domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0010051Biological Processxylem and phloem pattern formation
GO:0010076Biological Processmaintenance of floral meristem identity
GO:0010077Biological Processmaintenance of inflorescence meristem identity
GO:0010089Biological Processxylem development
GO:0010154Biological Processfruit development
GO:0010223Biological Processsecondary shoot formation
GO:0010228Biological Processvegetative to reproductive phase transition of meristem
GO:0045892Biological Processnegative regulation of transcription, DNA-templated
GO:0048457Biological Processfloral whorl morphogenesis
GO:0048645Biological Processorgan formation
GO:0080006Biological Processinternode patterning
GO:0005634Cellular Componentnucleus
GO:0005829Cellular Componentcytosol
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 650 aa     Download sequence    Send to blast
MAEGFEPYHV PQQSRRDKLR IVAQNHPACV EATAATLSGC SALLPLYDPS LLSSDLLTCA  60
ANASATAGSH DFHHQTNHLS ASASGKNSPV CVVKEEGVNF MGFVGGIVNA SSSSSTSHHP  120
YLDPQSSLPL NPSSIQDMNN NPFLYTPQNL QSLRDFDHSY NNGGEVVVYK PEALSLNHES  180
STTGQGLSLS LSSHNAHQNN LPLELNLQRF ESAIYSDKVT ASGYVVPGIV GASASTSNEV  240
SRSSVPLGPF TGYASILKES RFMRPAQQLL EELCDVGRGS YAEKITPDSS LMEPPLENLS  300
PAGIVDDPLG GGDGGESRRK KSRLISMLDE VYKRYKQYYQ QMQAVVASFE YVAGLGNAAP  360
YANLALKAMS KHFRCLKSAI TNQLQFTNKA HGQISPGKEE GPRFGSTDRS LYSRPVHNAG  420
FLEHQPVWRP QRGLPERAVT VLRAWLFEHF LHPYPTDTDK LMLAKQTGLS RSQVSNWFIN  480
ARVRLWKPMV EEIHMLETRQ ALKASQKEDR NANKSSDHLS SANSLASENP STSTQRVQDT  540
PSKRTRSELP DLPAGSEPLN LSYNSLSSHP HVGVGVSMAG GSGGVSLTLG LHQNNGIGLS  600
EPFPINAAQR FGLGLEVSSE GYVIGGFEAQ NRHFGRDVIG GQLLHDFVG*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4xrm_A2e-16434492563Homeobox protein Meis2
4xrm_B2e-16434492563Homeobox protein Meis2
4xrs_A1e-16434491158Homeobox protein Meis1
4xrs_B1e-16434491158Homeobox protein Meis1
5bng_A1e-16434492159Homeobox protein Meis2
5bng_B1e-16434492159Homeobox protein Meis2
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJN1277620.0JN127762.1 Theobroma cacao clone TCC_BA003D04, complete sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007026881.10.0POX family protein, putative isoform 1
SwissprotQ9LZM81e-127BLH9_ARATH; BEL1-like homeodomain protein 9
TrEMBLA0A061ERT80.0A0A061ERT8_THECC; POX family protein, putative isoform 1
STRINGVIT_08s0007g01290.t010.0(Vitis vinifera)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM56882746
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G02030.11e-106TALE family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]