PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG025211t1
Common NameTCM_025211
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family NAC
Protein Properties Length: 348aa    MW: 40309.6 Da    PI: 4.3083
Description NAC family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG025211t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1NAM70.44.7e-2241603128
               NAM   3 pGfrFhPtdeelvveyLkkkvegkklelee....vikevdiykvePwdLpkkvka.......eekewyfFskrdk...kyatgkrknra.... 77 
                        G+ F+P+d ++v++yL   ++g++++       vi   diy+++P+ +   v++       ++++ + F++r++   k+a+gkr++r+    
  Thecc1EG025211t1   4 LGVVFDPSDRDIVSHYLPMLISGESMSSLGdlqyVIGFEDIYSTKPSVFFD-VNNgnglpflKSNQRFIFTHRQRiskKNANGKRPRRIlesh 95 
                       699**********************55533344379999********9952.22223455556678899999876222566899999988999 PP

               NAM  78 .........tksgyWkatgkdkevlskkgelvglkktLvfy.....kgrapkgektdWvmheyrl 128
                                 + gyW+++  +k++l+++++++g  +tL f+     k+ +++++kt+W+mheyrl
  Thecc1EG025211t1  96 hydetlgvgDSGGYWRSSTAEKPILDEQQKEIGFVRTLNFFefkdeKKCRKDATKTRWLMHEYRL 160
                       999998887679*****************************7766545566778*********98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5100527.9872176IPR003441NAC domain
SuperFamilySSF1019413.92E-254174IPR003441NAC domain
PfamPF023653.9E-135160IPR003441NAC domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 348 aa     Download sequence    Send to blast
MYSLGVVFDP SDRDIVSHYL PMLISGESMS SLGDLQYVIG FEDIYSTKPS VFFDVNNGNG  60
LPFLKSNQRF IFTHRQRISK KNANGKRPRR ILESHHYDET LGVGDSGGYW RSSTAEKPIL  120
DEQQKEIGFV RTLNFFEFKD EKKCRKDATK TRWLMHEYRL PGDTFQEWVI CKIKDTSRSP  180
HDDYSDSIWE KELFGKLLLP HSDENYDHQD EYQSQIQSST VFNDGNLPSF EVDQLLDDDP  240
FKEVDQLLEI NDNNQIQTQS STVFNNGNLP RYEVDQLLYA HEKEVSKDDD PFKEVDQLLE  300
INDDNQIADY PFKEMEQLLG MNDNDPIADV DEALATMNSY YLQDLLG*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007029305.20.0PREDICTED: NAC domain-containing protein 67
TrEMBLA0A061EXM70.0A0A061EXM7_THECC; Uncharacterized protein
STRINGEOY098350.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM16333214
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G27410.32e-12NAC family protein
Publications ? help Back to Top
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]