PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG033629t1
Common Name TCM_033629
Organism Theobroma cacao
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family NAC
Protein Properties Length: 334aa    MW: 38023.4 Da    PI: 4.6895
Description NAC family protein
Gene Model
Gene Model ID       Type    Source
Thecc1EG033629t1    genome  CGD
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    NAM     181.8  1.7e-56  6      133  1          128
               NAM   1 lppGfrFhPtdeelvveyLkkkvegkkleleevikevdiykvePwdLp.k.kvkaeekewyfFskrdkkyatgkrknratksgyWkatgkdke 91 
                       lppGfrFhPtdeelv +yL +k++g+++el e+i+evd+yk+ePwdLp k  + +++ ewyf+s+rdkky++g+r+nrat++gyWkatgkd++
  Thecc1EG033629t1   6 LPPGFRFHPTDEELVAYYLDRKISGRTIEL-EIIPEVDLYKCEPWDLPdKsFLPSKDMEWYFYSPRDKKYPNGSRTNRATRAGYWKATGKDRA 97 
                       79****************************.99**************96435566888*********************************** PP

               NAM  92 vlskkgelvglkktLvfykgrapkgektdWvmheyrl 128
                       v+s ++++vg+kktLv+y+grap+g +t+Wvmheyrl
  Thecc1EG033629t1  98 VHS-QKRAVGMKKTLVYYRGRAPHGIRTNWVMHEYRL 133
                       ***.9999***************************98 PP
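
The domain coordinates reported above (Start 6, End 133) are 1-based and inclusive, so they map onto a Python slice after subtracting one from the start. The following is a minimal sketch under that assumption; `extract_region` is a hypothetical helper, not part of PlantTFDB, and only the first 60 residues of this record's protein sequence are included for brevity.

```python
def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a protein sequence."""
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    # Python slices are 0-based and end-exclusive, hence start - 1 and end.
    return sequence[start - 1:end]


# First 60 residues of Thecc1EG033629t1, copied from the Sequence section
# with the spacing and position counter removed.
N_TERMINUS = "MAPMSLPPGFRFHPTDEELVAYYLDRKISGRTIELEIIPEVDLYKCEPWDLPDKSFLPSK"

# The NAM domain starts at residue 6; show its first ten residues.
domain_head = extract_region(N_TERMINUS, 6, 15)
print(domain_head)  # LPPGFRFHPT
```

The printed residues match the start of the query line in the alignment above, which begins at position 6.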

Protein Features
Database         Entry ID   E-value   Start  End  InterPro ID  Description
SuperFamily      SSF101941  4.71E-64  3      158  IPR003441    NAC domain
PROSITE profile  PS51005    61.025    6      158  IPR003441    NAC domain
Pfam             PF02365    4.6E-29   7      133  IPR003441    NAC domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 334 aa
MAPMSLPPGF RFHPTDEELV AYYLDRKISG RTIELEIIPE VDLYKCEPWD LPDKSFLPSK  60
DMEWYFYSPR DKKYPNGSRT NRATRAGYWK ATGKDRAVHS QKRAVGMKKT LVYYRGRAPH  120
GIRTNWVMHE YRLVESSSAT APASSLKDSY ALCRIFKKTI QIPNKTKEAV ENNINAEKEV  180
GWVSDEQLFG DDASGTEISR RRDAEDENFN TSSSDVTQGT PNETGMADDY HQAPFTSDEA  240
NSSANMCSLP ADFSSNLFQE MQMPGYTSLH YQVPYPPLEL EDFPQIDISE TKPEIIDEYM  300
IYDKYRGCMN GSLEEIFSLC SSQDNSMPLS MQD*
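
The sequence block above is formatted in groups of ten residues with a running position counter and a trailing `*` for the stop codon. The sketch below shows one way to strip that formatting and estimate a molecular weight from average residue masses. The helper names `clean_sequence` and `molecular_weight` are hypothetical, the mass table is the commonly used set of average residue masses, and the result may differ slightly from the MW reported in this record depending on the mass table and rounding the database used.

```python
import re

# Average residue masses in daltons (amino acid minus one water).
AVERAGE_RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # one water is added back for the free termini


def clean_sequence(block: str) -> str:
    """Drop spaces, line breaks, position counters, and the stop symbol."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def molecular_weight(sequence: str) -> float:
    """Estimate the average molecular weight of an unmodified peptide."""
    if not sequence:
        raise ValueError("empty sequence")
    return sum(AVERAGE_RESIDUE_MASS[aa] for aa in sequence) + WATER_MASS


# First formatted line of the record, used here as a short example input.
first_line = "MAPMSLPPGF RFHPTDEELV AYYLDRKISG RTIELEIIPE VDLYKCEPWD LPDKSFLPSK  60"
residues = clean_sequence(first_line)
print(len(residues), round(molecular_weight(residues), 1))
```

Passing all six lines of the sequence block as one string gives the estimate for the full-length protein.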
3D Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
1ut4_A  1e-59   1            158        12         165      NO APICAL MERISTEM PROTEIN
1ut4_B  1e-59   1            158        12         165      NO APICAL MERISTEM PROTEIN
1ut7_A  1e-59   1            158        12         165      NO APICAL MERISTEM PROTEIN
1ut7_B  1e-59   1            158        12         165      NO APICAL MERISTEM PROTEIN
3swm_A  1e-59   1            158        15         168      NAC domain-containing protein 19
3swm_B  1e-59   1            158        15         168      NAC domain-containing protein 19
3swm_C  1e-59   1            158        15         168      NAC domain-containing protein 19
3swm_D  1e-59   1            158        15         168      NAC domain-containing protein 19
3swp_A  1e-59   1            158        15         168      NAC domain-containing protein 19
3swp_B  1e-59   1            158        15         168      NAC domain-containing protein 19
3swp_C  1e-59   1            158        15         168      NAC domain-containing protein 19
3swp_D  1e-59   1            158        15         168      NAC domain-containing protein 19
4dul_A  1e-59   1            158        12         165      NAC domain-containing protein 19
4dul_B  1e-59   1            158        12         165      NAC domain-containing protein 19
Binding Motif
Motif ID  Method  Source
MP00205   DAP     Transfer from AT1G54330
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017980051.1  0.0      PREDICTED: NAC domain-containing protein 45
TrEMBL  A0A061FB92      0.0      A0A061FB92_THECC; NAC domain containing protein 20
STRING  EOY14296        0.0      (Theobroma cacao)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM8123              26           40
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G54330.1  1e-110   NAC domain containing protein 20
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]