PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG018621t1
Common NameTCM_018621
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family NAC
Protein Properties Length: 343aa    MW: 38745.9 Da    PI: 7.4019
Description NAC family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG018621t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1NAM181.42.3e-56161421129
               NAM   1 lppGfrFhPtdeelvveyLkkkvegkkleleevikevdiykvePwdLpkkvkaeekewyfFskrdkkyatgkrknratksgyWkatgkdkevl 93 
                       lppGfrFhPtdeel+++yL +kv ++ + +  +i evd++k+ePwdLp k+k +ekewyfF+ rd+ky+tg r+nrat++gyWkatgkdke++
  Thecc1EG018621t1  16 LPPGFRFHPTDEELITHYLSQKVLNSCFCA-IAIGEVDLNKCEPWDLPWKAKMGEKEWYFFCVRDRKYPTGLRTNRATEAGYWKATGKDKEIF 107
                       79***********************99888.78***************888999*************************************** PP

               NAM  94 skkgelvglkktLvfykgrapkgektdWvmheyrle 129
                       +  ++lvg+kktLvfy+grapkgekt+Wvmheyrle
  Thecc1EG018621t1 108 K-AKTLVGMKKTLVFYRGRAPKGEKTNWVMHEYRLE 142
                       *.999*****************************85 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019416.28E-6312166IPR003441NAC domain
PROSITE profilePS5100558.84616166IPR003441NAC domain
PfamPF023652.2E-3017141IPR003441NAC domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 343 aa     Download sequence    Send to blast
MENISSFRKE DEQMELPPGF RFHPTDEELI THYLSQKVLN SCFCAIAIGE VDLNKCEPWD  60
LPWKAKMGEK EWYFFCVRDR KYPTGLRTNR ATEAGYWKAT GKDKEIFKAK TLVGMKKTLV  120
FYRGRAPKGE KTNWVMHEYR LEGKYSIYNL PKTAKNEWVI CRVFQKSPGG KKTHISGFSR  180
LSSYGNDLPP SVLPPLMDSS PHNSETRTGA GETSHVTCFS DPMEDQKTPE EMIDSFNTSL  240
LASSSSSDIS PTSILLSKTF LPSSAYTNQI IPNIGNLQYS DSFWMQDQSI LKMLLESPRV  300
NSRQNSKAEF SQDSVVSNPE MIQDPSCSAG PADLGCLWSY KI*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1ut4_A1e-531317214171NO APICAL MERISTEM PROTEIN
1ut4_B1e-531317214171NO APICAL MERISTEM PROTEIN
1ut7_A1e-531317214171NO APICAL MERISTEM PROTEIN
1ut7_B1e-531317214171NO APICAL MERISTEM PROTEIN
3swm_A1e-531317217174NAC domain-containing protein 19
3swm_B1e-531317217174NAC domain-containing protein 19
3swm_C1e-531317217174NAC domain-containing protein 19
3swm_D1e-531317217174NAC domain-containing protein 19
3swp_A1e-531317217174NAC domain-containing protein 19
3swp_B1e-531317217174NAC domain-containing protein 19
3swp_C1e-531317217174NAC domain-containing protein 19
3swp_D1e-531317217174NAC domain-containing protein 19
4dul_A1e-531317214171NAC domain-containing protein 19
4dul_B1e-531317214171NAC domain-containing protein 19
Search in ModeBase
Functional Description ? help Back to Top
Source Description
UniProtBinds to the promoter regions of genes involved in chlorophyll catabolic processes, such as NYC1, SGR1, SGR2 and PAO. {ECO:0000269|PubMed:27021284}.
Regulation -- Description ? help Back to Top
Source Description
UniProtINDUCTION: Repressed by the microRNA miR164. {ECO:0000269|PubMed:15294871, ECO:0000269|PubMed:17098808}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_007032595.20.0PREDICTED: NAC domain-containing protein 92
SwissprotQ9FLJ21e-130NC100_ARATH; NAC domain-containing protein 100
TrEMBLA0A061EF120.0A0A061EF12_THECC; NAC domain containing protein 80
STRINGEOY035210.0(Theobroma cacao)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM40028174
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G61430.11e-129NAC domain containing protein 100
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]