PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Thecc1EG006109t1
Common Name TCM_006109
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family WRKY
Protein Properties Length: 1450aa    MW: 164460 Da    PI: 6.9825
Description WRKY family protein
Gene Model
Gene Model ID      Type    Source  Coding Sequence
Thecc1EG006109t1   genome  CGD     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    WRKY    75.3   7.5e-24  296    356  2          59
                       --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
              WRKY   2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59 
                       +Dg+nWrKYGqK++ +++fpr YYrC+++   gC ++k+v+r++edp +++ tY+g H+++
  Thecc1EG006109t1 296 SDGCNWRKYGQKDILNARFPREYYRCAHRhtqGCFATKEVQREDEDPMFITATYKGMHTCT 356
                       6**************************998899**************************96 PP
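
As a quick sanity check on the alignment above, the query line can be scanned for the WRKYGQK heptapeptide and the hit mapped back to protein coordinates. This is a minimal sketch: the motif string is general knowledge about WRKY domains rather than something stated in this record, and the variable names are illustrative.

```python
# Query line of the WRKY alignment (residues 296-356 of Thecc1EG006109t1).
# Lowercase letters in the alignment mark residues inserted relative to the
# HMM; they are still part of the protein, so the string is upper-cased.
DOMAIN_START = 296
domain = "SDGCNWRKYGQKDILNARFPREYYRCAHRhtqGCFATKEVQREDEDPMFITATYKGMHTCT".upper()

# Assumption: the canonical WRKY heptapeptide, not taken from this record.
MOTIF = "WRKYGQK"

offset = domain.find(MOTIF)  # 0-based offset inside the domain, -1 if absent
if offset >= 0:
    motif_start = DOMAIN_START + offset       # 1-based protein coordinate
    motif_end = motif_start + len(MOTIF) - 1
else:
    motif_start = motif_end = None

print(len(domain), motif_start, motif_end)
```

The domain string should span exactly End - Start + 1 residues, which gives a cheap consistency check between the table row and the alignment.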

Protein Features
3D Structure
Database         Entry ID           E-value   Start  End   InterPro ID  Description
Gene3D           G3DSA:2.20.25.80   3.7E-24   287    356   IPR003657    WRKY domain
PROSITE profile  PS50811            19.921    290    358   IPR003657    WRKY domain
SuperFamily      SSF118290          3.01E-22  291    357   IPR003657    WRKY domain
SMART            SM00774            1.1E-33   295    357   IPR003657    WRKY domain
Pfam             PF03106            1.7E-22   297    355   IPR003657    WRKY domain
SuperFamily      SSF52540           7.96E-7   650    862   IPR027417    P-loop containing nucleoside triphosphate hydrolase
Pfam             PF00931            6.4E-11   767    892   IPR002182    NB-ARC
SuperFamily      SSF52058           4.25E-25  957    1112  IPR032675    Leucine-rich repeat domain, L domain-like
Gene3D           G3DSA:3.80.10.10   3.7E-13   957    1108  IPR032675    Leucine-rich repeat domain, L domain-like
SuperFamily      SSF52058           4.25E-25  1244   1395  IPR032675    Leucine-rich repeat domain, L domain-like
Gene3D           G3DSA:3.80.10.10   1.2E-6    1253   1391  IPR032675    Leucine-rich repeat domain, L domain-like
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0006952  Biological Process  defense response
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043531  Molecular Function  ADP binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1450 aa
MVVHLILSFF FFFFSNRDME SWERKNVRNE LRQGRKLAKQ LQANLKRSSS EENPELVQKI  60
VSSFEKALSM LNCSTSSMAA KLQPIAHSSL ARDGSHQSKD PDHDIKKQEF KVKDFFMKRE  120
VTESQPKGLA TEMFKFPPPF SGSFSSAGNL EAFDGKAAEL KPIIIGTEFD HDFMEEHELK  180
VNETFNKSCS VKDESKPAGH AKVMFKSLSD YNLEKKKHTV GDASKKSAAG ESQPTEVEFK  240
KPKSSHSLGE SLPSKDSDYN FKEQELKVNE GSGKRNARRS WTVLVHSDMV QGELPSDGCN  300
WRKYGQKDIL NARFPREYYR CAHRHTQGCF ATKEVQREDE DPMFITATYK GMHTCTLAPD  360
LMPPGPPEIL APLDTVLGAD GNDKKDSQSN LQSSVHSPDN QTCISSTKLT SELPNLGLNL  420
NVFPEKSFKS YPTWKNFYEN EVRKNWKVLN RKKDVLLLLS SYPMIMIDKS DTDKWIIDVL  480
ATMRHVKSTE KILFGVGVAK HWPGMTTLQE LSGRLQKLLD VPLMNDIEGV LPVDLVENLY  540
RTTEADLRPL LEVEQNIISG KTSKSRGSAS NSEGAAMEAE KELQPMPAKC KTLVEDTELP  600
AKGTLNVPEE IFDLAIYLAV RQILKCINRG YIWCITISGR DKKRVLGAVK QHQDIVSEFG  660
YIIVFTVSED QSGANVHGVF HLQKGFWLGG CFDSVDLTHE YFDNLCSPGI LLLTEDDYDK  720
NMNLDQSTLP LLINLNKLVD HKHSDSRFII FTSKMATDME IRMEDHLLSW KLFCRIVGEG  780
LLSPSIQQIA ASLVKECRGN LLAIILTARS LEKVTDDVNL WELAVKRLTM LPPSQIEDTD  840
NVLINALTFI WERMNNKTRH CIKFFTWYPK GQKINRVSLI QHWIQDRLVD THDEGTNIIQ  900
NLVDTSLLNI VELNRVQLRR EIYDVLVNPL ILQMHPFYLL LGRARLIKPP EEEEWDAKVI  960
NLMDNKLSDL PESPSIKSLP ESLSSLVNLR ELLLKGCELF IRLPSHVGEL KNLEKLDLDE  1020
TQIIDLPAEI GQLSKLKILR VSFYGYMNCS KTRLRQDTII PPGTISGLSE LTELSIDVDP  1080
DDERWNATVK DIIEEACNLK TLRQLNLYLP NIEILWKRRT GSASLLHYPL PRFRFTVGYH  1140
KRQVISRVPE EVEAHFNKSN KCLKFVKGKD IPAEMRKVLN HSTAFFLEGH ATARSLSDFG  1200
IENTRLLKCC LLTECNGVKT IIDLSQGGGH SQVYTRGKGK SESLKFPEEQ TDALGNLQDL  1260
NIYYMKNLES IWKGPVHKHC LASLKFLALH KCPRLSTIFS LDLVANLDNL EELIVEHCPQ  1320
LTSLVSPTGH VSSNSTPQPN CFFPSLKRIS LLYVPNLVSI SSGLWIAPEL EKVGFYNCPK  1380
LKSLSAMEMS SDHLTRIKGE SHWWEALEWK NSEWGNPLDY LQSIFSPLIK ERDVKAQLAE  1440
EGIMHHAST*
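
The listing above uses blocks of ten residues with a running position counter at the end of each line and a terminal `*`. A minimal sketch of turning that layout into a plain residue string follows; the function name `parse_blocked_sequence` and the `SAMPLE` constant are hypothetical, and only the first two lines of the record are used as sample input.

```python
import re

def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one contiguous residue string."""
    residues = []
    for line in text.splitlines():
        # Drop the trailing position counter (e.g. "  60") if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Keep letters only; this discards block spacing and the stop "*".
        residues.append(re.sub(r"[^A-Za-z]", "", line))
    return "".join(residues)

# First two lines of the listing above, copied verbatim.
SAMPLE = (
    "MVVHLILSFF FFFFSNRDME SWERKNVRNE LRQGRKLAKQ LQANLKRSSS EENPELVQKI  60\n"
    "VSSFEKALSM LNCSTSSMAA KLQPIAHSSL ARDGSHQSKD PDHDIKKQEF KVKDFFMKRE  120\n"
)

seq = parse_blocked_sequence(SAMPLE)
print(len(seq), seq[:10])
```

With the full listing as input, 1-based coordinates from the domain tables can be converted to Python slices as `seq[start - 1:end]`.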
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
6ir8_A  2e-16    297          355        10         68       OsWRKY45
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_017970777.1  0.0      PREDICTED: uncharacterized protein LOC18607098 isoform X7
Refseq  XP_017970778.1  0.0      PREDICTED: uncharacterized protein LOC18607098 isoform X7
TrEMBL  A0A061DY23      0.0      A0A061DY23_THECC; Uncharacterized protein
STRING  EOX96986        0.0      (Theobroma cacao)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G24110.1  1e-24    WRKY DNA-binding protein 30
Publications
  1. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]