PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Thecc1EG021580t1
Common NameTCM_021580
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma
Family WRKY
Protein Properties Length: 1239aa    MW: 135151 Da    PI: 7.7205
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Thecc1EG021580t1genomeCGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY103.98.6e-33908966160
                       ---SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS-- CS
              WRKY   1 ldDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhek 60 
                       ++DgynWrKYGqK+vkgse+prsYY+Ct+++Cp+kkkvers  d++++ei+Y+g+Hnh+k
  Thecc1EG021580t1 908 VQDGYNWRKYGQKQVKGSENPRSYYKCTYPNCPTKKKVERSL-DGQITEIVYKGSHNHPK 966
                       58****************************************.***************85 PP

2WRKY103.11.5e-3210711129159
                        ---SS-EEEEEEE--TT-SS-EEEEEE-STT---EEEEEE-SSSTTEEEEEEES--SS- CS
              WRKY    1 ldDgynWrKYGqKevkgsefprsYYrCtsagCpvkkkversaedpkvveitYegeHnhe 59  
                        ldDgy+WrKYGqK+vkg+++prsYY+Ct+ gCpv+k+ver+++d ++v++tYeg+Hnh+
  Thecc1EG021580t1 1071 LDDGYRWRKYGQKVVKGNPNPRSYYKCTTIGCPVRKHVERASHDLRAVITTYEGKHNHD 1129
                        59********************************************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:3.80.10.101.2E-4014298IPR032675Leucine-rich repeat domain, L domain-like
SuperFamilySSF520584.08E-665170IPR032675Leucine-rich repeat domain, L domain-like
SMARTSM00367107398IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM00367210100125IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM0036723126151IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM0036729152176IPR006553Leucine-rich repeat, cysteine-containing subtype
SuperFamilySSF520471.13E-41177527No hitNo description
SMARTSM0036739177202IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM00367200203227IPR006553Leucine-rich repeat, cysteine-containing subtype
PfamPF135160.012253275IPR001611Leucine-rich repeat
SMARTSM003673.0E-5253278IPR006553Leucine-rich repeat, cysteine-containing subtype
Gene3DG3DSA:3.80.10.102.3E-45299516IPR032675Leucine-rich repeat domain, L domain-like
PfamPF135160.22329350IPR001611Leucine-rich repeat
SMARTSM003670.0016329354IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM003670.0078355380IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM00367280381406IPR006553Leucine-rich repeat, cysteine-containing subtype
PfamPF135164.3E-5407429IPR001611Leucine-rich repeat
SMARTSM003675.9407431IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM0036717432456IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM003674.6E-4457482IPR006553Leucine-rich repeat, cysteine-containing subtype
SMARTSM003670.0044483508IPR006553Leucine-rich repeat, cysteine-containing subtype
Gene3DG3DSA:3.80.10.102.3E-13517659IPR032675Leucine-rich repeat domain, L domain-like
SuperFamilySSF520471.57E-6521617No hitNo description
SMARTSM003670.0056534559IPR006553Leucine-rich repeat, cysteine-containing subtype
PfamPF135160.65560582IPR001611Leucine-rich repeat
SMARTSM003674.9560584IPR006553Leucine-rich repeat, cysteine-containing subtype
Gene3DG3DSA:2.20.25.802.2E-27897967IPR003657WRKY domain
SuperFamilySSF1182901.44E-24903967IPR003657WRKY domain
SMARTSM007742.4E-35908966IPR003657WRKY domain
PfamPF031067.2E-25909965IPR003657WRKY domain
PROSITE profilePS5081123.342910967IPR003657WRKY domain
Gene3DG3DSA:2.20.25.802.9E-3610561131IPR003657WRKY domain
SuperFamilySSF1182901.22E-2810631131IPR003657WRKY domain
PROSITE profilePS5081138.36610661131IPR003657WRKY domain
SMARTSM007744.4E-3810711130IPR003657WRKY domain
PfamPF031067.4E-2510721129IPR003657WRKY domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0005515Molecular Functionprotein binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1239 aa     Download sequence    Send to blast
MKKQKKECFG GFNPFDLLSE EIIFMILDLL HRNPLDKKSF SLVCKSFYAT ESNHRRTLKP  60
LRQEHLPAIL CRYSNITHLD LTLCSRVTDA SLSIISNACT STLRSVDFSR SRLFSTSGLL  120
GLALNCKNLV EIDLSNGTDL KDSAMAAVAE AKNLEKLWLA RCKSITDLGV GCVAVGCRKL  180
RFVCLKWCLG VGDLGVGLIA VKCKQILYLD LSYLPITNKC LSSVLKLQHL EDLVMEGCFG  240
IDDDSLAVLK HGCKSLKSLD VSTCQNITDS GLSSLISGAE GLQQLTLAHG SPVTSSLADC  300
LKKLSLLQSV KLDGCLITYD GLKTIGNWCL SLRELSLSKC LGVTDEGLSS VVTKHKDLRK  360
LDITCCRKIT DVSVAHITNS CNFLSSLRME SCTLVSRKAF GLIGQQCHLL EELDLTDNEI  420
DDEGLKSISR CSKLSNLKLG ICLNITDEGL IHIGRGCSKL IELDLYRCAE ITDLGILAIA  480
QGCPGLEMIN IAYCKDITDR SLLSLSKCSC LKTFESRGCS RITSLGLTAI AVGCKELSKL  540
DIKKCHNIDD AGMLPLAHFS QNLRQINLSH SSVTDVGLLS LASISCLQNI TILHLKGLTP  600
SGLAAALLAC AGLRKVKLQA AFRWLLPHRL FEHLEARGCG IRSQVLEITV GRSNAIEILE  660
DLVGTNTSNT SSINSQTAFP FSNSNPMAAS SSFMNMDNIN QNRSTSITSW GLSGHQVTDK  720
FGIEIPKFKS LAPSSLPISP APVSPSSYLV TPPAAFSPTD FLDSPVLFST SSIFPSPTTG  780
AFAGQTLNWR SNSNDNQQGI KGEHNNFFDF SFQPQPGPSS TSSSTFQSSS NIVSVEQSTW  840
NFSEPMKQPE LPVEKAARVK SEFAPMQNFS SEMAPSQTTM QQSNTGSQPA GYNQYNQSTQ  900
YTRENRKVQD GYNWRKYGQK QVKGSENPRS YYKCTYPNCP TKKKVERSLD GQITEIVYKG  960
SHNHPKPQST RRSSSHAACT NSEISDQSGG TLGNEQTDSF LVHEDTSGSI GEDEFDQASP  1020
LSNPGGDDNE NEPDAKRWKG ENENEGIIGS GSRTVREPRI VVQTTSDIDI LDDGYRWRKY  1080
GQKVVKGNPN PRSYYKCTTI GCPVRKHVER ASHDLRAVIT TYEGKHNHDV PAARGSGYAI  1140
NRPSTTNNSN APMPIRPSAV PSQASNTSYP NSLQTRLPTS GSQPPVTLEM LQNQGSYGFS  1200
GFGKPIGSYM SQAQFSEGAF ARAKDEPEDD SFFDGFLS*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
1wj2_A2e-368961132578Probable WRKY transcription factor 4
2lex_A2e-368961132578Probable WRKY transcription factor 4
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAY3311680.0AY331168.1 Theobroma cacao WRKY 10 (tcw10) gene, tcw10-1 allele, partial cds.
GenBankAY3311690.0AY331169.1 Theobroma cacao WRKY 10 (tcw10) gene, tcw10-2 allele, partial cds.
GenBankAY3311700.0AY331170.1 Theobroma cacao WRKY 10 (tcw10) gene, tcw10-3 allele, partial cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_017977792.10.0PREDICTED: F-box/LRR-repeat protein 3
SwissprotQ8RWU50.0FBL3_ARATH; F-box/LRR-repeat protein 3
TrEMBLA0A061EXQ10.0A0A061EXQ1_THECC; RNI-like superfamily protein
STRINGEOY070490.0(Theobroma cacao)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G38470.11e-127WRKY DNA-binding protein 33
Publications ? help Back to Top
  1. Callis J,Vierstra RD
    Protein degradation in signaling.
    Curr. Opin. Plant Biol., 2000. 3(5): p. 381-6
    [PMID:11019805]
  2. Xiao W,Jang J
    F-box proteins in Arabidopsis.
    Trends Plant Sci., 2000. 5(11): p. 454-7
    [PMID:11077244]
  3. Motamayor JC, et al.
    The genome sequence of the most widely cultivated cacao type and its use to identify candidate genes regulating pod color.
    Genome Biol., 2013. 14(6): p. r53
    [PMID:23731509]