PlantTFDB
Plant Transcription Factor Database
v4.0
Transcription Factor Information
Basic Information
TF ID Carubv10025742m
Common Name CARUB_v10025742mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Capsella
Family WRKY
Protein Properties Length: 1412 aa    MW: 159396 Da    pI: 5.7938
Description WRKY family protein
Gene Model
Gene Model ID    Type    Source  Coding Sequence
Carubv10025742m  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    WRKY    86.6   2.2e-27  1215   1275  2          59
                       --SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
             WRKY    2 dDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59  
                       +D ++WrKYG+Ke+ gs fprsY+rCt++   gC+++k+v+rs++dp++++itY +eHnh+
  Carubv10025742m 1215 EDLWSWRKYGKKEILGSLFPRSYFRCTHKfaqGCKATKQVQRSDTDPNMFTITYLSEHNHP 1275
                       79**************************9999****************************8 PP

Protein Features
Database         Entry ID           E-value   Start  End   InterPro ID  Description
SuperFamily      SSF52200           6.41E-8   17     108   IPR000157    Toll/interleukin-1 receptor homology (TIR) domain
Pfam             PF01582            1.0E-4    18     99    IPR000157    Toll/interleukin-1 receptor homology (TIR) domain
SuperFamily      SSF52540           8.26E-50  156    395   IPR027417    P-loop containing nucleoside triphosphate hydrolase
Pfam             PF00931            8.3E-27   158    404   IPR002182    NB-ARC
Gene3D           G3DSA:3.40.50.300  6.1E-10   168    290   IPR027417    P-loop containing nucleoside triphosphate hydrolase
PRINTS           PR00364            2.4E-23   170    185   No hit       No description
PRINTS           PR00364            2.4E-23   242    256   No hit       No description
PRINTS           PR00364            2.4E-23   335    349   No hit       No description
SuperFamily      SSF46785           8.98E-12  372    473   IPR011991    Winged helix-turn-helix DNA-binding domain
SuperFamily      SSF52058           9.35E-34  508    690   IPR032675    Leucine-rich repeat domain, L domain-like
Gene3D           G3DSA:3.80.10.10   9.6E-17   511    703   IPR032675    Leucine-rich repeat domain, L domain-like
Pfam             PF07725            9.6E-9    575    594   IPR011713    Leucine-rich repeat 3
Gene3D           G3DSA:3.80.10.10   1.7E-15   704    874   IPR032675    Leucine-rich repeat domain, L domain-like
SuperFamily      SSF52058           9.35E-34  729    882   IPR032675    Leucine-rich repeat domain, L domain-like
PRINTS           PR00364            2.4E-23   807    823   No hit       No description
SuperFamily      SSF46785           1.6E-7    1095   1179  IPR011991    Winged helix-turn-helix DNA-binding domain
Gene3D           G3DSA:2.20.25.80   5.4E-26   1201   1276  IPR003657    WRKY domain
PROSITE profile  PS50811            24.34     1209   1277  IPR003657    WRKY domain
SuperFamily      SSF118290          1.22E-23  1212   1276  IPR003657    WRKY domain
SMART            SM00774            2.7E-38   1214   1276  IPR003657    WRKY domain
Pfam             PF03106            7.4E-24   1215   1275  IPR003657    WRKY domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0007165  Biological Process  signal transduction
GO:0008219  Biological Process  cell death
GO:0009816  Biological Process  defense response to bacterium, incompatible interaction
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0005515  Molecular Function  protein binding
GO:0043531  Molecular Function  ADP binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1412 aa
MTNCENDAGF VCISCVDEVR CSFVSHLSEA LRRKGINYAV VDVSSDELFS KESREKVEKA  60
RVSVIVLPGN CEPSRVCLDN FAMVLECQRR MVVPVLYGDS PLREEWLSEL DLKGLSPVHK  120
SRKECSDSTL VEEIVGDVYE KLFYSGRIGI YSKLLEIEKL VGNQPFGIRC VGIWGMPGIG  180
KTTLAKAVFD QMSGAFDASC FIEDYEKSIH EKGLYCLLEE HLLKESPGND ATIMNLSSLR  240
DRLNSKRVLV VLDDVRNGLV AESFLEGFDW LEPGSLIIIT SRDKQVFRLC QINQIYEVQG  300
LNEREALQLF LLCASLKDMG EQNLREFSLK VINYANGNPL AINVYGRELK GKKKLSEMET  360
VFLKVKRRPP FKIVDAFKSS YDTLSDNEKN IFLDIACFFH GENVNYVIQL LEGCGFFPHV  420
GIDVLVEKSL VTVSENRVRL HNLTQAVGQE IINGETVQIE RRKRLWEPWS IKYLLEYNEP  480
KANEEPKTTF KRAQGSEEIE GMFLDASNLK FDVQPSAFKN MLNLRLLKIY CSNPEVHPVI  540
NFPKDFFHSL PDELRLLHWE NYPLQSLPQS FDPRHLVEIN MPYSQLQKLW GGTKNLEALR  600
TVRLCHSQHL VDIDDLVKAQ NLEVIDLQGC TRLQNFPAAG QLLHLRVVNL SGCIEIKSFL  660
EIPPNIETLH LQGTGILALP LSTVKPNHRE LLNFLTEIPS LSEALKLERL TSLLECRTSC  720
QDLGKLICLE LKDCSCLQSL PNLANLDLLN VLDLSGCSRL NSIQGFPRFL KELYLTGTAI  780
REVPQLPQSL ELLNAHGSIV QSLPDMANLE FLKVLDLSGC SELETVQGFP RNLKELYLAG  840
TTLREVPQLP LSLELLNAHG SVSLKSIRPN YYKLPMHYTF SNLFDLSPQV VNDFLVKALT  900
NVKHISRKYM QNLNKAPTFS FSAPSHANQN APLGLQPGSS VMTRLNPYWR NMLVGFGMMV  960
EVAFSEDYCD ATGFGISCVC RWSNKEGRSY KIERNFHCWP PGKVVPKVLK NHTFVFCDIN  1020
MSPSTDGGND PGIWADLVVF EFFPINQQTK SLIDRFTVTR CGIRVIDVTT GYTSLKNISL  1080
VLSLNPMEVS GYEVVEEVLR VSYDDLQEMD KVLFLYIACL FNDEDVDVVA PLIAGIDLAV  1140
SSGLKVLADV SLISVSSNGE IVMHSLLRKM AKEILHGQAI VLSDCESTMA DNLSDIPKKR  1200
RKRNIKKVVC ATANEDLWSW RKYGKKEILG SLFPRSYFRC THKFAQGCKA TKQVQRSDTD  1260
PNMFTITYLS EHNHPSSTEW MALAGPSRPT RSTSSSNYSA VTTSASSRVS QNKVKSNKLH  1320
LPSSSTPPGN AGVQLKEKDM EEFQDNMELD NDVEDICTLE LFPEFQHQPE ENPSSSTFDN  1380
KDWSDWFSMF SIPKFQDQPE EGPFSPDWLW E*
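
The listing above is laid out in blocks of ten residues with a running position counter at the end of each line and a trailing `*` stop marker. A minimal Python sketch for turning such a listing into a plain string and slicing it by the 1-based coordinates used in the tables of this record follows. The function names `parse_numbered_sequence` and `slice_1based` are hypothetical helpers introduced here for illustration; they are not part of PlantTFDB.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join a numbered, block-formatted protein listing into one string.

    Assumption: residues are the only letters in the listing, so keeping
    letter runs drops spaces, position counters, and the '*' stop marker.
    """
    residues = []
    for line in text.splitlines():
        residues.extend(re.findall(r"[A-Za-z]+", line))
    return "".join(residues).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    if start < 1 or end < start or end > len(seq):
        raise ValueError("coordinates out of range for this sequence")
    return seq[start - 1:end]


# First line of the listing above, copied verbatim.
first_line = "MTNCENDAGF VCISCVDEVR CSFVSHLSEA LRRKGINYAV VDVSSDELFS KESREKVEKA  60"
seq = parse_numbered_sequence(first_line)
print(len(seq))                  # 60, matching the counter at the end of the line
print(slice_1based(seq, 1, 10))  # MTNCENDAGF
```

Applied to the full listing, `slice_1based(seq, 1215, 1275)` should return the 61-residue segment shown on the Carubv10025742m row of the WRKY alignment in the Signature Domain section, which makes a convenient consistency check on the coordinates.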
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
4c6t_A  2e-54    6            149        4          151      PROBABLE WRKY TRANSCRIPTION FACTOR 52
4c6t_C  2e-54    6            149        4          151      PROBABLE WRKY TRANSCRIPTION FACTOR 52
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1197   1201  KKRRK
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HQ170631  0.0      HQ170631.1 Arabidopsis thaliana RRS1-R mRNA, complete cds.
GenBank  JX135560  0.0      JX135560.1 UNVERIFIED: Arabidopsis thaliana ecotype Be-0 (Bensheim) resistance to ralstonia solanacearum 1-like mRNA, partial sequence.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_006279894.1  0.0      hypothetical protein CARUB_v10025742mg
Swissprot  C4B7M5          0.0      WR52W_ARATH; Disease resistance protein RRS1
TrEMBL     R0EUJ3          0.0      R0EUJ3_9BRAS; Uncharacterized protein
STRING     AT5G45260.1     0.0      (Arabidopsis thaliana)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
MalvidsOGEM2506814
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G45260.1  0.0      Disease resistance protein (TIR-NBS-LRR class)