PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.006G051700.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family WRKY
Protein Properties Length: 1103aa    MW: 122342 Da    PI: 6.3764
Description WRKY family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.006G051700.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1WRKY80.22.2e-25142203159
                           ---SS-EEEEEEE--TT-SS-EEEEEE-ST...T---EEEEEE-SSSTTEEEEEEES--SS- CS
                  WRKY   1 ldDgynWrKYGqKevkgsefprsYYrCtsa...gCpvkkkversaedpkvveitYegeHnhe 59 
                           ldDg+ WrKYGqK++ g+++pr+YYrCt++   gC ++k+++r++ dp +++++Y g+H+++
  Sobic.006G051700.1.p 142 LDDGFIWRKYGQKDILGAKHPRGYYRCTHRhmqGCLATKQIQRTDGDPLLLDVVYIGSHTCT 203
                           58***************************99999**************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.20.25.807.0E-26129205IPR003657WRKY domain
SuperFamilySSF1182902.35E-24137205IPR003657WRKY domain
PROSITE profilePS5081122.068137205IPR003657WRKY domain
SMARTSM007741.8E-39142204IPR003657WRKY domain
PfamPF031062.1E-24143202IPR003657WRKY domain
Gene3DG3DSA:3.40.50.3001.4E-10297441IPR027417P-loop containing nucleoside triphosphate hydrolase
SuperFamilySSF525401.18E-37297548IPR027417P-loop containing nucleoside triphosphate hydrolase
CDDcd000095.01E-6301439No hitNo description
PfamPF009313.8E-29315575IPR002182NB-ARC
PRINTSPR003646.8E-9325340No hitNo description
PRINTSPR003646.8E-9398412No hitNo description
SuperFamilySSF520583.91E-42612833IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.101.3E-18683823IPR032675Leucine-rich repeat domain, L domain-like
PRINTSPR003646.8E-9697713No hitNo description
SuperFamilySSF520583.91E-428791089IPR032675Leucine-rich repeat domain, L domain-like
Gene3DG3DSA:3.80.10.108.2E-168821092IPR032675Leucine-rich repeat domain, L domain-like
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0003700Molecular Functiontranscription factor activity, sequence-specific DNA binding
GO:0043531Molecular FunctionADP binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1103 aa     Download sequence    Send to blast
MEGMPEEKCS LAAVAAELAQ IHDMAKQLVE QVADPQQGGG DGDAAAGGGY QRVRELTSTI  60
CANVDKALHM LTSNSLDGSP AAGQPESTPS SGGHGSSRGA VLDSDQAGGG TGNAPGQGKD  120
RKTLSKWSTQ VRVSNAQDAT YLDDGFIWRK YGQKDILGAK HPRGYYRCTH RHMQGCLATK  180
QIQRTDGDPL LLDVVYIGSH TCTQPWGAAA HPNIQSMLPT TEQTTTSGSE SGSVLTSEIP  240
GSMASRKRDT GGETRLSKTM IEEPHSTPYK DVMAWFSMGK LRQKGRLSHT CMQQAGQELL  300
GRDDIRQQVM EKILLDRNGV NNCTVICIYG WSGLGKTSLL HALYNDQQLL DAFDKRIWIQ  360
ISDKIDISML FRKIVEFAMN EHCSITNIDF LRELVVEEIT DKKFLLFLDD ADIVNQQFWT  420
TLLEVLNTGA KGSVVVMATR SSTVAAVRNV ATHSYSLNPL SEENNLMLLQ QYAVVGTDIQ  480
SNPDLALIAN RFISRFRYNL LHLKAIGGLL CHTDTFSVEK DKFEGSVMPL WICHDVLPVH  540
LKRCLALCSL FPEGYIFGKH HMVLLWISHG CVRPVEGYEL EDVGVEYFNE LLCRSFFQCS  600
PVHSDKNEMF VMHELMYKVV ESVSPDKYFK SEDPVISIPE NVFHCSLITS QFQTVELMHR  660
MKQLKHLQTF MVVQPEWKPN NISLPTLNLV GLDDFFLKFT SLETLDLSHT ETEELPASIA  720
GLRNLRYLSV NSTNVRALPC ELCSLSNLQT LEAKHCRFLT ELPRDIKMLV KLRHLDLTKE  780
LGYVDLPHGI GELIELQTLP VFHVSGDSSC CSISELGSLH NLRGCLWLSG LESVKTGSKA  840
KEANLKDKHC LNDLTLQWHD DGIDIEDEGE DSKDVADEQV LEGLKPHVNL QVLTIRGYEG  900
RRFPAWMQGS SPSLPNLVTL TLDNCCNCTE FPTIVQLPSL KSLSVRKMYD VQQLSSHTDT  960
HGNGSTAKFP SLELLNLWEM YGLEELFSKE SEGDCPRLRK VCISRCPDLR RLPSARSLTE  1020
LVLHCGKQLP DISELASLVS LKIEGFHGTK SFGLPAAAAL RKLEIRSCKE LASVDGLSAV  1080
LTTVQRLKIA GCPKLVLPGR NQ*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002446300.10.0hypothetical protein SORBIDRAFT_06g013840
TrEMBLC5YES60.0C5YES6_SORBI; Putative uncharacterized protein Sb06g013840
STRINGSb06g013840.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP1356923
MonocotsOGMP125771216
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G11070.14e-30WRKY family protein