PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PDK_30s929831g001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Arecales; Arecaceae; Coryphoideae; Phoeniceae; Phoenix
Family Trihelix
Protein Properties Length: 440aa    MW: 51121.6 Da    PI: 6.9862
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PDK_30s929831g001genomePDKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix70.43.3e-22123219186
           trihelix   1 rWtkqevlaLiearremeerlrrgk..........lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkr.tsessstcpy 81 
                        +Wt+ +v++Li++ +++ e   +            +kk++W+ vsk+m+er + +sp+qC++k+++lnkryk+++e+ +++ +++++++ ++
  PDK_30s929831g001 123 KWTDAMVRLLITVISYLTEGTAELDssakkklavlQKKGKWKLVSKVMAERNCCVSPQQCEDKFNDLNKRYKRLTEILGRGtSCKVVENPVL 214
                        7**************98888886322456678888**********************************************66999999999 PP

           trihelix  82 fdqle 86 
                        +d ++
  PDK_30s929831g001 215 LDYMH 219
                        99987 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF819952.01E-52981No hitNo description
PfamPF138372.8E-20121245No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 440 aa     Download sequence    Send to blast
MEGNMSAGNM MPGPSSYGSF DLQVSMAMHQ QQQQHGCFHH QQQQPTQTQN QGPMVHQPMN  60
SIFPQSVGQM NEREQQPMLQ MMDYSKVDHG KISTSEEDDT EDGADGPNKS ERGKKAAPWQ  120
RVKWTDAMVR LLITVISYLT EGTAELDSSA KKKLAVLQKK GKWKLVSKVM AERNCCVSPQ  180
QCEDKFNDLN KRYKRLTEIL GRGTSCKVVE NPVLLDYMHH ISDKVKDDVR KILSSKHLHY  240
EEMCSYHNKN RLHLPADPEV QRSVQSLLKT SDDHDAKRAA RDDFDENDQD EDSDDREDDL  300
EENDTLFRDI GGSCFPKRMK QGLVCEDVNF RNSLGPQNNP RRPNSQSISF DVNQVFPEGS  360
TTPWVQKQLI RSRSIQLEEQ RLQIQAEMLE IEKQQFKWKR FRKKKDRELD KMRIQNERMK  420
LENERLALEL KRRELVLDLN
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008793938.10.0PREDICTED: uncharacterized protein LOC103710100
TrEMBLA0A068UNC01e-159A0A068UNC0_COFCA; Uncharacterized protein
TrEMBLB9T2G21e-159B9T2G2_RICCO; Transcription factor, putative
STRINGGLYMA10G36980.11e-152(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP29203886
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G21200.19e-86sequence-specific DNA binding transcription factors