PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PDK_30s745091g001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Arecales; Arecaceae; Coryphoideae; Phoeniceae; Phoenix
Family Trihelix
Protein Properties Length: 324aa    MW: 36389.6 Da    PI: 7.992
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PDK_30s745091g001genomePDKView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix47.83.8e-1525111273
           trihelix   2 WtkqevlaLiearr.........emeerlrrgk..............lkkplWeevskkmrergferspkqCkekwenlnkrykkikegekk 70 
                        Wt +e+++Liea++         +                         + +W++v +++++ g+ rs++qC+++w+nl +++kk+++ e +
  PDK_30s745091g001  25 WTLNETMVLIEAKKidnerrmkrS--------IesegreggsssssrPSEMRWKWVEDYCWRCGCYRSQNQCNDRWDNLMRDFKKVRAYEMS 108
                        **************6555444442........133334444555555999*************************************98877 PP

           trihelix  71 rts 73 
                          +
  PDK_30s745091g001 109 LGA 111
                        644 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS500907.2241796IPR017877Myb-like domain
PfamPF138375.4E-1124106No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 324 aa     Download sequence    Send to blast
MADKSAMFGV VGGEERRREY RKGNWTLNET MVLIEAKKID NERRMKRSIE SEGREGGSSS  60
SSRPSEMRWK WVEDYCWRCG CYRSQNQCND RWDNLMRDFK KVRAYEMSLG AEEGDRLSYW  120
KLDRHERKER NLPSNLLPGI YEALIEVVDR RGVEGVSGSI SNMAELVGDE RHMGSTSASM  180
PPVMQHNRPV PTSQGPQPPP LSPTHELPQA QAAGTIDSDD DEHSNSPERK RRRGEGSSSK  240
NSSHKLTSAI SKSASILAEA FQAGEEKEER RHNDLRRIEE RKAKMEQSKA EISIQSMDGL  300
AAAINQLASS ILGSLADKGP AAPK
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1228232RKRRR
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008787373.10.0PREDICTED: uncharacterized protein LOC103705444
TrEMBLM0STW61e-109M0STW6_MUSAM; Uncharacterized protein
STRINGGSMUA_Achr4P32040_0011e-108(Musa acuminata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP32943571
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G35640.11e-47Trihelix family protein