PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID XP_021666517.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Malpighiales; Euphorbiaceae; Crotonoideae; Micrandreae; Hevea
Family HB-PHD
Protein Properties Length: 731aa    MW: 82590 Da    PI: 10.1376
Description HB-PHD family protein
Gene Model
Gene Model ID Type Source Coding Sequence
XP_021666517.1genomeNCBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox33.28.7e-11508552953
                     HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHH CS
        Homeobox   9 keqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRa 53 
                     ++ +e L ++F++n  ps++ +e L+k+lgL+  +V+ WF+N R+
  XP_021666517.1 508 PSAVEKLRQVFAENELPSRTVKENLSKELGLDPGKVSKWFKNARY 552
                     67899**************************************97 PP

2PHD35.91.7e-12238292150
                     SBTTTSSTCT..TSSEEEBS.SSSSEEETTTSTSSSSHHSHHSS..TBSSHHHHT CS
             PHD   1 rCkvCgksde..egelvlCd.gCkewfHlkClglkleseekpeg..ewlCeeCke 50 
                     +C+ C++++   ++++vlCd +C+ +fH+kCl+++l+++++p g   w+C+ C++
  XP_021666517.1 238 FCAKCKSNEVfpDNDIVLCDgTCNCAFHQKCLDPPLDTANIPPGdqGWFCKFCEC 292
                     6999**944345********56********************9999*******97 PP

Sequence ? help Back to Top
Protein Sequence    Length: 731 aa     Download sequence    
MRGAGKKVMH QESGKSSFSK TENGSMLIAS LKLKKDSKIS HCKKHKPKSK SKSHLKAVGA  60
NLLKSTVTDP SSKGIRNDST SKKLISRKIL KKAHEPKKLA SSKPRGKHSS VIASEENGKN  120
ANREVTFKNL NKKKNKRRRK EKAELDEPSR MQRRARYLLI KMKLEQNLID AYSGEGWKGQ  180
SREKIKPEKE LLRAKKQILK CKLGIRDTIH QLDSLSRVGC IEDSVMAPDG SVSHEHIFCA  240
KCKSNEVFPD NDIVLCDGTC NCAFHQKCLD PPLDTANIPP GDQGWFCKFC ECRMEIIEAM  300
NAHLGTQFSV SGCWQDIFKE AANFSDGGSM LLNPEQEWPS DDSEDDDYDP ERRENSISGA  360
GTDDDASDDA SSSTSLGWSS DGEVFSGSRK WEMESTDFRN QSIYSSLDSD ETSDEEIMCG  420
PRKRRAVDYK KLYDEMFGKD APAYEQVSED EDWGPGKRKR REKESDAAST LMTLYESEKK  480
CKKVETIDVK RKLSRDSQVR RPFFRIPPSA VEKLRQVFAE NELPSRTVKE NLSKELGLDP  540
GKVSKWFKNA RYLALKSRKA GRAKELRNSS RKISRPRLDN MKDKTADIVE LNNASMETSI  600
CSPKNSQQVL QRKEPKSLSS SLVKNERKRA SIGSPSKSNK ISVEYSDDVS LKKLLKSKTK  660
RGKKRNNSIS VTLSQVAEAE MERLCRAKVR LENMKQTLLG LQIGKSRKSN KNQLHQESVI  720
FVPIAEIREK I
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1136163KRRRKEKAELDEPSRMQRRARYLLIKMK
2137164KRRRKEKAELDEPSRMQRRARYLLIKMK
3137142RRRKEK
4457483KRKRREKESDAASTLMTLYESEKKCKK
5458463RRRKEK
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G29940.24e-93HB-PHD family protein