PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cla021261
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family HD-ZIP
Protein Properties Length: 744 aa    MW: 81942.3 Da    pI: 5.9625
Description HD-ZIP family protein
Gene Model
Gene Model ID  Type    Source
Cla021261      genome  ICuGI
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  62.8   5e-20    109    164  1          56
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                +++ ++++ eq++eLe++F+++++p++++r eL+++l+L+++qVk+WFqNrR+++k
  Cla021261 109 KKRYHRHSVEQIQELEAMFKECPHPDEKQRLELSRRLSLETKQVKFWFQNRRTQMK 164
                6778999**********************************************999 PP

2    START     173.2  1.7e-54  276    490  4          205
                HHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS.........SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT... CS
      START   4 eeaaqelvkkalaeepgWvkssesengdevlqkfeeskv........dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevissg... 88 
                ++a++e vk+a +eep+Wv++   +n++e+l+ f+ s +        +  ea+r +++v+ ++  l+ +l+d   +W e+++     a+t++vis g   
  Cla021261 276 LAAMDEIVKMANEEEPLWVREK--LNEEEYLRMFSGSCFgvkhinngFVFEASRQTALVFLNTSALLDTLMDPD-RWVEMFPnlvaTASTTDVISGGmgg 372
                689*******************..**********77666*****99999**********************999.************************* PP

                ...EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHH CS
      START  89 ...galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllrsl 184
                   galqlm aelq lsp+vp R   f+R+++q+ +g+wv+vdvS+d+ ++ ++s+++    +lpSg+li++++ng+skvtwveh +++++ +h+l+r+l
  Cla021261 373 trnGALQLMHAELQILSPMVPvRQLSFLRFCKQHAEGVWVVVDVSIDPITDTSSSPPCR---RLPSGCLIHEMPNGYSKVTWVEHSEYDESYIHELYRPL 469
                ***************************************************88555544...6************************************* PP

                HHHHHHHHHHHHHHHTXXXXX CS
      START 185 vksglaegaktwvatlqrqce 205
                ++sgl +ga +w atlqrq e
  Cla021261 470 IRSGLGFGAPRWIATLQRQSE 490
                ******************987 PP
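
The Start and End columns of the Signature Domain table can be checked against the protein sequence. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive, as the alignment numbering suggests. The names `REGION`, `REGION_START`, and `slice_domain` are hypothetical helpers introduced for illustration and are not part of PlantTFDB. Only residues 101-180 are stored, copied from the Protein Sequence section of this record.

```python
# Residues 101-180 of Cla021261, copied from the Protein Sequence section.
REGION = (
    "GGGGRKKRKKRYHRHSVEQI"
    "QELEAMFKECPHPDEKQRLELSRRLSLETKQVKFWFQNRRTQMKTQLERHENTLLRQENE"
)
REGION_START = 101  # 1-based position of the first residue in REGION


def slice_domain(start, end, region=REGION, region_start=REGION_START):
    """Return residues start..end (1-based, inclusive) from the stored region."""
    region_end = region_start + len(region) - 1
    if start > end or start < region_start or end > region_end:
        raise ValueError("coordinates fall outside the stored region")
    return region[start - region_start : end - region_start + 1]


# Homeobox row of the table: Start 109, End 164.
homeobox = slice_domain(109, 164)
print(homeobox)
print(len(homeobox))
```

Under that assumption, the slice reproduces the Cla021261 row of the Homeobox alignment, 56 residues long, matching HMM positions 1 to 56.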

Protein Features
Database         Entry ID          E-value    Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  5.8E-22    95     166  IPR009057    Homeodomain-like
SuperFamily      SSF46689          6.27E-20   97     166  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.783     106    166  IPR001356    Homeobox domain
SMART            SM00389           1.1E-18    107    170  IPR001356    Homeobox domain
Pfam             PF00046           1.8E-17    109    164  IPR001356    Homeobox domain
CDD              cd00086           4.38E-17   113    166  No hit       No description
PROSITE pattern  PS00027           0          141    164  IPR017970    Homeobox, conserved site
PROSITE profile  PS50848           39.158     264    494  IPR002913    START domain
SuperFamily      SSF55961          1.79E-29   268    491  No hit       No description
CDD              cd08875           6.03E-101  272    490  No hit       No description
SMART            SM00234           4.0E-38    273    491  IPR002913    START domain
Pfam             PF01852           7.2E-47    276    490  IPR002913    START domain
SuperFamily      SSF55961          4.51E-17   509    727  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0048497  Biological Process  maintenance of floral organ identity
GO:0008289  Molecular Function  lipid binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 744 aa
MSFLPRFHSS GAGGGNGDGD GDGSLMAAGA IAQPHLITQS SFTKSMFSSP TLSLALTNID  60
GLGVGEMVAA DVRRRSREEE AAESRSGSEN MDGGSGEEVE GGGGRKKRKK RYHRHSVEQI  120
QELEAMFKEC PHPDEKQRLE LSRRLSLETK QVKFWFQNRR TQMKTQLERH ENTLLRQENE  180
KLRTENMAIR EAMREPICSN CGGPAIIGEI SIEEQQLRIE NARLKDELDR VCALAGKFLG  240
RAAVPVSPPL ASSSCLEEYG GGMMMMERCV YLEMGLAAMD EIVKMANEEE PLWVREKLNE  300
EEYLRMFSGS CFGVKHINNG FVFEASRQTA LVFLNTSALL DTLMDPDRWV EMFPNLVATA  360
STTDVISGGM GGTRNGALQL MHAELQILSP MVPVRQLSFL RFCKQHAEGV WVVVDVSIDP  420
ITDTSSSPPC RRLPSGCLIH EMPNGYSKVT WVEHSEYDES YIHELYRPLI RSGLGFGAPR  480
WIATLQRQSE CLAILSTPID YSGISTNGRR SMVKLAQRMT VNFCAGICGS TIYKWNKLNT  540
RNNNVGEDVR VMTRKSVEDP GEPPGTVLSA ATSVWVAEAA ERVFEFLRDE RLRSEWDILS  600
NGGPMQEMLH IPKSQHHHHP NAVSLLRATQ SLNPNQSSML ILQETCTDAS GSLVVYAPVD  660
IPAMQAVMNG GDSAYVALLP SGFAVVPAAE DGGGGSLLTV AFQILVNSLP TDKLTVESVE  720
TVNNLISCTV QKIKTALRCH APST
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    104    110  RKKRKKR
2    105    109  KKRKK
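
The NLS coordinates can be cross-checked against the protein sequence. The sketch below locates each listed motif in residues 61-120 of the sequence and reports 1-based, inclusive positions. `SEGMENT`, `SEGMENT_START`, and `find_motif` are hypothetical names used only for illustration. The positions it finds are each one higher than the Start and End values listed in the table, which suggests, but does not confirm, that the NLS table uses 0-based coordinates while the domain alignment uses 1-based numbering.

```python
# Residues 61-120 of Cla021261, copied from the Protein Sequence section.
SEGMENT = "GLGVGEMVAADVRRRSREEEAAESRSGSENMDGGSGEEVEGGGGRKKRKKRYHRHSVEQI"
SEGMENT_START = 61  # 1-based position of the first residue in SEGMENT


def find_motif(motif, segment=SEGMENT, first_pos=SEGMENT_START):
    """Return (start, end) of the first match, 1-based inclusive, or None."""
    idx = segment.find(motif)
    if idx < 0:
        return None
    start = first_pos + idx
    return start, start + len(motif) - 1


# Motifs as listed in the NLS table.
for motif in ("RKKRKKR", "KKRKK"):
    print(motif, find_motif(motif))
```

Subtracting 1 from both values of each result gives the table entries (104, 110) and (105, 109).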
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Binding Motif
Motif ID  Method  Source
MP00418   DAP     Transfer from AT3G61150
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_022924364.1  0.0      homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
Swissprot  Q9M2E8          0.0      HDG1_ARATH; Homeobox-leucine zipper protein HDG1
TrEMBL     A0A1S3CBG8      0.0      A0A1S3CBG8_CUCME; homeobox-leucine zipper protein ANTHOCYANINLESS 2
STRING     XP_008460172.1  0.0      (Cucumis melo)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF1192              34           106
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G00730.1  0.0      HD-ZIP family protein
Publications
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Horstman A, et al.
    AIL and HDG proteins act antagonistically to control cell proliferation.
    Development, 2015. 142(3): p. 454-64
    [PMID:25564655]