PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cla006197
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family HD-ZIP
Protein Properties Length: 749aa    MW: 82666 Da    PI: 6.106
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cla006197genomeICuGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.41.9e-2155110156
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
   Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                r+k +++t++q++eLe +F+++++p+ ++r eL+++lgL+++qVk+WFqNrR+++k
  Cla006197  55 RKKYHRHTPHQIQELEIFFKECPHPDDKQRSELSRRLGLETKQVKFWFQNRRTQMK 110
                7999*************************************************999 PP

2START177.76.7e-562634812205
                HHHHHHHHHHHHHHHC-TT-EEEE...EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT. CS
      START   2 laeeaaqelvkkalaeepgWvkss...esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevissg. 88 
                la++a++elvk+a+ + p+W +s+   e +n de+ ++f++s +      ++ea r++++v+ ++  lve+l+d + +W e+++    +a+t++vissg 
  Cla006197 263 LALAAMNELVKMAQMDGPLWIRSRdgkETLNLDEYSRTFPSSAGmkhsnWTTEATRDTTMVIINSLALVETLMDAN-RWAEMFPcliaRATTIDVISSGm 361
                6899********************999999*********99988*99999**************************.*********************** PP

                .....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHHHHH CS
      START  89 .....galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphwllr 182
                     galqlm ael +lsplvp R   f+R+++q+  g w++vdvS+ + ++ +   s+  +++lpSg+++++++ng skvtwveh+++++ ++h+l+r
  Cla006197 362 ggtrnGALQLMHAELRVLSPLVPvRTLKFLRFCKQHANGLWAVVDVSIGEGSNSN---SFFGCRRLPSGCVVQDMPNGFSKVTWVEHTEYDETVIHQLYR 458
                ***************************************************9965...8***************************************** PP

                HHHHHHHHHHHHHHHHHTXXXXX CS
      START 183 slvksglaegaktwvatlqrqce 205
                +l++sg  +g ++w+atlqrqc 
  Cla006197 459 QLISSGSGFGSQRWLATLQRQCD 481
                *********************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.51E-2039112IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.3E-2241112IPR009057Homeodomain-like
PROSITE profilePS5007117.52452112IPR001356Homeobox domain
SMARTSM003892.3E-2054116IPR001356Homeobox domain
CDDcd000862.45E-2055112No hitNo description
PfamPF000464.8E-1955110IPR001356Homeobox domain
PROSITE patternPS00027087110IPR017970Homeobox, conserved site
PROSITE profilePS5084838.987253485IPR002913START domain
SuperFamilySSF559619.06E-30254482No hitNo description
SMARTSM002341.1E-35262482IPR002913START domain
CDDcd088754.50E-112262480No hitNo description
PfamPF018525.6E-48263481IPR002913START domain
SuperFamilySSF559611.37E-18510743No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009827Biological Processplant-type cell wall modification
GO:0042335Biological Processcuticle development
GO:0043481Biological Processanthocyanin accumulation in tissues in response to UV light
GO:0048765Biological Processroot hair cell differentiation
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 749 aa     Download sequence    Send to blast
MDGYGEVCLL GDGFDPTGIV RIREDGYDSR SGSDNIDGAV SGDDQDANEE QPPKRKKYHR  60
HTPHQIQELE IFFKECPHPD DKQRSELSRR LGLETKQVKF WFQNRRTQMK TQIERHENAI  120
LKQENDKLRA ENSVMKDAIS DPTCNTCGGP SIPVHLSFEE HQLRIENARL RDELHRLYAV  180
TNKFLGWPVV PFANHGSSPS SDSCLELSVG RNGIGNLSTV SDSMGLNLGN ELFNAGPVMP  240
VSKPEIGMLS NDIPLERTIY VDLALAAMNE LVKMAQMDGP LWIRSRDGKE TLNLDEYSRT  300
FPSSAGMKHS NWTTEATRDT TMVIINSLAL VETLMDANRW AEMFPCLIAR ATTIDVISSG  360
MGGTRNGALQ LMHAELRVLS PLVPVRTLKF LRFCKQHANG LWAVVDVSIG EGSNSNSFFG  420
CRRLPSGCVV QDMPNGFSKV TWVEHTEYDE TVIHQLYRQL ISSGSGFGSQ RWLATLQRQC  480
DCLAILMSST IPSEDPAGIS PCGRRSMLKL SQRMVDNFCS GVCSSTLHKW DKLVVGNISE  540
DVKVMARKSI NDPGEPPGIV LSAATSVWMP VTQQRLFAFL QDECLRSEWD ILSNSRPMLE  600
MLRISKGQGP DNRVSLLRAN PMNANESTMF ILQETWTDVS GSLVVYAPVD TSSVNLVMRG  660
GDSAYVSLLP SGFAILPNGP SNYACTNDQD GSVKSVVNSG HGGGCLLTVA FQILVNSLPT  720
AKLTVESVET VNNLISCTIQ KIKTALQIS
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00421DAPTransfer from AT4G00730Download
Motif logo
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818480.0LN681848.1 Cucumis melo genomic scaffold, anchoredscaffold00006.
GenBankLN7132600.0LN713260.1 Cucumis melo genomic chromosome, chr_6.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011651140.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
RefseqXP_011651141.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0A0L5U50.0A0A0A0L5U5_CUCSA; Uncharacterized protein
STRINGXP_004157198.10.0(Cucumis sativus)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]