PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cucsa.363170.1
Common NameCsa_6G074030, LOC101222248
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family HD-ZIP
Protein Properties Length: 842aa    MW: 90136.3 Da    PI: 5.7946
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cucsa.363170.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox651.1e-20135190156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     +++ +++t++q++eLe++F+++++p++++r eL+++l L++rqVk+WFqNrR+++k
  Cucsa.363170.1 135 KKRYHRHTPQQIQELEAVFKECPHPDEKQRLELSRRLCLETRQVKFWFQNRRTQMK 190
                     688999***********************************************999 PP

2START211.23.8e-663435691206
                     HHHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
           START   1 elaeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                     ela++a++elvk+a+ +ep+W  s     e++n++e++++f++  +     + +ea+r+sg+v+ ++  lve+l+d++ +W e+++    + +t+
  Cucsa.363170.1 343 ELALAAMDELVKMAQTDEPLWIGSLeggrEILNQEEYMRTFTPCIGmkpngFVTEASRESGMVIINSLALVETLMDSN-RWAEMFPcmiaRTTTT 436
                     5899**************************************9988999*9***************************.**************** PP

                     EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--...-TTSEE-EESSEEEEEEEECTCEEEEEEEE CS
           START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppe..sssvvRaellpSgiliepksnghskvtwve 168
                     +vis+g      galqlm aelq+lsplvp R++ f+R+++q+ +g+w++vdvSvd  ++ p+   ss+  +++lpSg+++++++ng+skvtwve
  Cucsa.363170.1 437 DVISTGmggtrnGALQLMHAELQVLSPLVPvREVNFLRFCKQHAEGVWAVVDVSVDAMRETPTggGSSFGNCRRLPSGCVVQDMPNGYSKVTWVE 531
                     *************************************************************999******************************* PP

                     -EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
           START 169 hvdlkgrlphwllrslvksglaegaktwvatlqrqcek 206
                     h++++++++h+l+r+l++sg+ +ga++wv tlqrqce+
  Cucsa.363170.1 532 HAEYDDSQVHQLYRPLLSSGMGFGAQRWVTTLQRQCEC 569
                     ************************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466898.98E-21117192IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.604.5E-22122192IPR009057Homeodomain-like
PROSITE profilePS5007117.2132192IPR001356Homeobox domain
SMARTSM003896.3E-18133196IPR001356Homeobox domain
CDDcd000868.96E-19134192No hitNo description
PfamPF000462.9E-18135190IPR001356Homeobox domain
PROSITE patternPS000270167190IPR017970Homeobox, conserved site
PROSITE profilePS5084843.789334572IPR002913START domain
SuperFamilySSF559615.01E-33336569No hitNo description
CDDcd088754.96E-126338568No hitNo description
SMARTSM002344.9E-51343569IPR002913START domain
PfamPF018524.5E-58343569IPR002913START domain
SuperFamilySSF559612.28E-23597767No hitNo description
SuperFamilySSF559612.28E-23794834No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009827Biological Processplant-type cell wall modification
GO:0042335Biological Processcuticle development
GO:0043481Biological Processanthocyanin accumulation in tissues in response to UV light
GO:0048765Biological Processroot hair cell differentiation
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 842 aa     Download sequence    Send to blast
MSFGGFLDGG GGGGGGGGAR ILADLPYTNN STTNANNNPT GGIGGGGNMS SSAIAPPRLI  60
TQSLTKSMFN SPGLSLALTN MDGGPGDLAA RLPEGFEHNV GRRGREEEHE SRSGSDNMDG  120
GSGDDQDAAD NPPRKKRYHR HTPQQIQELE AVFKECPHPD EKQRLELSRR LCLETRQVKF  180
WFQNRRTQMK TQLERHENTL LRQENDKLRA ENMSIRDAMR NPICSNCGGP AIIGEISLEE  240
QQLRIENARL KDELDRVCAL AGKFLGRPIS SLANSIAPPL PSSSLELGVG SNGFGSLTMA  300
TSMPIGPDFG GGLSGNLAVV QAPARPTPGM GLDRSVERSM LLELALAAMD ELVKMAQTDE  360
PLWIGSLEGG REILNQEEYM RTFTPCIGMK PNGFVTEASR ESGMVIINSL ALVETLMDSN  420
RWAEMFPCMI ARTTTTDVIS TGMGGTRNGA LQLMHAELQV LSPLVPVREV NFLRFCKQHA  480
EGVWAVVDVS VDAMRETPTG GGSSFGNCRR LPSGCVVQDM PNGYSKVTWV EHAEYDDSQV  540
HQLYRPLLSS GMGFGAQRWV TTLQRQCECL AILMSSAVPI RDHTAITAGG RRSMLKLAQR  600
MTANFCAGVC ASTVHKWNKL NAGSVDEDVR VMTRKSVDDP GEPPGIVLSA ATSVWLPVSP  660
QRLFDFLRDE RLRSEWDILS NGGPMQEMAH IAKGQDHGNC VSLLRASAMN ANQSSMLILQ  720
ETCIDAAGSL VVYAPVDIPA MHVVMNGGDS AYVALLPSGF AIVPDGAVTG GLTATNGSSP  780
SGGEGPQSQR AAGGGSLLTV AFQILVNSLP TAKLTVESVE TVNNLISCTV QKIKAALQCE  840
T*
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00421DAPTransfer from AT4G00730Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6819250.0LN681925.1 Cucumis melo genomic scaffold, anchoredscaffold00052.
GenBankLN7132650.0LN713265.1 Cucumis melo genomic chromosome, chr_11.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004144982.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0A0KBG60.0A0A0A0KBG6_CUCSA; Uncharacterized protein
STRINGXP_004144982.10.0(Cucumis sativus)
STRINGXP_004168157.10.0(Cucumis sativus)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  3. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  4. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]