PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cucsa.250870.1
Common NameCsa_3G180430, LOC101218216
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family HD-ZIP
Protein Properties Length: 784aa    MW: 86189.9 Da    PI: 5.9503
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cucsa.250870.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox66.63.2e-2187142156
                     TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     r+k +++t++q++eLe +F+++++p+ ++r eL+++lgL+++qVk+WFqNrR+++k
  Cucsa.250870.1  87 RKKYHRHTPHQIQELEIFFKECPHPDDKQRNELSRRLGLETKQVKFWFQNRRTQMK 142
                     7999*************************************************999 PP

2START175.82.6e-552955152205
                     HHHHHHHHHHHHHHHC-TT-EEEE.....EXCCTTEEEEEEESSS......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEE CS
           START   2 laeeaaqelvkkalaeepgWvkss.....esengdevlqkfeeskv.....dsgealrasgvvdmvlallveellddkeqWdetla....kaetl 82 
                     la++a++elvk+a+ + p+W +s+     e +n de+ ++f++s +      ++ea r++++v+ ++  lve+l+d + +W e+++    +a+t+
  Cucsa.250870.1 295 LALAAMNELVKMAQMDGPLWIRSRadsgkETLNLDEYSRTFPSSAGmkhtnWTTEATRDTTMVIINSLALVETLMDAN-RWAEMFPcliaRATTT 388
                     6899************************99***********99988**9999**************************.**************** PP

                     EEECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-E CS
           START  83 evissg......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehv 170
                     +vissg      galqlm ael +lsplvp R   f+R+++q+ +g w++vdvS+ + ++ +   s+  +++lpSg+++++++ng skvtwveh+
  Cucsa.250870.1 389 DVISSGmggtrnGALQLMHAELRVLSPLVPvRTLKFLRFCKQHADGLWAVVDVSIGEGSNSN---SFSGCKRLPSGCVVQDMPNGFSKVTWVEHT 480
                     **********************************************************9966...799999************************ PP

                     E--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
           START 171 dlkgrlphwllrslvksglaegaktwvatlqrqce 205
                     ++++ ++h+l+r+l++sg  +g ++w+atlqrqc 
  Cucsa.250870.1 481 EYDETVIHQLYRQLISSGSGFGSQRWLATLQRQCD 515
                     *********************************96 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466892.05E-2071144IPR009057Homeodomain-like
Gene3DG3DSA:1.10.10.601.3E-2273144IPR009057Homeodomain-like
PROSITE profilePS5007117.49184144IPR001356Homeobox domain
SMARTSM003894.3E-2086148IPR001356Homeobox domain
CDDcd000862.89E-2087144No hitNo description
PfamPF000467.9E-1987142IPR001356Homeobox domain
PROSITE patternPS000270119142IPR017970Homeobox, conserved site
PROSITE profilePS5084838.227285519IPR002913START domain
SuperFamilySSF559612.06E-28286516No hitNo description
CDDcd088752.27E-111294514No hitNo description
SMARTSM002344.0E-34294516IPR002913START domain
PfamPF018521.7E-47295515IPR002913START domain
SuperFamilySSF559616.18E-17544777No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 784 aa     Download sequence    Send to blast
MPGGAITDPS FLPYPIPSSM PSSSEFSSSL RKMDGYGEVC LLGDGFDPTG IVRIREDEYD  60
SRSGSDNIDG AVSGDDHDAN DEQPPKRKKY HRHTPHQIQE LEIFFKECPH PDDKQRNELS  120
RRLGLETKQV KFWFQNRRTQ MKTQIERHEN AILKQENDKL RAENSVMKDA ISNPTCSTCG  180
GPSIPVHLSF EEHQLRIENA RLREELHRLY AVTNKFLGWP VVPFANHGSS PSSDSCLELS  240
VGRNGPGNLS TVSDSMGLNL GNELFSAGSV MPISKPDIGM LTNDIPLERT IYVDLALAAM  300
NELVKMAQMD GPLWIRSRAD SGKETLNLDE YSRTFPSSAG MKHTNWTTEA TRDTTMVIIN  360
SLALVETLMD ANRWAEMFPC LIARATTTDV ISSGMGGTRN GALQLMHAEL RVLSPLVPVR  420
TLKFLRFCKQ HADGLWAVVD VSIGEGSNSN SFSGCKRLPS GCVVQDMPNG FSKVTWVEHT  480
EYDETVIHQL YRQLISSGSG FGSQRWLATL QRQCDCLAIL MSSTIPTEDP AGISPSGRRS  540
MLKLSQRMVD NFCSGVCSST LHKWDKLVVG NISEDVKVMA RKSINDPGEP PGIVLSAATS  600
VWMPVTQQRL FAFLQDECLR SEWDILSNSR PMLEMLRISK SQGPDNRVSL LCANPMNANE  660
NTMFILQETW TDVSGSLVVF APVDTSSVNL VMRGGDSAYV SLLPSGFAIL PIDQSNYACT  720
NGEDASVKSS INSGHGGACL LTVAFQILVN SLPTAKLTVE SVETVNHLIS CTIQKIKTAL  780
QVS*
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor involved in the regulation of the tissue-specific accumulation of anthocyanins and in cellular organization of the primary root. {ECO:0000269|PubMed:10402424}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN6818480.0LN681848.1 Cucumis melo genomic scaffold, anchoredscaffold00006.
GenBankLN7132600.0LN713260.1 Cucumis melo genomic chromosome, chr_6.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_004140784.10.0PREDICTED: homeobox-leucine zipper protein ANTHOCYANINLESS 2-like isoform X1
SwissprotQ0WV120.0ANL2_ARATH; Homeobox-leucine zipper protein ANTHOCYANINLESS 2
TrEMBLA0A0A0L5U50.0A0A0A0L5U5_CUCSA; Uncharacterized protein
STRINGXP_004157198.10.0(Cucumis sativus)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF119234106
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.10.0HD-ZIP family protein
Publications ? help Back to Top
  1. Duarte JM, et al.
    Expression pattern shifts following duplication indicative of subfunctionalization and neofunctionalization in regulatory genes of Arabidopsis.
    Mol. Biol. Evol., 2006. 23(2): p. 469-78
    [PMID:16280546]
  2. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  3. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  4. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]