PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cucsa.050000.1
Common Name Csa_7G429520, LOC101221600
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Cucumis
Family HD-ZIP
Protein Properties Length: 327 aa    MW: 36157 Da    pI: 5.5855
Description HD-ZIP family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
Cucsa.050000.1  genome  JGI     View CDS
Signature Domain
No.  Domain       Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox     57.4   2.5e-18  166    220  2          56
                     T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
        Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                     rk+ +++keq   Lee F+++++++ ++++ LAk+l+L  rqV vWFqNrRa+ k
  Cucsa.050000.1 166 RKKLRLSKEQSAFLEESFKEHNTLNPKQKQALAKQLNLRPRQVEVWFQNRRARTK 220
                     778899***********************************************98 PP

2    HD-ZIP_I/II  127.9  4.3e-41  166    256  1          92
     HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreelk 92 
                     +kk+rlskeq+++LEesF+e+++L+p++K++la++L+l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l+een+rL+ke +eLr +lk
  Cucsa.050000.1 166 RKKLRLSKEQSAFLEESFKEHNTLNPKQKQALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCETLTEENRRLQKELQELR-ALK 256
                     69*************************************************************************************9.555 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  1.1E-17   148    220  IPR009057    Homeodomain-like
SuperFamily      SSF46689          2.38E-18  159    223  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.054    162    222  IPR001356    Homeobox domain
SMART            SM00389           1.3E-15   164    226  IPR001356    Homeobox domain
CDD              cd00086           3.46E-16  166    223  No hit       No description
Pfam             PF00046           9.9E-16   166    220  IPR001356    Homeobox domain
PROSITE pattern  PS00027           0         197    220  IPR017970    Homeobox, conserved site
Pfam             PF02183           6.5E-11   222    256  IPR003106    Leucine zipper, homeobox-associated
SMART            SM00340           2.0E-24   222    265  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 327 aa
MELGLSLGDA PKPFRFVEKQ SQVPSPQRRL SRKDLGFCVD LSIGRSVAGD KEDDEDKPNQ  60
GEDESDGNED PPTQLDLLPH NPVPRNLTNP YQGFPWPSSE NDEGEDRTAS PNSAASSFQM  120
EFGLYGSGGN ISSRRDQMEN GVMNEVGESE RASSRASDED ENGCTRKKLR LSKEQSAFLE  180
ESFKEHNTLN PKQKQALAKQ LNLRPRQVEV WFQNRRARTK LKQTEVDCEY LKRCCETLTE  240
ENRRLQKELQ ELRALKTTNS LYMQLPATTL TMCPSCERVT SSSAASTVAA TEGVTKRSGL  300
AIGGGRPGSS SFPFSAKTQS HQSTAS*
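
The domain coordinates in the Signature Domain table can be checked against the sequence block directly. The following is a minimal Python sketch, assuming the Start/End columns are 1-based and inclusive; the helper names `parse_sequence_block` and `region` are hypothetical and not part of PlantTFDB.

```python
import re

# Sequence block copied from this record. The running position counters,
# the spaces and the trailing '*' (stop marker) are layout, not residues.
RAW = """
MELGLSLGDA PKPFRFVEKQ SQVPSPQRRL SRKDLGFCVD LSIGRSVAGD KEDDEDKPNQ  60
GEDESDGNED PPTQLDLLPH NPVPRNLTNP YQGFPWPSSE NDEGEDRTAS PNSAASSFQM  120
EFGLYGSGGN ISSRRDQMEN GVMNEVGESE RASSRASDED ENGCTRKKLR LSKEQSAFLE  180
ESFKEHNTLN PKQKQALAKQ LNLRPRQVEV WFQNRRARTK LKQTEVDCEY LKRCCETLTE  240
ENRRLQKELQ ELRALKTTNS LYMQLPATTL TMCPSCERVT SSSAASTVAA TEGVTKRSGL  300
AIGGGRPGSS SFPFSAKTQS HQSTAS*
"""


def parse_sequence_block(raw: str) -> str:
    """Keep only letters, dropping counters, whitespace and the stop marker."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, treating both as 1-based and inclusive."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError("coordinates fall outside the sequence")
    return seq[start - 1:end]


seq = parse_sequence_block(RAW)
homeobox = region(seq, 166, 220)   # Homeobox row of the Signature Domain table
hd_zip = region(seq, 166, 256)     # HD-ZIP_I/II row of the same table
print(len(seq))
print(homeobox)
```

Under that assumption, the 166-220 slice reproduces the Cucsa.050000.1 row of the Homeobox alignment. The parsed sequence has 326 residues, so the stated length of 327 aa appears to count the trailing `*`.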
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    164    170  TRKKLRL
2    214    222  RRARTKLKQ
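
As a cross-check, the two NLS motifs can be located in the protein sequence with a plain substring search. This is a minimal Python sketch; `find_motif` is a hypothetical helper, and the sequence is the record's protein sequence with the layout counters and the trailing `*` removed.

```python
# Protein sequence of Cucsa.050000.1 as listed in this record,
# without position counters, spaces or the trailing stop marker.
SEQ = (
    "MELGLSLGDAPKPFRFVEKQSQVPSPQRRLSRKDLGFCVDLSIGRSVAGDKEDDEDKPNQ"
    "GEDESDGNEDPPTQLDLLPHNPVPRNLTNPYQGFPWPSSENDEGEDRTASPNSAASSFQM"
    "EFGLYGSGGNISSRRDQMENGVMNEVGESERASSRASDEDENGCTRKKLRLSKEQSAFLE"
    "ESFKEHNTLNPKQKQALAKQLNLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCETLTE"
    "ENRRLQKELQELRALKTTNSLYMQLPATTLTMCPSCERVTSSSAASTVAATEGVTKRSGL"
    "AIGGGRPGSSSFPFSAKTQSHQSTAS"
)


def find_motif(seq: str, motif: str) -> list[tuple[int, int]]:
    """Return (start, end) for every exact match, 1-based and inclusive."""
    hits = []
    pos = seq.find(motif)
    while pos != -1:
        hits.append((pos + 1, pos + len(motif)))
        pos = seq.find(motif, pos + 1)  # step by one so overlaps are found
    return hits


for motif in ("TRKKLRL", "RRARTKLKQ"):
    print(motif, find_motif(SEQ, motif))
```

In the sequence as printed in this record, the motifs fall at 1-based positions 165-171 and 215-223, one higher than the Start and End values in the NLS table. This suggests the NLS table may use 0-based numbering, although the record itself does not state its convention.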
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681804  1e-172   LN681804.1 Cucumis melo genomic scaffold, anchoredscaffold00060.
GenBank  LN713255  1e-172   LN713255.1 Cucumis melo genomic chromosome, chr_1.
Annotation -- Protein
Source     Hit ID          E-value  Description
Refseq     XP_004135907.1  0.0      PREDICTED: homeobox-leucine zipper protein HOX11-like
Swissprot  P46665          7e-92    HAT14_ARATH; Homeobox-leucine zipper protein HAT14
TrEMBL     A0A0A0KBJ3      0.0      A0A0A0KBJ3_CUCSA; Uncharacterized protein
STRING     XP_004135907.1  0.0      (Cucumis sativus)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF2335              33           82
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G06710.1  3e-85    homeobox from Arabidopsis thaliana
Publications
  1. Ren Y, et al.
    An integrated genetic and cytogenetic map of the cucumber genome.
    PLoS ONE, 2009. 4(6): p. e5795
    [PMID:19495411]
  2. Guo S, et al.
    Transcriptome sequencing and comparative analysis of cucumber flowers with different sex types.
    BMC Genomics, 2010. 11: p. 384
    [PMID:20565788]
  3. Li Z, et al.
    RNA-Seq improves annotation of protein-coding genes in the cucumber genome.
    BMC Genomics, 2011. 12: p. 540
    [PMID:22047402]