PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cla018100
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family Trihelix
Protein Properties Length: 271aa    MW: 32869.4 Da    PI: 9.3893
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cla018100genomeICuGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix73.43.8e-2332113286
   trihelix   2 WtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                W+  e+++L+++r  +++++++ k++  lW +v++km+++gf+r+ +qCk+kw+nl +ryk +++++ k    + +++p++d+l+
  Cla018100  32 WSVPETKELLAIRAALDRSFSEMKQNRMLWISVADKMKAKGFNRNDEQCKCKWKNLVTRYKGCETMDPKA--LK-QQFPFYDDLH 113
                99******************************************************************95..44.479*****98 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138374.1E-1931115No hitNo description
Gene3DG3DSA:1.10.10.604.5E-43187IPR009057Homeodomain-like
PROSITE profilePS500907.2823188IPR017877Myb-like domain
CDDcd122032.22E-163295No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 271 aa     Download sequence    Send to blast
MEDYCDLRRA ILDRRPNRIG VNAEAGDRFP PWSVPETKEL LAIRAALDRS FSEMKQNRML  60
WISVADKMKA KGFNRNDEQC KCKWKNLVTR YKGCETMDPK ALKQQFPFYD DLHAIFTARM  120
QNNWWIEAED RSGGSKWKTT ERFSSEDQDG NKEENDEDEA FGSSTKRKKG TKKGRWIRDQ  180
LEHKRKLKEI LKGFVEREME MERQWREAFR VREEERRLKE EEWRMKMEAI EREKMMMEIL  240
WREKEEKRRE REEERAEKRD ALISALLRSL T
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1166187RKKGTKKGRWIRDQLEHKRKLK
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankLN7132623e-70LN713262.1 Cucumis melo genomic chromosome, chr_8.
GenBankLN6818843e-70LN681884.1 Cucumis melo genomic scaffold, anchoredscaffold00086.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_011653700.11e-111PREDICTED: trihelix transcription factor GT-3b isoform X2
TrEMBLA0A0A0KZY41e-111A0A0A0KZY4_CUCSA; Uncharacterized protein
STRINGPOPTR_0006s10240.12e-72(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF17813385
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT2G38250.11e-42Trihelix family protein