PlantTFDB
Plant Transcription Factor Database
PlantRegMap/PlantTFDB v5.0
Transcription Factor Information
Basic Information
TF ID Solyc01g073910.2.1
Common Name LOC101250729
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family HD-ZIP
Protein Properties Length: 366aa    MW: 39963.4 Da    PI: 7.7691
Description HD-ZIP family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
Solyc01g073910.2.1  genome  ITAG    View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    Homeobox  55.6   8.8e-18  192    246  2          56
                         T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
            Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                         rk+ +++keq   Lee F+++++++ +++  LAk+l+L  rqV vWFqNrRa+ k
  Solyc01g073910.2.1 192 RKKLRLSKEQSAYLEESFKEHHTLNPKQKLALAKQLSLRPRQVEVWFQNRRARTK 246
                         778899***********************************************98 PP

2  HD-ZIP_I/II  126.8  9.5e-41  192  282  1  92
         HD-ZIP_I/II   1 ekkrrlskeqvklLEesFeeeekLeperKvelareLglqprqvavWFqnrRARtktkqlEkdyeaLkraydalkeenerLekeveeLreel 91 
                         +kk+rlskeq++ LEesF+e+++L+p++K +la++L l+prqv+vWFqnrRARtk+kq+E+d+e+Lkr++++l++en+rL+ke +eLr +l
  Solyc01g073910.2.1 192 RKKLRLSKEQSAYLEESFKEHHTLNPKQKLALAKQLSLRPRQVEVWFQNRRARTKLKQTEVDCEYLKRCCETLTDENRRLHKELQELR-AL 281
                         69*************************************************************************************9.55 PP

         HD-ZIP_I/II  92 k 92 
                         k
  Solyc01g073910.2.1 282 K 282
                         5 PP
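
The alignment rows can also be compared programmatically. As a rough illustration, the sketch below counts the columns in which the query residue matches the HMM consensus for the Homeobox hit. The two strings are copied from the Homeobox alignment above; the function name `consensus_identity` and the decision to compare case-insensitively are assumptions made for this example and are not part of PlantTFDB or HMMER.

```python
# Consensus and query rows copied from the Homeobox alignment (HMM 2-56, query 192-246).
hmm_consensus = "rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek"
query = "RKKLRLSKEQSAYLEESFKEHHTLNPKQKLALAKQLSLRPRQVEVWFQNRRARTK"


def consensus_identity(consensus, target):
    """Return (identical columns, compared columns), skipping gap columns.

    Hypothetical helper: case is ignored, so 'r' in the consensus matches 'R'.
    """
    if len(consensus) != len(target):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    compared = 0
    for c, t in zip(consensus, target):
        if c in ".-" or t in ".-":
            continue  # gap in either row: not a comparable column
        compared += 1
        if c.upper() == t.upper():
            identical += 1
    return identical, compared


identical, compared = consensus_identity(hmm_consensus, query)
print(f"{identical}/{compared} identical ({100 * identical / compared:.1f}%)")
```

For this hit the count comes to 26 of 55 columns, which agrees with the number of letters on the match line of the alignment.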

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
Gene3D           G3DSA:1.10.10.60  8.3E-18   166    249  IPR009057    Homeodomain-like
SuperFamily      SSF46689          1.33E-18  179    249  IPR009057    Homeodomain-like
PROSITE profile  PS50071           17.184    188    248  IPR001356    Homeobox domain
SMART            SM00389           1.9E-15   190    252  IPR001356    Homeobox domain
Pfam             PF00046           4.1E-15   192    246  IPR001356    Homeobox domain
CDD              cd00086           1.83E-15  192    249  No hit       No description
PROSITE pattern  PS00027           0         223    246  IPR017970    Homeobox, conserved site
Pfam             PF02183           5.9E-11   248    282  IPR003106    Leucine zipper, homeobox-associated
SMART            SM00340           6.3E-26   248    291  IPR003106    Leucine zipper, homeobox-associated
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
GO:0043565  Molecular Function  sequence-specific DNA binding
Sequence
Protein Sequence    Length: 366 aa
MELGLSLGDA NTSSSPKSFF LSKKTVISTC NDHEKKKNLG FCMALGINSS SERVQQDIED  60
EENNTSEEGT NNTLPVQLDL LPLVPLPNPP TNLPQSHHWS SDNGSSENGS SGNGGLPAAR  120
GFDVNRLPAA GMDEVSSPNS VASSFRMDFG LFKSCVNIVG VGNKRNSESA GGEPERASSR  180
ASDDDENGAN TRKKLRLSKE QSAYLEESFK EHHTLNPKQK LALAKQLSLR PRQVEVWFQN  240
RRARTKLKQT EVDCEYLKRC CETLTDENRR LHKELQELRA LKTSNPFYMQ LPATTLTMCP  300
SCERVASTTT TTSAATAPPI TATTATTTTS DSIPKAIPFL NSRPRFFPFT ATNNNPNHSH  360
QSAAS*
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    190    196  TRKKLRL
2    240    248  RRARTKLKQ
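
The coordinates listed in this record can be checked against the protein sequence. The following sketch uses only Python's standard library to strip the display formatting from the sequence and extract the Homeobox domain and both NLS entries. The helper names are hypothetical. One assumption is worth flagging: the Signature Domain coordinates reproduce the aligned residues when read as 1-based inclusive positions, whereas the NLS coordinates reproduce the listed peptides only when read as 0-based inclusive positions, so the two slicing helpers differ by one.

```python
import re

# Protein sequence as displayed in the Sequence section: blocks of ten residues,
# a running position counter at the end of each line, and '*' for the stop codon.
RAW = """
MELGLSLGDA NTSSSPKSFF LSKKTVISTC NDHEKKKNLG FCMALGINSS SERVQQDIED  60
EENNTSEEGT NNTLPVQLDL LPLVPLPNPP TNLPQSHHWS SDNGSSENGS SGNGGLPAAR  120
GFDVNRLPAA GMDEVSSPNS VASSFRMDFG LFKSCVNIVG VGNKRNSESA GGEPERASSR  180
ASDDDENGAN TRKKLRLSKE QSAYLEESFK EHHTLNPKQK LALAKQLSLR PRQVEVWFQN  240
RRARTKLKQT EVDCEYLKRC CETLTDENRR LHKELQELRA LKTSNPFYMQ LPATTLTMCP  300
SCERVASTTT TTSAATAPPI TATTATTTTS DSIPKAIPFL NSRPRFFPFT ATNNNPNHSH  360
QSAAS*
"""


def clean_sequence(raw):
    """Drop whitespace, position counters and the stop marker."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq, start, end):
    """Inclusive 1-based coordinates, as the Signature Domain table appears to use."""
    return seq[start - 1:end]


def slice_0based(seq, start, end):
    """Inclusive 0-based coordinates (ASSUMPTION inferred from the NLS table)."""
    return seq[start:end + 1]


seq = clean_sequence(RAW)
homeobox = slice_1based(seq, 192, 246)  # Homeobox row of the Signature Domain table
nls1 = slice_0based(seq, 190, 196)      # NLS entry 1
nls2 = slice_0based(seq, 240, 248)      # NLS entry 2

print(len(seq))
print(homeobox)
print(nls1, nls2)
```

The cleaned sequence contains 365 residues, so the stated length of 366 seems to include the terminal `*`.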
Functional Description
Source   Description
UniProt  Probable transcription factor. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Solyc01g073910.2.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID              E-value  Description
Refseq     XP_004229348.1      0.0      homeobox-leucine zipper protein HOX11-like
Swissprot  P46665              9e-91    HAT14_ARATH; Homeobox-leucine zipper protein HAT14
TrEMBL     A0A3Q7F0M4          0.0      A0A3Q7F0M4_SOLLC; Uncharacterized protein
STRING     Solyc01g073910.2.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Asterids OGEA29024186
Representative plant OGRP19616156
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G06710.1  2e-75    homeobox from Arabidopsis thaliana
Publications
  1. Wang Y, van der Hoeven RS, Nielsen R, Mueller LA, Tanksley SD
    Characteristics of the tomato nuclear genome as determined by sequencing undermethylated EcoRI digested fragments.
    Theor. Appl. Genet., 2005. 112(1): p. 72-84
    [PMID:16208505]