PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sme2.5_00014.1_g00019.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum
Family bHLH
Protein Properties Length: 477aa    MW: 53223.4 Da    PI: 7.3076
Description bHLH family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sme2.5_00014.1_g00019.1genomeEGDView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1HLH407e-13331377455
                              HHHHHHHHHHHHHHHHHHHHHCTSCC.C...TTS-STCHHHHHHHHHHHHHHH CS
                      HLH   4 ahnerErrRRdriNsafeeLrellPk.askapskKlsKaeiLekAveYIksLq 55 
                              +h e+Er+RR+++N++f  Lr ++P+ +      K++Ka+ L  A+ YI +Lq
  Sme2.5_00014.1_g00019.1 331 NHVEAERQRREKLNQRFYALRAVVPNiS------KMDKASLLGDAIAYITDLQ 377
                              799***********************66......*****************99 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF142153.9E-4449227IPR025610Transcription factor MYC/MYB N-terminal
PROSITE profilePS5088816.66327376IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamilySSF474594.19E-18327393IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
CDDcd000831.18E-13330381No hitNo description
PfamPF000102.1E-10331377IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene3DG3DSA:4.10.280.102.6E-17331395IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
SMARTSM003531.7E-15333382IPR011598Myc-type, basic helix-loop-helix (bHLH) domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0046983Molecular Functionprotein dimerization activity
Sequence ? help Back to Top
Protein Sequence    Length: 477 aa     Download sequence    Send to blast
MSEKLFLKGE DKVIMEEVLG SEAVEFFSWS ASNHMLTEFT SSRGDLGVQE ALCKIVEGSD  60
WTYAIYWQVA KSKSGKSALI WGDGHCREAK LGQSERGDDS RHQKMMDGNK KKMVLQKIHT  120
CFGGSEDDNI AAKLESVSDV EVFYLTSMYY IFPFDKASTP SQSFNSARSI WVSDLKGCVE  180
HFQSRSYLAK LARFETLVFV PLKSGVVELG SVKSIPEDQN LIQMVKTSVV VSNPPQPKTI  240
PKIFGRELSL GGAKSGPISI NFSPKVEDEL TFASDAYEVQ AALGTSQVYG SSSNGYRSDE  300
GEGKLYKEEL DERKPRKRGR KPANGREEAL NHVEAERQRR EKLNQRFYAL RAVVPNISKM  360
DKASLLGDAI AYITDLQARI RVLDAEKEMV GDKQKQQVIP EIDFHQRQDD AVVRVSCPLN  420
VHPVSRVLKT FQEHQVVAQE SNVSLTENSK LVHTFSIRAP GPAAEDLKEK LTAALSK
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4yz6_A2e-14482269189Transcription factor MYC3
4rs9_A2e-14482269189Transcription factor MYC3
4rqw_B2e-14482269189Transcription factor MYC3
4rqw_A2e-14482269189Transcription factor MYC3
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1312320RKPRKRGRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00099PBMTransfer from AT4G16430Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9755180.0HG975518.1 Solanum lycopersicum chromosome ch06, complete genome.
GenBankHG9754450.0HG975445.1 Solanum pennellii chromosome ch06, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006352746.10.0PREDICTED: transcription factor bHLH3
SwissprotO234871e-161BH003_ARATH; Transcription factor bHLH3
TrEMBLM1BS130.0M1BS13_SOLTU; Uncharacterized protein
STRINGPGSC0003DMT4000516710.0(Solanum tuberosum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA66692433
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G16430.11e-163bHLH family protein