PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID evm.model.supercontig_2611.1
Organism Carica papaya
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Caricaceae; Carica
Family Trihelix
Protein Properties Length: 481 aa    MW: 55761.3 Da    pI: 5.6383
Description Trihelix family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
evm.model.supercontig_2611.1 | genome | ASGPB | View CDS
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | trihelix | 91.7 | 7.7e-29 | 40 | 124 | 1 | 87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+++e+laL+++r++m+ ++r++  k+plWeevs+k++e g++rs+k+Ckek+en+ k+++++ke +++r   +s+t+++
  evm.model.supercontig_2611.1  40 RWPREETLALLKIRSDMDAAFRDSGHKAPLWEEVSRKLSELGYNRSAKKCKEKFENIFKYHRRTKECRSGR--SNSKTYRF 118
                                   8********************************************************************97..56678*** PP

                      trihelix  82 fdqlea 87 
                                   f+qlea
  evm.model.supercontig_2611.1 119 FEQLEA 124
                                   ****85 PP

2 | trihelix | 102.9 | 2.4e-32 | 316 | 401 | 1 | 87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+k+e+ aLi++r ++++++ +++ k+plWee+s++m++ g+ r++k+Ckekwen+nk++kk+ke++kkr se+s+tcpy
  evm.model.supercontig_2611.1 316 RWPKEEIEALIKLRANLDTQYDESAPKGPLWEEISAAMKKLGYDRNAKRCKEKWENMNKYFKKVKESNKKR-SEDSKTCPY 395
                                   8*********************************************************************8.89999**** PP

                      trihelix  82 fdqlea 87 
                                   f+ql+a
  evm.model.supercontig_2611.1 396 FQQLDA 401
                                   ****85 PP

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
SMART | SM00717 | 9.5E-4 | 37 | 99 | IPR001005 | SANT/Myb domain
CDD | cd12203 | 6.78E-24 | 39 | 104 | No hit | No description
PROSITE profile | PS50090 | 7.457 | 39 | 97 | IPR017877 | Myb-like domain
Pfam | PF13837 | 7.4E-20 | 39 | 125 | No hit | No description
Gene3D | G3DSA:1.10.10.60 | 6.2E-6 | 313 | 373 | IPR009057 | Homeodomain-like
SMART | SM00717 | 3.8E-5 | 313 | 375 | IPR001005 | SANT/Myb domain
SuperFamily | SSF46689 | 4.44E-5 | 314 | 375 | IPR009057 | Homeodomain-like
CDD | cd12203 | 5.82E-25 | 315 | 380 | No hit | No description
Pfam | PF13837 | 2.8E-22 | 315 | 402 | No hit | No description
PROSITE profile | PS50090 | 7.735 | 315 | 373 | IPR017877 | Myb-like domain
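The ten hits in the Protein Features table cluster around two stretches of the protein. As a minimal sketch, the Start/End pairs can be collapsed into non-overlapping regions with a standard interval merge. The function name `merge_intervals` and the `hits` list are illustrative and not part of PlantTFDB; the coordinates are transcribed from the table above.

```python
def merge_intervals(intervals):
    """Merge overlapping (start, end) pairs into non-overlapping regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# (Start, End) of each hit, in table order: SMART, CDD, PROSITE, Pfam,
# Gene3D, SMART, SuperFamily, CDD, Pfam, PROSITE.
hits = [
    (37, 99), (39, 104), (39, 97), (39, 125),
    (313, 373), (313, 375), (314, 375), (315, 380), (315, 402), (315, 373),
]

regions = merge_intervals(hits)
print(regions)  # [(37, 125), (313, 402)]
```

The two merged regions bracket the two trihelix signature domains reported at 40-124 and 316-401.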
Gene Ontology
GO Term | GO Category | GO Description
GO:0003677 | Molecular Function | DNA binding
Sequence
Protein Sequence    Length: 481 aa
MENSTSVTDN RDTEEAADRL GQEEGRVKVE ESDRYFPGNR WPREETLALL KIRSDMDAAF  60
RDSGHKAPLW EEVSRKLSEL GYNRSAKKCK EKFENIFKYH RRTKECRSGR SNSKTYRFFE  120
QLEALDNHNS LYMPLPLDSV QTSMPPPINT VPTNISHVNF AHDPIPCSIK NPTTNYADTS  180
PSTSFSSKES DGMHKKKRKL AEFFEGLMRK VMEKQENLQK KFVEAIEKFE QDRIAREEAW  240
KMQELARIKR ERELLVQERS VVAAKDAAVL AFLQKFSEQT NSVQMPQIPV AVEKVVERQE  300
DGDGGENFIH QMSSSRWPKE EIEALIKLRA NLDTQYDESA PKGPLWEEIS AAMKKLGYDR  360
NAKRCKEKWE NMNKYFKKVK ESNKKRSEDS KTCPYFQQLD ALYKEKTRKP DSSVSSGYNM  420
KPEELLMHMM DSQEEQRPDL HVEDGESESA DQSEEHRDNE DEGNNYQIGA DNSFSMEIIE  480
*
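The sequence is listed in groups of 10 residues, 60 per row, with a running position counter at the end of each row and a final `*`. A minimal sketch for turning such a block into one contiguous string follows. The helper name `parse_blocked_sequence` is hypothetical, and the sample uses only the first row of this record. Note that the rows end at counter 480, so the stated length of 481 appears to count the terminal `*`; that is an inference from the listing, not something the page states.

```python
def parse_blocked_sequence(block: str) -> str:
    """Join 10-residue groups into one string, dropping counters and '*'."""
    residues = []
    for line in block.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # running position counter at the end of a row
            residues.append(token.rstrip("*"))  # '*' is the terminal marker
    return "".join(residues)


# First row of the record, followed by the terminal '*' row.
sample = """MENSTSVTDN RDTEEAADRL GQEEGRVKVE ESDRYFPGNR WPREETLALL KIRSDMDAAF  60
*"""

sequence = parse_blocked_sequence(sample)
print(len(sequence))    # 60
print(sequence[39:50])  # RWPREETLALL
```

The second print shows the residues at 1-based positions 40 to 50, which is where the first trihelix domain alignment begins.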
Nuclear Localization Signal
NLS
No. | Start | End | Sequence
1 | 194 | 199 | KKKRKL
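When slicing the protein sequence with the coordinates in this record, the indexing convention matters. In the sketch below, the first 200 residues are transcribed from the Protein Sequence listing, and `locate` is a hypothetical helper. For this record, the NLS Start/End values line up with 0-based inclusive indices, whereas the signature domain Start of 40 lines up with 1-based numbering. This is an observation from the data shown here, not documented PlantTFDB behaviour.

```python
# First 200 residues of the protein sequence listed in this record.
prefix = (
    "MENSTSVTDN RDTEEAADRL GQEEGRVKVE ESDRYFPGNR WPREETLALL KIRSDMDAAF "
    "RDSGHKAPLW EEVSRKLSEL GYNRSAKKCK EKFENIFKYH RRTKECRSGR SNSKTYRFFE "
    "QLEALDNHNS LYMPLPLDSV QTSMPPPINT VPTNISHVNF AHDPIPCSIK NPTTNYADTS "
    "PSTSFSSKES DGMHKKKRKL"
).replace(" ", "")


def locate(motif, sequence):
    """Return (start, end) of the first match as 0-based inclusive indices."""
    i = sequence.find(motif)
    if i < 0:
        return None
    return i, i + len(motif) - 1


print(locate("KKKRKL", prefix))  # (194, 199), same as the NLS table
print(prefix[194:200])           # KKKRKL
print(prefix[40 - 1:50])         # RWPREETLALL, 1-based positions 40..50
```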
Functional Description
Source | Description
UniProt | Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element
Source | Link
PlantRegMap | evm.model.supercontig_2611.1
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_021887620.1 | 0.0 | trihelix transcription factor GT-2-like isoform X1
Swissprot | Q39117 | 1e-118 | TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL | A0A1R3GCN9 | 0.0 | A0A1R3GCN9_COCAP; Uncharacterized protein
STRING | evm.model.supercontig_2611.1 | 0.0 | (Carica papaya)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
MalvidsOGEM59952847
Representative plantOGRP6631573
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT1G76880.1 | 1e-131 | Trihelix family protein