PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID evm.model.supercontig_37.120
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Caricaceae; Carica
Family Trihelix
Protein Properties Length: 664 aa    MW: 72377.3 Da    pI: 5.9536
Description Trihelix family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
evm.model.supercontig_37.120 | genome | ASGPB | View CDS
Signature Domain
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | trihelix | 65.7 | 9.6e-21 | 2 | 69 | 18 | 87
                      trihelix 18 eerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87
                                  +  +r++  k+plWeevs+k++e g++rs k+Ckek+en++k+yk++keg+ +r++++s  +++f+qlea
  evm.model.supercontig_37.120  2 DAVFRDTTIKGPLWEEVSRKLAELGYNRSGKKCKEKFENVHKYYKRTKEGRAGRQDGKS--YKFFTQLEA 69
                                  566889999********************************************866665..*******85 PP

2 | trihelix | 109.4 | 2.2e-34 | 407 | 492 | 1 | 87
                      trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpy 81 
                                   rW+k evlaLi +r  +e+r++++  k+plWee+s+ m++ g++rs+k+Ckekwen+nk++kk+ke++kkr +e+ +tcpy
  evm.model.supercontig_37.120 407 RWPKAEVLALISIRGGLESRYQEAGPKGPLWEEISAGMQRMGYQRSAKRCKEKWENINKYFKKVKESNKKR-PEDAKTCPY 486
                                   8*********************************************************************8.99999**** PP

                      trihelix  82 fdqlea 87 
                                   f+ql+a
  evm.model.supercontig_37.120 487 FHQLDA 492
                                   ****85 PP
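
The Start and End columns in the Signature Domain table are 1-based and inclusive, which the alignment rows bear out (the first hit begins at residue 2, D, and ends at residue 69, A). The following is a minimal Python sketch of pulling a domain out of the protein string under that convention. The helper name `domain_slice` is hypothetical and not part of PlantTFDB; the constant holds only the first 70 residues copied from the Sequence section.

```python
# First 70 residues of evm.model.supercontig_37.120, copied from the Sequence section.
N_TERMINUS = (
    "MDAVFRDTTIKGPLWEEVSRKLAELGYNRSGKKCKEKFENVHKYYKRTKEGRAGRQDGKS"
    "YKFFTQLEAL"
)


def domain_slice(protein: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive), as used in the domain table."""
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"coordinates {start}-{end} fall outside 1-{len(protein)}")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return protein[start - 1:end]


# Hypothetical usage: the first trihelix hit, reported as Start 2, End 69.
first_domain = domain_slice(N_TERMINUS, 2, 69)
print(first_domain)
```

The same call with coordinates 407 and 492 would apply to the second hit once the full 664-position sequence is loaded.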

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
CDD | cd12203 | 9.19E-19 | 1 | 49 | No hit | No description
Pfam | PF13837 | 5.7E-15 | 8 | 70 | No hit | No description
PROSITE profile | PS50090 | 5.192 | 15 | 42 | IPR017877 | Myb-like domain
PROSITE profile | PS50090 | 6.516 | 400 | 464 | IPR017877 | Myb-like domain
Pfam | PF13837 | 4.1E-24 | 406 | 493 | No hit | No description
CDD | cd12203 | 1.48E-31 | 406 | 471 | No hit | No description
Sequence
Protein Sequence    Length: 664 aa
MDAVFRDTTI KGPLWEEVSR KLAELGYNRS GKKCKEKFEN VHKYYKRTKE GRAGRQDGKS  60
YKFFTQLEAL HGSSSTNLSA SPSMATAGAA TAHVPATLDV SPVSVGIPGP MPISSVRNIN  120
PPPQPFSLSN LPASSSAALN IHVFQPATVS GSAPPPPGAA AVPAHFGVSF SSNSSSSSPG  180
SDDDDDGDED DRFFAGEPSV GAAATSRKRK RSSQSSKRGS GDDDRMMEFF EGLMKQVMQK  240
QEAMQQSFLE AIEKREQDRM VREEAWKRQE MARLAREQEL MAQERAISAA RDAAIISFLQ  300
KITGQTIELP PQVPVPSAPP PQSAPSTLAT THIPVQAKPA NPPASLLQQP HHMRQTEKTV  360
TETVADKGQH SIASQALVTI PEQQRPVPRE QEDQIGASGS SEPPSSRWPK AEVLALISIR  420
GGLESRYQEA GPKGPLWEEI SAGMQRMGYQ RSAKRCKEKW ENINKYFKKV KESNKKRPED  480
AKTCPYFHQL DALYRQKTLG GSSTAGGGTS GGTSSISFTK QQQPETVPAP TIQLQVDRTD  540
VAAPESENKN EGSNAGAGEL QARSEGFTGN LFGEDREAGA TKKPEDIVNE LMEQQQRPLT  600
VDEYEKMEET DSDQNVEEEE GDEETEDEGK MSYKIEFQPQ NSGAPNGHGF HSQRLSGVHH  660
FIS*
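
The sequence block above is laid out in groups of ten residues, with a running position count at the end of each full row and a trailing `*` marking the stop codon. The following is a minimal Python sketch, assuming that layout, of turning such rows into a plain residue string. The function name `clean_sequence_rows` is hypothetical, and only the first and last rows are used as a demonstration.

```python
import re


def clean_sequence_rows(rows):
    """Join formatted sequence rows into one residue string.

    Assumes each row holds space-separated residue groups, optionally
    followed by a running position count, and that a trailing '*' marks
    the stop codon rather than a residue.
    """
    residues = []
    for row in rows:
        # Keep letters only: this drops spaces, position counts and '*'.
        residues.append(re.sub(r"[^A-Za-z]", "", row))
    return "".join(residues)


# First and last rows of the block above, used only as a small demonstration.
demo_rows = [
    "MDAVFRDTTI KGPLWEEVSR KLAELGYNRS GKKCKEKFEN VHKYYKRTKE GRAGRQDGKS  60",
    "FIS*",
]
demo = clean_sequence_rows(demo_rows)
print(len(demo), demo[-3:])
```

Going by the running counts, the rows as shown hold 663 letters followed by the stop marker, so the stated length of 664 appears to count the `*` as well.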
Cis-element
Source | Link
PlantRegMap | evm.model.supercontig_37.120
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | KU947702 | 2e-52 | KU947702.1 Toxicodendron vernicifluum microsatellite c24641 sequence.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_021900326.1 | 0.0 | trihelix transcription factor GTL1
TrEMBL | A0A061G9R2 | 0.0 | A0A061G9R2_THECC; Duplicated homeodomain-like superfamily protein, putative isoform 1
STRING | evm.model.supercontig_37.120 | 0.0 | (Carica papaya)
Orthologous Group
Lineage | Orthologous Group ID | Taxa Number | Gene Number
Malvids | OGEM6226 | 27 | 46
Representative plant | OGRP663 | 15 | 73
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT1G76890.2 | 3e-51 | Trihelix family protein