PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Mapoly0155s0008.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Marchantiophyta; Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia
Family Trihelix
Protein Properties Length: 662aa    MW: 72803 Da    PI: 8.5849
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Mapoly0155s0008.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix100.61.3e-31170254187
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                          rW++qe+laLi++r+em+ ++r++ lk+plWe+vs+k++e g++rs+k+Ckek+en++k+ykk+k+g+ +r++++s  +++f+qlea
  Mapoly0155s0008.1.p 170 RWPRQETLALIKIRSEMDANFRDSGLKGPLWEDVSRKLAELGYHRSSKKCKEKFENVHKYYKKTKDGRAGRQDGKS--YRFFSQLEA 254
                          8*********************************************************************866665..*******85 PP

2trihelix1062.6e-33476560186
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          rW+k ev+aLi++r++me+r++++  k+plWee+s  m+  g++r++k+Ckekwen+nk+++k+ke++kkr +++ +tcpyf+ql+
  Mapoly0155s0008.1.p 476 RWPKPEVHALIRLRSQMETRFQEAGPKGPLWEEISTGMACLGYNRNAKRCKEKWENINKYFRKTKETNKKR-PDNAKTCPYFHQLD 560
                          8*********************************************************************8.89999*******97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007174.8E-4167229IPR001005SANT/Myb domain
PfamPF138373.8E-22169255No hitNo description
CDDcd122039.23E-28169234No hitNo description
PROSITE profilePS500906.876169227IPR017877Myb-like domain
PROSITE profilePS500907.189469533IPR017877Myb-like domain
SMARTSM007170.0034473535IPR001005SANT/Myb domain
CDDcd122032.94E-30475540No hitNo description
PfamPF138372.5E-22475561No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0010192Biological Processmucilage biosynthetic process
GO:0044212Molecular Functiontranscription regulatory region DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 662 aa     Download sequence    Send to blast
MQAGGQYGVP QEFVVRSAPA HMFTLSHEMT LSPSTGHGHP HHLQHAQHHQ QHHHPQHQQH  60
HHPQHPQHSQ HAQHPQHPQQ QHPTHPHQQP HQQHAQHQQH QQQQQQQQQQ QQQQQQQQQQ  120
QLQQLGLGPD SPEAPSPVAS RPSPTSYKQR ESLGEDDGGE EEGRGTGGNR WPRQETLALI  180
KIRSEMDANF RDSGLKGPLW EDVSRKLAEL GYHRSSKKCK EKFENVHKYY KKTKDGRAGR  240
QDGKSYRFFS QLEALYGGGG GNNHNQQADG GSMMLTGGSG GAIGGGAEGN LSSQRPAENS  300
SGVQLSSDSE DDYDDPGDND EQDKSKKRKR KDGRLGSSKM MYIEGLVKKL MEKQEAMQRK  360
FLDAIERREQ DRLVREEAWK RQEMARMTRE HELRAQEHAL AATRDAALVA FLQKVTGQTL  420
QLPQIAPPPP LTVVVPEAHH GAGVGAGGAG VGAGVGGAGA DDHGGEKESF DPNSKRWPKP  480
EVHALIRLRS QMETRFQEAG PKGPLWEEIS TGMACLGYNR NAKRCKEKWE NINKYFRKTK  540
ETNKKRPDNA KTCPYFHQLD MLYRKGVLGN PSSKLNKLDD NSDELLDHPS QRGEDSGGHG  600
GQGGGGGAGG DASEILAMMP SGEGPVGPGA STSNGGATHF FSSPDNGSSG DRGSKKGLTS  660
R*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1325330KKRKRK
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00243DAPTransfer from AT1G76880Download
Motif logo
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieveRetrieve
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_024359363.10.0trihelix transcription factor GTL1-like
TrEMBLA0A2R6W4G10.0A0A2R6W4G1_MARPO; Uncharacterized protein
STRINGPP1S85_58V6.10.0(Physcomitrella patens)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
Representative plantOGRP6631573
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76880.13e-42Trihelix family protein