PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information
TF ID Mapoly0061s0069.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Marchantiophyta; Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia
Family MYB
Protein Properties Length: 1262 aa    MW: 140904 Da    pI: 6.1468
Description MYB family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Mapoly0061s0069.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain           Score  E-value  Start  End  HMM Start  HMM End
1    Myb_DNA-binding  34     6.6e-11  434    479  1          46
                          TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
      Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                          + +WT+ Ed++l+   ++  + +W++Ia+++g++Rt+ +c  r+q 
  Mapoly0061s0069.1.p 434 KSSWTKTEDKKLLSIAQRNKTTNWEKIAQELGTNRTPAECLTRYQR 479
                          578*****************************************96 PP

2    Myb_DNA-binding  42.8   1.2e-13  487    533  1          48
                          TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
      Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                          r  WT+eEd +l  av++lG  +W+++a+++  gR   qc  rw+k+l
  Mapoly0061s0069.1.p 487 RSVWTPEEDAKLRAAVEELGESDWSLVAACLE-GRNNSQCLMRWYKVL 533
                          678*****************************.************986 PP

3    Myb_DNA-binding  52.6   1e-16    540    586  1          48
                          TSSS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHHHT CS
      Myb_DNA-binding   1 rgrWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqkyl 48 
                          +grW+ eEd++l  av ++G + Wk Ia++++ gRt+ qc++rw ++l
  Mapoly0061s0069.1.p 540 KGRWSVEEDKRLNWAVSLHGARKWKQIANHVP-GRTDIQCRERWCNVL 586
                          79******************************.***********9975 PP

4    Myb_DNA-binding  37.2   6.6e-12  594    635  3          45
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHH CS
      Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwq 45 
                           WT+eEd+ l + v+++G+++W++Ia  +   Rt+kqc  rw 
  Mapoly0061s0069.1.p 594 TWTEEEDQTLRESVALHGPHRWSAIATDLK-VRTDKQCWRRWK 635
                          7****************************9.9*********95 PP
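
Comparing the alignment rows with the sequence listing suggests that the Start and End columns are 1-based, inclusive residue positions. The sketch below checks this for domain 1 (434 to 479) against residues 421 to 480 copied from the sequence listing. The helper name `slice_1based` and its `offset` parameter are illustrative assumptions, not part of PlantTFDB.

```python
def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the protein position of seq[0]; it lets a fragment of the
    full sequence be sliced using whole-protein coordinates.
    """
    return seq[start - offset : end - offset + 1]


# Residues 421-480 of Mapoly0061s0069.1.p, taken from the sequence listing.
fragment = "RTRWLNHEDPLINKSSWTKTEDKKLLSIAQRNKTTNWEKIAQELGTNRTPAECLTRYQRS"

# Domain 1 is listed with Start = 434 and End = 479.
domain1 = slice_1based(fragment, 434, 479, offset=421)
print(domain1)
```

The printed string can be compared with the Mapoly0061s0069.1.p row of the first alignment. With the full 1262 aa sequence in hand, the same call works with the default `offset=1`.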

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           3.8E-4    335    430  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  9.1E-12   338    354  IPR009057    Homeodomain-like
CDD              cd00167           0.00113   391    427  No hit       No description
Gene3D           G3DSA:1.10.10.60  9.1E-12   401    435  IPR009057    Homeodomain-like
PROSITE profile  PS50090           6.214     403    428  IPR017877    Myb-like domain
SuperFamily      SSF46689          1.35E-16  411    478  IPR009057    Homeodomain-like
PROSITE profile  PS51294           15.835    429    485  IPR017930    Myb domain
SMART            SM00717           1.2E-9    433    483  IPR001005    SANT/Myb domain
Pfam             PF00249           2.3E-10   434    479  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  1.0E-11   436    483  IPR009057    Homeodomain-like
SuperFamily      SSF46689          4.38E-18  467    537  IPR009057    Homeodomain-like
SMART            SM00717           2.1E-13   486    535  IPR001005    SANT/Myb domain
Pfam             PF00249           3.5E-11   487    533  IPR001005    SANT/Myb domain
Gene3D           G3DSA:1.10.10.60  2.7E-16   487    536  IPR009057    Homeodomain-like
PROSITE profile  PS51294           11.766    487    534  IPR017930    Myb domain
CDD              cd00167           1.70E-11  490    533  No hit       No description
PROSITE profile  PS51294           26.661    535    590  IPR017930    Myb domain
Gene3D           G3DSA:1.10.10.60  6.5E-19   539    585  IPR009057    Homeodomain-like
SMART            SM00717           2.7E-15   539    588  IPR001005    SANT/Myb domain
Pfam             PF00249           1.6E-14   540    586  IPR001005    SANT/Myb domain
CDD              cd00167           2.42E-12  542    586  No hit       No description
SuperFamily      SSF46689          9.71E-24  566    639  IPR009057    Homeodomain-like
Gene3D           G3DSA:1.10.10.60  2.8E-16   586    635  IPR009057    Homeodomain-like
PROSITE profile  PS51294           17.109    591    642  IPR017930    Myb domain
SMART            SM00717           8.0E-11   591    640  IPR001005    SANT/Myb domain
Pfam             PF00249           2.8E-11   594    635  IPR001005    SANT/Myb domain
CDD              cd00167           6.14E-11  594    635  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1262 aa
MVIHAGDDSE EEEDDEDIAM EEGFQGHLDA LKRACLLNIG QDSFLTTGAG SLPDTASTLE  60
EDDEELFESI QKKYFLGSLD KVSIPQGRPS SNSDDEDEEQ DDEEILRSVE KLYSVTELSR  120
QISNEVDENA VFLEVRIDSP VHAGSEAPDE SSSRPEDSDK DVTHPVKDLG RAVSTLLPSN  180
WNEEEDPEFN PPSPNSAVVA YDPYKHHTDE LLCEDYGNYG SEDKEKMIGE ALQKNLSYQQ  240
NLRNILNRIE SKQKHNAEMQ RRVRTLLDFE KYCKRKYAVF FTDQANPSLK LFSVGRGGYQ  300
SRNDDKSDYH PGGPPDNADV EVYKMLKQKA TFASMSRRWR AEELRELAKG VKQQIYEVLV  360
CEAMDAISNG DIIGDNALEN SLSAIRNKEL TPEEMRNAIA DVNWNEVARV YVVNRTPEEC  420
RTRWLNHEDP LINKSSWTKT EDKKLLSIAQ RNKTTNWEKI AQELGTNRTP AECLTRYQRS  480
LNASIMRSVW TPEEDAKLRA AVEELGESDW SLVAACLEGR NNSQCLMRWY KVLHPRKQKK  540
GRWSVEEDKR LNWAVSLHGA RKWKQIANHV PGRTDIQCRE RWCNVLNPDI KLDTWTEEED  600
QTLRESVALH GPHRWSAIAT DLKVRTDKQC WRRWKILFPE EQPDYKRTVF IQRNALIGNF  660
QGRKKERPKL GPQDFVSKAD TFGVPSNIVR ETKSKRRTSS RSAEERTTSR SAKKRTNSRS  720
AKKSSVSVPA DECGRNSSTP TTAASIQRDS AHSSQDQQDS TNHEGQDLTI YGERGTITSE  780
STVIGALEEV SVNGNDTLAD SKLRRLARIH KLKLSRANKL ANSANLVLPR SAQSTRSTPD  840
NQSEYGQNEV TDPSSSAQVQ IEKKKGRKRS RTEGSDGLGN PSKRPRKISS KRTSDVLECD  900
IQSSSQLSEV SPHLSSEETE PSSSGGLLAV TDGAEQGVTN GGTAEVICKS KKRSRVSACP  960
QPRRVSKRRR PDNMCLPSRD SDIQTAAEEP PVVRPEPGPA ILSLVDLVHS INEFCKYTFL  1020
LNEEQHVQAL PDPDVSDTAV RVGVEDITSG SSALGPQLCP DPVQERYLRL VHDHDTPFVD  1080
TESIPANGHG CRTERRMVAS KPTAGGACDA LWCPNSGVNR IAVRSDGEGE TRLITHANTE  1140
SNMGGVQDGP SRLSPEDEEV AAATKAILSW PFFYGVQQLT LRADEYLNSK APRKSSTARP  1200
PKSTKKGKSL GTSETIVETQ PVAVPTPVTA VDNPVSKRPR SARCAARMRG TSRERTSEQD  1260
S*
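
The listing above groups residues in blocks of ten, ends each full line with a running position counter, and closes with `*`. A minimal sketch for turning such a block into one contiguous string follows, assuming exactly that layout. The function name `parse_sequence_block` is hypothetical, and only the first two lines of the listing are used as sample input.

```python
import re


def parse_sequence_block(block: str) -> str:
    """Join a grouped, counter-annotated sequence block into one string."""
    parts = []
    for line in block.splitlines():
        # Drop the running position counter at the end of the line, if any.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the 10-residue groups.
        parts.append("".join(line.split()))
    # Treat a trailing "*" as a terminator rather than a residue (assumption).
    return "".join(parts).rstrip("*")


# First two lines of the listing, copied verbatim.
example = (
    "MVIHAGDDSE EEEDDEDIAM EEGFQGHLDA LKRACLLNIG QDSFLTTGAG SLPDTASTLE  60\n"
    "EDDEELFESI QKKYFLGSLD KVSIPQGRPS SNSDDEDEEQ DDEEILRSVE KLYSVTELSR  120"
)

seq = parse_sequence_block(example)
print(len(seq))
```

Checking the length of the result against the counter on the last parsed line is a quick way to catch copy errors.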
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
1h88_C  2e-28    403          634        2          155      MYB PROTO-ONCOGENE PROTEIN
1h89_C  2e-28    403          634        2          155      MYB PROTO-ONCOGENE PROTEIN
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    950    969  KKRSRVSACPQPRRVSKRRR
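
When the NLS is located in the sequence listing, the motif falls on residues 951 to 970 under 1-based counting, so the Start and End values listed here appear to be 0-based. The sketch below shows the check on residues 941 to 970 copied from the listing. The helper `locate` is a hypothetical name, and the 0-based reading is an inference from this one entry, not documented behaviour.

```python
def locate(fragment: str, motif: str, first_pos: int):
    """Return (start, end) of motif as 1-based inclusive protein positions.

    `first_pos` is the 1-based protein position of fragment[0].
    Returns None when the motif is not present.
    """
    idx = fragment.find(motif)
    if idx < 0:
        return None
    start = first_pos + idx
    return start, start + len(motif) - 1


# Residues 941-970 of the protein, taken from the sequence listing.
fragment = "GGTAEVICKSKKRSRVSACPQPRRVSKRRR"
nls = "KKRSRVSACPQPRRVSKRRR"

one_based = locate(fragment, nls, first_pos=941)
# Shift by one to express the same span with 0-based indices.
zero_based = (one_based[0] - 1, one_based[1] - 1)
print(one_based, zero_based)
```

The 0-based pair matches the Start and End columns, whereas the signature domain coordinates match the listing under 1-based counting, so the two tables should not be mixed without converting.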
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Representative plant  OGRP73551617
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G18100.2  1e-103   myb domain protein 4r1