PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Bathy05g04420
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; prasinophytes; Mamiellophyceae; Mamiellales; Bathycoccaceae; Bathycoccus
Family WOX
Protein Properties Length: 1524 aa    MW: 173142 Da    pI: 4.5385
Description WOX family protein
Gene Model
Gene Model ID  Type    Source
Bathy05g04420  genome  ORCAE
Signature Domain
No.  Domain             Score  E-value  Start  End   HMM Start  HMM End
1    Myb_DNA-binding    25.4   3.2e-08  991    1040  2          48
                       SSS-HHHHHHHHHHHHHTTTT-HHHHHHHHT....TTS-HHHHHHHHHHHT CS
  Myb_DNA-binding    2 grWTteEdellvdavkqlGggtWktIartmg....kgRtlkqcksrwqkyl 48  
                       ++W  +E + l+++v + Ggg W+ I + +g    ++Rt+ ++k++w+++l
    Bathy05g04420  991 NPWGLDETQALIEGVSRCGGGKWADIKK-LGfpeiEHRTAVDLKDKWRTLL 1040
                       68***********************966.5556999************986 PP

2    Homeobox           43.7   4.5e-14  1399   1459  2          57
                     T--SS--HHHHHHHHHHHHH.SSS--HHHHHHHHHHC....TS-HHHHHHHHHHHHHHHHC CS
       Homeobox    2 rkRttftkeqleeLeelFek.nrypsaeereeLAkkl....gLterqVkvWFqNrRakekk 57  
                     + R++ t+ qle+Le+lF++ + +p  e+ ++ +++l     ++e++V++WFqN++ + kk
  Bathy05g04420 1399 KVRWQRTTAQLERLEQLFANdTTTPRGEKLKQVTEELsalgPIQECNVFNWFQNKKSRLKK 1459
                     67*****************99*****************9999****************997 PP

3    Wus_type_Homeobox  60.8   3e-20    1399   1459  3          63
  Wus_type_Homeobox    3 rtRWtPtpeQikiLeelyksGlrtPnkeeiqritaeLeeyGkiedkNVfyWFQNrkaRerq 63  
                         + RW  t+ Q++ Le+l+ + + tP  e+++++t+eL++ G i++ NVf+WFQN+k+R ++
      Bathy05g04420 1399 KVRWQRTTAQLERLEQLFANDTTTPRGEKLKQVTEELSALGPIQECNVFNWFQNKKSRLKK 1459
                         67********************************************************875 PP

Protein Features
3D Structure
Database         Entry ID          E-value   Start  End   InterPro ID  Description
SuperFamily      SSF56281          8.39E-17  10     177   IPR001279    Metallo-beta-lactamase
Gene3D           G3DSA:3.60.15.10  1.4E-16   13     178   IPR001279    Metallo-beta-lactamase
SMART            SM00849           2.7E-4    13     220   IPR001279    Metallo-beta-lactamase
Pfam             PF07522           2.2E-12   354    459   IPR011084    DNA repair metallo-beta-lactamase
PROSITE profile  PS51294           12.776    985    1044  IPR017930    Myb domain
Gene3D           G3DSA:1.10.10.60  2.7E-11   987    1042  IPR009057    Homeodomain-like
SMART            SM00717           1.5E-6    989    1042  IPR001005    SANT/Myb domain
Pfam             PF00249           4.7E-6    991    1040  IPR001005    SANT/Myb domain
SuperFamily      SSF46689          7.1E-14   991    1049  IPR009057    Homeodomain-like
CDD              cd11660           1.02E-17  992    1041  No hit       No description
SuperFamily      SSF109715         3.27E-5   1184   1243  No hit       No description
Pfam             PF08766           2.7E-7    1191   1243  IPR014876    DEK, C-terminal
PROSITE profile  PS50071           11.353    1395   1460  IPR001356    Homeobox domain
SMART            SM00389           6.1E-5    1397   1464  IPR001356    Homeobox domain
SuperFamily      SSF46689          4.71E-10  1397   1463  IPR009057    Homeodomain-like
CDD              cd00086           9.10E-4   1399   1461  No hit       No description
Pfam             PF00046           1.0E-11   1399   1459  IPR001356    Homeobox domain
Gene3D           G3DSA:1.10.10.60  9.6E-7    1399   1464  IPR009057    Homeodomain-like
Gene Ontology
GO Term     GO Category         GO Description
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1524 aa
MNDRGGGFGG HNTTTSHALG SGGGALLSEC TIRVENRFEI RVDTWGKKNA FSTNNNDNIL  60
HQIPTFYFLT HMHQDHLRGL REDTFENDNN GRIYCTEITN ILLVKRFPRL ESKVKVLEFD  120
SVEVVEVVSN KKNTKEEDLR FNVYCLDAGH CPGSAMFVFE GTFGKVLHTG DFRREDWSGS  180
LPSGKRMSLP TPGSNGSGSN NGLRKNFFSM LNDVPLSAMW SSSSPSLTTG SQDLGYDNNN  240
NTTLKNATPL PKVLDNSKTQ AVDVLYLDNT YNNPEYDHPP RAVALERIVK LVTEIEPERP  300
VILLLDSLGK EDIVIALSQA TKSKVYLHKD RYNDWLKLGF EKEYVCNSLG ENESTRVRVL  360
PKAMGRHKEN VCGPLVAGLK NFKEWGPLVI SPTGWARVTE QMREEDEKQE MDTERREKMT  420
NEEKTDWVRR SVPYSLHSSY SELETFVKRI QPRKIIGNTR DDTVKEAERK GLSEVRSVHV  480
RSLKTFLRPS SQSEENLVLK AVFETERRRE QATATTTKMN TTKLSVNSPR DDISDVRLLS  540
ASSPRGVKGD VKETLHKMRI VSPSKPATKY ALANKKLPLR LQRFQQKERE NDATKPQERE  600
EEEEEEEEEE EEEEEEEEEE EKEEETPEEE TPEEISVLPL PPNRRRLPLP KSLTTTRDVI  660
LHSSEEDLYI KSNLKSQPSV RGPSKEQPVN RIPLVNNQKS DGFDVLNAYL EQRMRQKEEE  720
RLERERQQEL GDVNADADIE DEDDEEEEEV EEKEEEEEEE EEEEEEEEDA DDDIVPTQSA  780
DGAIGHGNEE KEVTATLASP SKWFSSIIGP VKRLFSRNGE KKTPLEEKLP EPRVKEEEKE  840
EEKEEEKEED DDECTTMIDD IDSDEDSEIV DESEDDGSDL EDFIVPDDEV DGMVIPPPNH  900
ATIDKEWNDW EPGSPGSMRF KEIVNVIETH AKIQADITAK VTDMDDNDGE AEHGGGNDAI  960
AGTNTKSTGT TAACGSEPNP QGGPRRSKHH NPWGLDETQA LIEGVSRCGG GKWADIKKLG  1020
FPEIEHRTAV DLKDKWRTLL RTATLPTPSG REAAGKSGGD KKREIPRAML DRVRELAMLH  1080
AKLKARDAGH TLKDTVDDGN KETYTSQLEA TFPALIDGSY LSVEKPKEDG ADVGEEREDA  1140
AAAAAAAEEE LDNDNNKNNN NNSGDASEDD DDDDMPIRGS HHRARKSGSP PTEEELKEAV  1200
FEIIVRVKEG GGKRISTKEV RKELEKKFGC SLKAYKEKIQ DCIYFKVIDR DTTIVRKKRN  1260
NSRKEGSRLK RKRELKTEQD VKDDVRETEG VAEEEVKEEE REVKKRPQRK KKVKKLRKSN  1320
PLRYADSDAS SSDDDDEDDE EEEEEEEEEY NDGDNSDDDN ENEDEMDDKE SKSVEKPHKE  1380
RKTPASIFSN TGVYSVPGKV RWQRTTAQLE RLEQLFANDT TTPRGEKLKQ VTEELSALGP  1440
IQECNVFNWF QNKKSRLKKL EEDAAREKME AAANKRSRDS DDMENGTVVE PEIEKEEQDS  1500
EIDNDLRLLW EKMKVEKRKR PRT*
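
The domain coordinates listed under Signature Domain can be checked against the formatted sequence block. The following is a minimal Python sketch, not part of PlantTFDB: the helper names `clean_sequence` and `region` are hypothetical, and it assumes that domain positions are 1-based and inclusive and that the number at the end of each sequence line is a running residue counter. It uses the two sequence lines covering residues 961 to 1080 and extracts the Myb_DNA-binding region (991 to 1040).

```python
import re


def clean_sequence(block: str) -> str:
    """Keep letters only: drops spaces, line breaks, position counters and '*'."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def region(seq: str, start: int, end: int, first: int = 1) -> str:
    """Slice residues start..end (1-based, inclusive).

    `first` is the sequence position of seq[0]; it allows slicing a
    partial block without loading the whole protein (an assumption of
    this sketch, not a database convention).
    """
    return seq[start - first:end - first + 1]


# Two lines copied from the protein sequence above (residues 961-1080).
block = """
AGTNTKSTGT TAACGSEPNP QGGPRRSKHH NPWGLDETQA LIEGVSRCGG GKWADIKKLG  1020
FPEIEHRTAV DLKDKWRTLL RTATLPTPSG REAAGKSGGD KKREIPRAML DRVRELAMLH  1080
"""

seq = clean_sequence(block)
myb = region(seq, 991, 1040, first=961)
print(myb)
```

With the gap character removed, the extracted region can be compared with the query row of the Myb_DNA-binding alignment.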
3D Structure
Structure
PDB ID  E-value  Query Start  Query End  Hit Start  Hit End  Description
5aho_A  1e-18    40           456        12         296      5' EXONUCLEASE APOLLO
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1256   1272  KKRNNSRKEGSRLKRKR
2    1307   1314  QRKKKVKK
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  FO082274  0.0      FO082274.1 Bathycoccus prasinos genomic : chromosome_5.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_007513035.1  0.0      predicted protein
TrEMBL  K8F4Y0          0.0      K8F4Y0_9CHLO; Uncharacterized protein
STRING  XP_007513035.1  0.0      (Bathycoccus prasinos)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
ChlorophytaeOGCP1276916
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G72650.1  8e-17    TRF-like 6