PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID PK22848.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Rosales; Cannabaceae; Cannabis
Family CPP
Protein Properties Length: 791aa    MW: 86400.4 Da    PI: 6.2379
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
PK22848.1genomeCCBRView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR49.96.3e-16497534340
        TCR   3 kkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                +k+CnCkkskClk+YCeCfaag++C e C+C++C+Nk 
  PK22848.1 497 CKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQECFNKP 534
                89**********************************96 PP

2TCR50.54e-16581619139
        TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                ++k+gCnCkks+ClkkYCeC++ g+ Cs +C+Ce+CkN 
  PK22848.1 581 RHKRGCNCKKSSCLKKYCECYQGGVGCSISCRCEGCKNA 619
                589***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011144.0E-17495536IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163436.776496621IPR005172CRC domain
PfamPF036381.6E-11498533IPR005172CRC domain
SMARTSM011144.4E-17581622IPR033467Tesmin/TSO1-like CXC domain
PfamPF036381.3E-11583619IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 791 aa     Download sequence    Send to blast
MDTPEKNKIG TPKSKFEDSP VFNYISSLSP IKPVKSLHIT QTFSSLSFGS PPSVFTSPHV  60
SSNKESRFLR RYNSSDQAKP EVSSENGVDV PASDDAVIET VQVYNNSVEL RENCKSDVNV  120
EEASAEPQNE GSAFVIELPR VLKYDCGSPD CVSTPHDVVA DFESNSSDPS ANLVPHVQVA  180
PEIGLSDNEA QIQEVCLSEP KKQGAIGDWE CLMSDAVLIF DSPNTSETFK GLIHDSLEPV  240
GRCSNSLGTE FQQNEINNEH ETQIVDPGCS EQHNGEEPLS QTGDVSHLED MEQTHDRFNS  300
NSGMATNPSK RKDKEAETRM AFTCKSVFSL HRGMRRRCLD FDVSVARRKN LGDGSNSSSV  360
LVQPDEETPA NERRQGFIRH CGEAPARRIL PGIGLHLNAI ATTSKDCKIT KNVSSGIHIR  420
LPGSTVSIHS PMDDQEGLDK ALIPVSSEMD INTIENGVQL LEDPSQAPGP ITNEEFNQNS  480
PKKKRRRSEN AGETEGCKRC NCKKSKCLKL YCECFAAGVY CIEPCSCQEC FNKPIHEDTV  540
LATRKQIESR NPLAFAPKVI RGSDPAPDYG DESSKTPASA RHKRGCNCKK SSCLKKYCEC  600
YQGGVGCSIS CRCEGCKNAF GRKDGSLMTG SEEQDEEEAE ACEKSLVAPV QKMEIHNNED  660
PNPGSSHPIT PLRISRSLVQ LPFSSKGKPP RSSLVTGTSS SSGLYSQMHG KPSILRSSQP  720
KLEKHVQTIP DEEMPEILVG DGSPNSAVKT SSPNSKRVSS PHCHFGSPGR RPGRKLILHS  780
IPSFPSLTPQ H
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A2e-1850061814120Protein lin-54 homolog
5fd3_B2e-1850061814120Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1481486KKKRRR
Binding Motif ? help Back to Top
Motif ID Method Source Motif file
MP00624PBM25215497Download
Motif logo
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_010092418.10.0protein tesmin/TSO1-like CXC 2 isoform X1
TrEMBLA0A2P5E3850.0A0A2P5E385_PARAD; Lin-54-like protein
STRINGXP_010092418.10.0(Morus notabilis)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF26193375
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-163TESMIN/TSO1-like CXC 2