PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gh_D02G0389
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties Length: 755aa    MW: 82566.3 Da    PI: 5.1309
Description CPP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gh_D02G0389genomeNAU-NBIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1TCR50.93.1e-16459497240
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Gh_D02G0389 459 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 497
                  689**********************************96 PP

2TCR52.31.1e-16544582139
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCkks+ClkkYCeCf+ g+ Cs +C+Ce+CkN 
  Gh_D02G0389 544 RHKRGCNCKKSNCLKKYCECFQGGVGCSINCRCEGCKNA 582
                  589***********************************7 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM011141.5E-18458499IPR033467Tesmin/TSO1-like CXC domain
PROSITE profilePS5163437.365459584IPR005172CRC domain
PfamPF036381.0E-11461496IPR005172CRC domain
SMARTSM011142.1E-17544585IPR033467Tesmin/TSO1-like CXC domain
PfamPF036384.6E-12546582IPR005172CRC domain
Sequence ? help Back to Top
Protein Sequence    Length: 755 aa     Download sequence    Send to blast
MMDTPEKTQI SSSLSKFEDS PVFNYINSLS PIKPVKSIHV TQTFNPLSFA SLPSIFTSPH  60
VSSHKESRFL KSYTDPLKPE SSSADGTKVS TNEEAGADAQ ENFDQGVSLG ETSFEMPNEP  120
SRIAIGLPQT LKYDCGSPDC DATPCVFKTT CVSDTSLAIV PFFQEASEKG LSDGVEIRDT  180
FQVEQKRDTI GSEWESLISD TSDLLIFNSP NDSEAFRGVI QKSLDPGVLI SQFSQDDINE  240
ACQTTVDLDK YKDQTEGAVE MNEMNPVNES FEDASVTNFI SGSLTDYMET RMSGPYSFKP  300
DSNLHRGFRR RCLDFEMLAA RRKNFDGGST TNSSTDNKLV PGKPDSDSPR CIVPGIGLHL  360
NALAIASRDN KNMKLETLSS GTQKLSFPSF NSPRIGGAET AYDSLPSAST ERESDAVENG  420
VQLAEDASQA SAYLVNEELN QNSPKKKRRR LEQAGEGESC KRCNCKKSKC LKLYCECFAA  480
GVYCIEPCSC QDCFNKPIHE DTVLATRKQI ESRNPLAFAP KVIRTSDSVP EVRDDLITTP  540
SSARHKRGCN CKKSNCLKKY CECFQGGVGC SINCRCEGCK NAFGRKDGSA IVETDGEPGE  600
EEMDPSEKNA LDKNFEKPDI LNNEEQNPAS ALPTTPLQLC RPLVQLPFSS KSKPPRSFIA  660
IGSSSALYTG QRYGKPSIIR PQNIIEKHFQ TIAEDETPEI LRGNSSPGTG IKTSSPNSKR  720
ISPPQCELGS TPGGQSGRKL ILQSIPSFPS LTPKH
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
5fd3_A1e-1646358114120Protein lin-54 homolog
5fd3_B1e-1646358114120Protein lin-54 homolog
Search in ModeBase
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1443450PKKKRRRL
2444449KKKRRR
3446450KRRRL
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Ghi.108450.0boll
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016714367.10.0PREDICTED: protein tesmin/TSO1-like CXC 2
TrEMBLA0A1U8LIE40.0A0A1U8LIE4_GOSHI; protein tesmin/TSO1-like CXC 2
STRINGGorai.005G044800.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM14672891
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G14770.11e-150TESMIN/TSO1-like CXC 2