PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Gh_A02G0324
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family CPP
Protein Properties Length: 755 aa    MW: 82529.3 Da    pI: 5.1932
Description CPP family protein
Gene Model
Gene Model ID  Type    Source   Coding Sequence
Gh_A02G0324    genome  NAU-NBI  View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    TCR     50.9   3.1e-16  459    497  2          40
          TCR   2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNke 40 
                  ++k+CnCkkskClk+YCeCfaag++C e C+C+dC+Nk 
  Gh_A02G0324 459 SCKRCNCKKSKCLKLYCECFAAGVYCIEPCSCQDCFNKP 497
                  689**********************************96 PP

2    TCR     52.3   1.1e-16  544    582  1          39
          TCR   1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39 
                  ++k+gCnCkks+ClkkYCeCf+ g+ Cs +C+Ce+CkN 
  Gh_A02G0324 544 RHKRGCNCKKSNCLKKYCECFQGGVGCSINCRCEGCKNA 582
                  589***********************************7 PP

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
SMART            SM01114   1.5E-18  458    499  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   37.358   459    584  IPR005172    CRC domain
Pfam             PF03638   1.0E-11  461    496  IPR005172    CRC domain
SMART            SM01114   2.1E-17  544    585  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   4.6E-12  546    582  IPR005172    CRC domain
Sequence
Protein Sequence    Length: 755 aa
MMDTPEKTQI SSSLSKFEDS PVFNYINSLS PIKPVKSVHV AQTFNPLSFA SLPSIFTSPH  60
VSSHKESRFL KTYTEPLKPE LSSADGTKVS TNEEAGADAQ ENFDQGVSLG ETSFEMPNEP  120
SRIAIGLPQT LKYDCSSPDC DATPSVIKTT CVSDTSLAIV PFVQETSEKG LSDRVEIRDT  180
FQVEQKRDTI GSEWESLISD TSDLLIFNSP NDSEAFRGVI QKSLDPSVLI SQFSQDDINE  240
ACQTTVDLDK YKDQTEGAGD MNEMNPVNES FEVASVTNFI SGSLTDYMET RMSAPYSFKP  300
DSNLHRGFRR RCLDFEMLAA RRKNLDGGST TNSSTDNKLV PGKPDSDSPR CIVPGIGLHL  360
NALAIASRDN KNMKLETLSS GTQKLSFPSL NSPCTGGAET TYESLPSAST ERESDAVENG  420
VQLAEDASQA SAYLVNEEFN QNSPKKKRRR LEQAGEGESC KRCNCKKSKC LKLYCECFAA  480
GVYCIEPCSC QDCFNKPIHE DTVLATRKQI ESRNPLAFAP KVIRTSDSVP EVGDDLITTP  540
ASARHKRGCN CKKSNCLKKY CECFQGGVGC SINCRCEGCK NAFGRKDGSA IMETDGEPGE  600
EETDPSEKNA LDKNFEKPDI LNNEEQNPAS ALPTTPLQLC RPLVQLPFSS KSKPPRSFIA  660
IGSSSALYTG QRYGKPSIIR PQNIIEKHFQ TIAEDETPEI LRGNSSPGTG IKTSSPNSKR  720
ISPPQYELGS TPGGRSGRKL ILQSIPSFPS LTPKH
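
The sequence block above carries a running position count at the end of each line, and the Signature Domain table gives start and end coordinates into it. The following is a minimal sketch, not part of the database itself, of how the block can be cleaned and the two TCR domain ranges sliced out. The names `clean_sequence` and `slice_1based` are hypothetical helpers, and the sketch assumes the domain coordinates are 1-based and inclusive.

```python
import re

# Sequence block copied from this record, including spaces and position counters.
RAW_BLOCK = """
MMDTPEKTQI SSSLSKFEDS PVFNYINSLS PIKPVKSVHV AQTFNPLSFA SLPSIFTSPH  60
VSSHKESRFL KTYTEPLKPE LSSADGTKVS TNEEAGADAQ ENFDQGVSLG ETSFEMPNEP  120
SRIAIGLPQT LKYDCSSPDC DATPSVIKTT CVSDTSLAIV PFVQETSEKG LSDRVEIRDT  180
FQVEQKRDTI GSEWESLISD TSDLLIFNSP NDSEAFRGVI QKSLDPSVLI SQFSQDDINE  240
ACQTTVDLDK YKDQTEGAGD MNEMNPVNES FEVASVTNFI SGSLTDYMET RMSAPYSFKP  300
DSNLHRGFRR RCLDFEMLAA RRKNLDGGST TNSSTDNKLV PGKPDSDSPR CIVPGIGLHL  360
NALAIASRDN KNMKLETLSS GTQKLSFPSL NSPCTGGAET TYESLPSAST ERESDAVENG  420
VQLAEDASQA SAYLVNEEFN QNSPKKKRRR LEQAGEGESC KRCNCKKSKC LKLYCECFAA  480
GVYCIEPCSC QDCFNKPIHE DTVLATRKQI ESRNPLAFAP KVIRTSDSVP EVGDDLITTP  540
ASARHKRGCN CKKSNCLKKY CECFQGGVGC SINCRCEGCK NAFGRKDGSA IMETDGEPGE  600
EETDPSEKNA LDKNFEKPDI LNNEEQNPAS ALPTTPLQLC RPLVQLPFSS KSKPPRSFIA  660
IGSSSALYTG QRYGKPSIIR PQNIIEKHFQ TIAEDETPEI LRGNSSPGTG IKTSSPNSKR  720
ISPPQYELGS TPGGRSGRKL ILQSIPSFPS LTPKH
"""


def clean_sequence(raw: str) -> str:
    """Drop whitespace and position counters, keeping only residue letters."""
    return re.sub(r"[^A-Za-z]", "", raw).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]


seq = clean_sequence(RAW_BLOCK)
domain_1 = slice_1based(seq, 459, 497)  # first TCR hit in the Signature Domain table
domain_2 = slice_1based(seq, 544, 582)  # second TCR hit

print(len(seq))
print(domain_1)
print(domain_2)
```

Under that 1-based assumption, both slices should reproduce the query rows of the two TCR alignments shown in the Signature Domain section.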
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
5fd3_A  4e-16   463          581        14         120      Protein lin-54 homolog
5fd3_B  4e-16   463          581        14         120      Protein lin-54 homolog
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    443    450  PKKKRRRL
2    444    449  KKKRRR
3    446    450  KRRRL
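
When mapping the NLS entries back onto the protein sequence, it helps to check which coordinate convention the Start and End columns follow. The sketch below is an illustration only: it takes residues 421 to 460 of the sequence in this record (the names `WINDOW` and `locate_1based` are hypothetical) and compares the 1-based position of each motif with the listed values.

```python
# Residues 421-460 of the protein sequence in this record (1-based numbering,
# taken from the running position count in the sequence block).
WINDOW_START = 421
WINDOW = "VQLAEDASQASAYLVNEEFNQNSPKKKRRRLEQAGEGESC"

# (listed start, listed end, motif) as given in the NLS table.
NLS_TABLE = [
    (443, 450, "PKKKRRRL"),
    (444, 449, "KKKRRR"),
    (446, 450, "KRRRL"),
]


def locate_1based(motif: str) -> tuple[int, int]:
    """Return the 1-based inclusive (start, end) of the first match in WINDOW."""
    idx = WINDOW.find(motif)
    if idx < 0:
        raise ValueError(f"{motif} not found in window")
    start = WINDOW_START + idx
    return start, start + len(motif) - 1


offsets = []
for listed_start, listed_end, motif in NLS_TABLE:
    start, end = locate_1based(motif)
    offsets.append((start - listed_start, end - listed_end))
    print(motif, "listed", (listed_start, listed_end), "1-based", (start, end))
```

In this sketch every motif sits one position later in 1-based numbering than the table lists, which suggests (but does not prove) that the NLS Start and End values are 0-based, unlike the Signature Domain coordinates.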
Expression -- UniGene
UniGene ID  E-value  Expressed in
Ghi.10845   0.0      boll
Binding Motif
Motif ID  Method  Source                   Motif file
MP00624   PBM     Transfer from PK22848.1  Download
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_017633780.1      0.0      PREDICTED: protein tesmin/TSO1-like CXC 2 isoform X1
TrEMBL  A0A0B0P3S6          0.0      A0A0B0P3S6_GOSAR; Protein lin-54
STRING  Gorai.005G044800.1  0.0      (Gossypium raimondii)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Malvids  OGEM1467              28           91
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT4G14770.1  1e-151   TESMIN/TSO1-like CXC 2