PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID CA01g01880
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family AP2
Protein Properties Length: 1191 aa    MW: 133136 Da    pI: 7.1479
Description AP2 family protein
Gene Model
Gene Model ID  Type    Source  Coding Sequence
CA01g01880     genome  PEP     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    AP2     37.2   7.3e-12  127    174   5          56
         AP2   5 GVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                 G+r+ k +gr+ A Ird      +rk+++lg+f+t eeA +a+ + +++le+e
  CA01g01880 127 GIRRQK-TGRYGAVIRDtI----RRKQVWLGTFDTVEEASQAYFNKKLELENE 174
                 999877.9*********33....35*************************986 PP

2    AP2     36.5   1.2e-11  279    326   5          56
         AP2   5 GVrwdkkrgrWvAeIrd.psengkrkrfslgkfgtaeeAakaaiaarkklege 56 
                 GVr+ k +gr+ A Ird      +rk+++lg+f+t eeA  a+   +++le+e
  CA01g01880 279 GVRRQK-NGRYGAVIRDtI----RRKQVWLGTFDTVEEASLAYFSKKLELENE 326
                 9**888.8*********33....35************************9986 PP

3    AP2     29.9   1.4e-09  561    606   5          54
         AP2   5 GVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkle 54 
                 G+r+ k +g++ A I d    + ++k+++lg+f+t eeA +a+   + ++e
  CA01g01880 561 GIRRQK-TGKYGAVITD----KiRHKQIWLGTFDTVEEASQAYFSKKFEFE 606
                 899777.9*********....4345******************88887776 PP

4    AP2     34.2   6.5e-11  710    757   4          55
         AP2   4 kGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkleg 55 
                  GVr+ k +gr+ A +rd    + +rk+++lg+f+t eeA +a+   + +le+
  CA01g01880 710 VGVRRQK-NGRYGAVVRD----KiRRKQVWLGTFDTVEEASQAYFSKKSELEK 757
                 59**888.8*********....5456******************998888876 PP

5    AP2     31.5   4.5e-10  867    913   4          54
         AP2   4 kGVrwdkkrgrWvAeIrdpseng.krkrfslgkfgtaeeAakaaiaarkkle 54 
                 +G+r+ k +gr+ A I d      k+k+++lg+f+t eeA +a+   + +l+
  CA01g01880 867 RGIRRQK-SGRYGAVITD----RiKHKKVWLGTFDTVEEASQAYLSKKSELK 913
                 9***888.7*********....4446******************88877765 PP

6    AP2     25.5   3.3e-08  976    1013  6          46
         AP2    6 VrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaa 46  
                  V+ +k +g++  eIr p    ++ kr++lg+f taeeA + +
  CA01g01880  976 VHKRKGSGKYTTEIRNP----ISkKRIWLGTFNTAEEASRVY 1013
                  888999999*******7....246**************9977 PP

Protein Features
Database         Entry ID           E-value   Start  End   InterPro ID  Description
PROSITE profile  PS51032            12.339    124    183   IPR001471    AP2/ERF domain
Pfam             PF00847            7.3E-6    126    174   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           1.18E-9   126    173   IPR016177    DNA-binding domain
CDD              cd00018            6.35E-11  127    164   No hit       No description
Gene3D           G3DSA:3.30.730.10  7.2E-11   127    173   IPR001471    AP2/ERF domain
SMART            SM00380            2.7E-5    127    185   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            12.55     276    335   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           1.37E-10  278    326   IPR016177    DNA-binding domain
Pfam             PF00847            2.2E-6    278    326   IPR001471    AP2/ERF domain
CDD              cd00018            2.23E-12  279    316   No hit       No description
Gene3D           G3DSA:3.30.730.10  3.5E-11   279    324   IPR001471    AP2/ERF domain
SMART            SM00380            8.2E-6    279    337   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            12.181    558    618   IPR001471    AP2/ERF domain
SMART            SM00380            1.4E-4    558    621   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           3.4E-9    560    606   IPR016177    DNA-binding domain
CDD              cd00018            5.35E-11  561    598   No hit       No description
Gene3D           G3DSA:3.30.730.10  1.4E-9    561    604   IPR001471    AP2/ERF domain
Pfam             PF00847            1.4E-4    561    607   IPR001471    AP2/ERF domain
SMART            SM00380            9.2E-5    708    776   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            12.115    708    767   IPR001471    AP2/ERF domain
CDD              cd00018            5.43E-12  710    748   No hit       No description
Pfam             PF00847            2.9E-6    710    757   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           2.55E-9   710    754   IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  7.1E-11   711    755   IPR001471    AP2/ERF domain
PROSITE profile  PS51032            12.536    865    922   IPR001471    AP2/ERF domain
SMART            SM00380            2.6E-4    865    911   IPR001471    AP2/ERF domain
SuperFamily      SSF54171           3.66E-9   866    912   IPR016177    DNA-binding domain
Gene3D           G3DSA:3.30.730.10  1.7E-10   867    906   IPR001471    AP2/ERF domain
CDD              cd00018            1.90E-12  867    905   No hit       No description
Pfam             PF00847            7.5E-6    867    913   IPR001471    AP2/ERF domain
CDD              cd00018            1.46E-10  971    1013  No hit       No description
SMART            SM00380            1.3E-4    972    1027  IPR001471    AP2/ERF domain
PROSITE profile  PS51032            14.197    972    1034  IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10  2.7E-12   973    1021  IPR001471    AP2/ERF domain
SuperFamily      SSF54171           1.18E-8   976    1021  IPR016177    DNA-binding domain
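The feature hits listed above come from several member databases but fall on a small number of stretches of the protein. As a minimal sketch (not part of the PlantTFDB record), the Python below merges the Start/End pairs from the table into non-overlapping regions. The function name `merge_intervals` is hypothetical, and the code assumes that all coordinates are inclusive and use the same numbering.

```python
from typing import List, Tuple

Interval = Tuple[int, int]

def merge_intervals(hits: List[Interval]) -> List[Interval]:
    """Merge overlapping (start, end) pairs into sorted, non-overlapping regions."""
    merged: List[Interval] = []
    for start, end in sorted(hits):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged

# Start/End pairs copied from the Protein Features table, in table order.
hits = [
    (124, 183), (126, 174), (126, 173), (127, 164), (127, 173), (127, 185),
    (276, 335), (278, 326), (278, 326), (279, 316), (279, 324), (279, 337),
    (558, 618), (558, 621), (560, 606), (561, 598), (561, 604), (561, 607),
    (708, 776), (708, 767), (710, 748), (710, 757), (710, 754), (711, 755),
    (865, 922), (865, 911), (866, 912), (867, 906), (867, 905), (867, 913),
    (971, 1013), (972, 1027), (972, 1034), (973, 1021), (976, 1021),
]

regions = merge_intervals(hits)
for number, (start, end) in enumerate(regions, start=1):
    print(number, start, end)
```

Under these assumptions the hits collapse into six regions, one around each of the six AP2 signature domains reported for CA01g01880.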
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1191 aa
METDEQGLLA DENEEETSCK KQKITQIVVD KVKFTIPLEH TNHKAQTTSV ATERSKETDS  60
KMALFGNEWQ NMGTFLGSEE FKGFSSNMHK FQGKETSCIM SNVHGIESFG ECNTSISSGK  120
GKISLIGIRR QKTGRYGAVI RDTIRRKQVW LGTFDTVEEA SQAYFNKKLE LENEKLNQQG  180
NKEDRPEENC DQIQQPESPV VQCLSMANDQ TSDTACVNRI NSHETTRIVE VHKNKMSGEE  240
PGSSKETTCG MASVRGTESS VECNTSTSCN SKGEISLIGV RRQKNGRYGA VIRDTIRRKQ  300
VWLGTFDTVE EASLAYFSKK LELENENLNQ QGNKESKSKE NIDQIQPPES PSVQCLSVGN  360
DQIQPPESPS VQCLSVANDP IQQPESPSLQ CLSVADDQIQ QPESPGAQCL SVANDQIQQP  420
ESHRVRCLSV ANDQIQQPES PGMQCLSVGN DQIQPPESSS VQCLSVANNQ TLDTASVSRI  480
NSHITTTHIL GVHKKNWSGK EQEFSKVTSC LMANVQGTES SNECNTTTSC LPTEKRSLLG  540
IRRQKNGRYG AVITDKRSLL GIRRQKTGKY GAVITDKIRH KQIWLGTFDT VEEASQAYFS  600
KKFEFEKLSQ QDNKDNKPKE NLDQIQQPES PVMQCLSMAI DPTLDTAGVN RINPHETTHI  660
VEVHKNNMSG KEPGSSKETP CLMASVHGTE SSAECNTSTS CNPTGKISLV GVRRQKNGRY  720
GAVVRDKIRR KQVWLGTFDT VEEASQAYFS KKSELEKKKL NQQRNKDNRS KKNGDRIQQP  780
GSPVVLASLS VTDVQAFDTA SVGMRNERID FHKTTHIVGV HKSKTAGKEP ESSKETSCLM  840
DNVHDTESYD EGNTTTSRDP TAKRSLRGIR RQKSGRYGAV ITDRIKHKKV WLGTFDTVEE  900
ASQAYLSKKS ELKKLERQSD KEDKPKKNCD QVQQPESHVV ASFPVANHDQ TLNAARVDRR  960
YKRFDPHETE TRYFRVHKRK GSGKYTTEIR NPISKKRIWL GTFNTAEEAS RVYQSNKLEF  1020
QKLVHAKRQC SNEQTFSKQD GKSEKLVNIK QGHENVDSEL ESAGGSEIVV QVSNSSNGGT  1080
EQRIHSHEIG TCEEAFYDYL SNKFDLQISN KVELQSNMPT DSSAREEKQE GQEDDEDLWM  1140
GEWVQLPGNR AVKFSLKLGL PIIDNYGSLL GEFSTLDDLS ICKTEDGNET *
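The sequence block above groups residues in tens and ends each line with a running position count, so it has to be cleaned before coordinates from the Signature Domain table can be applied. The Python below is a minimal sketch of that step using only the first three lines of the sequence. The helper names (`clean_sequence`, `domain_slice`, `ungap`) are hypothetical, and the code assumes that domain Start/End values are 1-based and inclusive.

```python
import re

def clean_sequence(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one residue string."""
    # Keeping letters only drops spaces, line-end position counts and a '*' stop marker.
    return re.sub(r"[^A-Za-z]", "", block).upper()

def domain_slice(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    return seq[start - 1:end]

def ungap(aligned: str) -> str:
    """Remove gap characters from an alignment row and normalise case."""
    return re.sub(r"[^A-Za-z]", "", aligned).upper()

# First 180 residues, copied from the sequence block.
first_180 = """
METDEQGLLA DENEEETSCK KQKITQIVVD KVKFTIPLEH TNHKAQTTSV ATERSKETDS  60
KMALFGNEWQ NMGTFLGSEE FKGFSSNMHK FQGKETSCIM SNVHGIESFG ECNTSISSGK  120
GKISLIGIRR QKTGRYGAVI RDTIRRKQVW LGTFDTVEEA SQAYFNKKLE LENEKLNQQG  180
"""

seq = clean_sequence(first_180)

# Domain 1 in the Signature Domain table: Start 127, End 174.
domain1 = domain_slice(seq, 127, 174)

# The CA01g01880 row of the domain 1 alignment, gaps included.
aligned_row = "GIRRQK-TGRYGAVIRDtI----RRKQVWLGTFDTVEEASQAYFNKKLELENE"

print(domain1)
print(domain1 == ungap(aligned_row))
```

Comparing the slice with the ungapped alignment row is a quick way to check that the coordinate convention assumed here matches the one used in the domain table.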
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    725    731  DKIRRKQ
Annotation -- Protein
Source  Hit ID                E-value  Description
Refseq  XP_016566287.1        0.0      PREDICTED: uncharacterized protein LOC107864432 isoform X2
TrEMBL  A0A2G3ADS1            0.0      A0A2G3ADS1_CAPAN; Uncharacterized protein
STRING  PGSC0003DMT400031751  0.0      (Solanum tuberosum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA43411240
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G68550.2  6e-12    ERF family protein