PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID IGS.gm_6_00181
Common Name CHLNCDRAFT_143607
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Trebouxiophyceae; Chlorellales; Chlorellaceae; Chlorella
Family AP2
Protein Properties Length: 1334aa    MW: 132297 Da    PI: 9.3947
Description AP2 family protein
Gene Model
Gene Model ID   Type    Source  Coding Sequence
IGS.gm_6_00181  genome  JGI     View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    AP2     35.6   2.3e-11  155    203  1          54
             AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkr.krfslgkfgtaeeAakaaiaarkkle 54 
                     s+y+GV w++++g+W+A+I++        k+  lg++ t+e Aa+a+++ + + +
  IGS.gm_6_00181 155 SRYRGVVWHRSNGKWEARIHE------AgKQRFLGYHATEEAAARAHDEQAVRVH 203
                     79*******************......339*********99******99776665 PP

2    AP2     42.9   1.2e-13  358    407  1          55
             AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                     s+++GV+w+++ g+W+A+++       +  +++g+f  + eAa+a+++a+++++g
  IGS.gm_6_00181 358 SRFRGVSWNSSCGKWRAQVWKG-----SEVHHVGYFEDEAEAARAYDRAALRIRG 407
                     789***************7663.....6**********99*************98 PP

3    AP2     26.6   1.4e-08  503    546  1          49
             AP2   1 sgykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaa 49 
                     s y+GV+wd+ r  WvAe++       ++r  lg f +++eAa+a++ a
  IGS.gm_6_00181 503 SSYQGVSWDPLRAGWVAELWTG-----TQRRLLGVFPSEQEAARAYDLA 546
                     679***************9994.....69999******99*******97 PP

4    AP2     41     4.9e-13  776    823  3          55
             AP2   3 ykGVrwdkkrgrWvAeIrdpsengkrkrfslgkfgtaeeAakaaiaarkkleg 55 
                     y GV+wdk+++rW+++I+        kr+ lg+ +++e Aa+a+++a+ +l+g
  IGS.gm_6_00181 776 YNGVSWDKRKQRWFSQIQQH-----GKRHFLGYCDSEEAAARAYDRAAVRLYG 823
                     88**************9994.....4***********************9988 PP

Protein Features
3D Structure
Database         Entry ID            E-value   Start  End   InterPro ID  Description
CDD              cd00018             1.91E-11  155    213   No hit       No description
SuperFamily      SSF54171            3.27E-13  155    213   IPR016177    DNA-binding domain
Pfam             PF00847             1.9E-5    155    197   IPR001471    AP2/ERF domain
SMART            SM00380             1.1E-12   156    218   IPR001471    AP2/ERF domain
PROSITE profile  PS51032             15.04     156    212   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10   3.4E-13   156    213   IPR001471    AP2/ERF domain
CDD              cd00018             5.09E-11  358    417   No hit       No description
SuperFamily      SSF54171            6.47E-14  358    417   IPR016177    DNA-binding domain
Pfam             PF00847             9.8E-8    358    407   IPR001471    AP2/ERF domain
SMART            SM00380             1.4E-9    359    421   IPR001471    AP2/ERF domain
PROSITE profile  PS51032             14.486    359    415   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10   1.6E-11   359    416   IPR001471    AP2/ERF domain
SuperFamily      SSF54171            1.44E-10  503    561   IPR016177    DNA-binding domain
Pfam             PF00847             5.9E-5    503    546   IPR001471    AP2/ERF domain
SMART            SM00380             3.2E-5    504    566   IPR001471    AP2/ERF domain
PROSITE profile  PS51032             14.249    504    560   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10   2.9E-9    504    560   IPR001471    AP2/ERF domain
SuperFamily      SSF54171            3.86E-13  774    832   IPR016177    DNA-binding domain
SMART            SM00380             1.0E-8    775    837   IPR001471    AP2/ERF domain
PROSITE profile  PS51032             14.262    775    831   IPR001471    AP2/ERF domain
Gene3D           G3DSA:3.30.730.10   1.5E-12   775    833   IPR001471    AP2/ERF domain
Pfam             PF00847             2.5E-6    775    823   IPR001471    AP2/ERF domain
Gene3D           G3DSA:1.20.1000.10  3.7E-4    936    1006  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0006355  Biological Process  regulation of transcription, DNA-templated
GO:0005634  Cellular Component  nucleus
GO:0003677  Molecular Function  DNA binding
GO:0003700  Molecular Function  transcription factor activity, sequence-specific DNA binding
Sequence
Protein Sequence    Length: 1334 aa
MIAGAEEQTR TLHDGGSELG PSTAATGLPA ASHQRADDGA PPPDADAATP VAAGSGGGVE  60
VPAQAVVSPP LAGRSSPRVL ASSGGDAAAG QARLTSGAAR RSARPSSMRA LQLGISRGKD  120
AAVDPSRSDR GGRPAAVAAP APAQAAADPS PPRVSRYRGV VWHRSNGKWE ARIHEAGKQR  180
FLGYHATEEA AARAHDEQAV RVHGDLSKVN FPERYAHLGQ PTAAAGSGGR ARSRSRSRGS  240
PRRSSRSRGG GAAGGGSGGS TPQSAVPAAA SRRSPFASAA AGSGDSRDAS PASAEQPSSA  300
AAATAAAAAA GTPTAAPAGA TSASPPAPAT GSALRQRPAR GGGGGRGGPR VQPVKGSSRF  360
RGVSWNSSCG KWRAQVWKGS EVHHVGYFED EAEAARAYDR AALRIRGPDT PTNFPASEYV  420
DPGSVGAAAE APAAAAAGDP GQQQQPQPAR EQQQQQQQQQ AAVKEEEHLG AAGSGTATAP  480
PSGQAATAPG PPAGGGGGGS SGSSYQGVSW DPLRAGWVAE LWTGTQRRLL GVFPSEQEAA  540
RAYDLATLAE QGPQAATNLP LAGYDAELAA AAALRTLGRE PVAHTPPPQQ LGQDAGGKQQ  600
QQQQQLGVGG SGQGLPPPQQ QEQGFGQAGG GGAAAAGLQP MHADGSSPPR DAAALLAEVP  660
PLSPGLAAAV ALAVQPLPSF KPLLATPPQL PSAPAAAQEG PPGAPTKAGA APSAAAAATA  720
AASKVPPAGR AAGAPAGAPG AAAAAAPAGQ RVEAPAGGNV SAQNRCCGSN RGLSAYNGVS  780
WDKRKQRWFS QIQQHGKRHF LGYCDSEEAA ARAYDRAAVR LYGPQAQLNL PESAALAAAA  840
ATPAQGAATA SGGPGGSEVR SPSRGSDAGG SGRQRAAGSG AGRHGGLAKT TSDAATAAAA  900
AGGMAAAAAA AAATLPVVEE PEGQHDPASG SALVPAAADR AKAFQQQQQQ QQQQQQQGQT  960
PSPLSAIEQL ARLQQLHQLE QQHELAQQQQ LLLQAVQEKQ RWHEQEQALL LSVLQRQQSL  1020
ESLPALPPSL ALAATAQAAS QPPASQQALQ GLLQALGGTG GGSASLPAAS LSSPDLGALA  1080
ARLQQQLQAA GTQQQQQQQQ QQQQQQQQQA PLERLLLAQL AAQQQAQQQA HTGLPLAAPL  1140
QHQWALLAGS DGGSGSALSG HKRGSDELES AMQQLLAAQQ QAAPQPQASN LQHACAHACE  1200
ESTLPAMLHI SRGEFLSLVA KRARMLAEAA SAALLVLPGG DRAGSAPAAA APAPSAAGAA  1260
APAVPGDTTA PAPASAAGAG GEPGGAPVAA PARSRGGAQP EGEPAASGGA AAEGAAASGG  1320
CEPGARRQQP APP*
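
The sequence above is printed in blocks of ten residues with a running count at the end of each line. As a minimal sketch (the function names are illustrative and not part of PlantTFDB), the snippet below strips that layout from the two lines covering residues 121 to 240 and then cuts out Signature Domain hit 1, assuming its Start and End values (155 and 203) are 1-based and inclusive. The result agrees with the aligned sequence shown for that hit once the gap characters are removed.

```python
import re

# Two consecutive lines copied from the Protein Sequence block (residues 121-240).
BLOCK = """\
AAVDPSRSDR GGRPAAVAAP APAQAAADPS PPRVSRYRGV VWHRSNGKWE ARIHEAGKQR  180
FLGYHATEEA AARAHDEQAV RVHGDLSKVN FPERYAHLGQ PTAAAGSGGR ARSRSRSRGS  240
"""


def parse_numbered_block(text: str) -> str:
    """Drop spaces, newlines and running counts; keep only residue letters."""
    return "".join(re.findall(r"[A-Z*]", text))


def slice_1based(fragment: str, first_residue: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first character is residue number `first_residue` of the full protein."""
    return fragment[start - first_residue : end - first_residue + 1]


fragment = parse_numbered_block(BLOCK)            # residues 121..240
domain1 = slice_1based(fragment, 121, 155, 203)   # Signature Domain hit 1
print(len(fragment), len(domain1))
print(domain1)
```

The same helper should work on the full sequence by passing first_residue=1.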
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    229    237  RARSRSRSR
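
Checked against the protein sequence in this record, the motif RARSRSRSR occupies residues 230 to 238 when counting from 1, so the Start and End values above look like 0-based inclusive offsets rather than the 1-based positions used in the Signature Domain table. The sketch below reproduces that check on the sequence line covering residues 181 to 240. The variable names are illustrative, and the offset convention is an inference from this one record, not a documented PlantTFDB rule.

```python
# Sequence line for residues 181..240, copied from the Protein Sequence block
# with the spaces and the running count removed.
LINE_181_240 = "FLGYHATEEAAARAHDEQAVRVHGDLSKVNFPERYAHLGQPTAAAGSGGRARSRSRSRGS"
FIRST_RESIDUE = 181          # 1-based position of the first character above
NLS = "RARSRSRSR"            # motif reported in the NLS table

idx = LINE_181_240.find(NLS)             # 0-based index within this line
start_1based = FIRST_RESIDUE + idx       # 1-based, inclusive
end_1based = start_1based + len(NLS) - 1
start_0based = start_1based - 1          # 0-based, inclusive
end_0based = end_1based - 1

print("1-based:", start_1based, end_1based)
print("0-based:", start_0based, end_0based)
```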
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_005849298.1  0.0      hypothetical protein CHLNCDRAFT_143607
TrEMBL  E1ZA35          0.0      E1ZA35_CHLVA; Uncharacterized protein
STRING  XP_005849298.1  0.0      (Chlorella variabilis)
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G39250.1  4e-09    AP2 family protein
Publications
  1. Blanc G, et al.
    The Chlorella variabilis NC64A genome reveals adaptation to photosymbiosis, coevolution with viruses, and cryptic sex.
    Plant Cell, 2010. 22(9): p. 2943-55
    [PMID:20852019]