PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Sopen12g015990.1
Organism Solanum pennellii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family TCP
Protein Properties Length: 2260 aa    MW: 254630 Da    pI: 8.6034
Description TCP family protein
Gene Model
Gene Model ID     Type    Source  Coding Sequence
Sopen12g015990.1  genome  spenn   View CDS
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    TCP     101.1  1.9e-31  2066   2163  1          91
               TCP    1 aagkkdrhskihTkvggRdRRvRlsaecaarfFdLqdeLGfdkdsktieWLlqqakpaikeltgtssssasec.eaesssssasnsssg.. 88  
                        a+g+kdrhsk++T++g++dRRvRls ++a++f+d+qd+LG+d++sk+i+WL+++ak+ai++l + +++ +s   ++++ + s ++++s+  
  Sopen12g015990.1 2066 ATGRKDRHSKVSTAKGPKDRRVRLSPNTAIQFYDVQDRLGYDRPSKAIDWLIKEAKAAIDALGEFPNNFHSTKlNPKKMQYSFDQEQSPef 2156
                        589***********************************************************99944444333455555555554444434 PP

               TCP   89 ....kaa 91  
                            +  
  Sopen12g015990.1 2157 sqenRGV 2163
                        3322222 PP

Protein Features
3D Structure
Database         Entry ID           E-value    Start  End   InterPro ID  Description
Pfam             PF03732            7.7E-17    204    300   IPR005162    Retrotransposon gag domain
SuperFamily      SSF57756           5.06E-7    389    425   IPR001878    Zinc finger, CCHC-type
Gene3D           G3DSA:4.10.60.10   8.2E-5     407    424   IPR001878    Zinc finger, CCHC-type
SMART            SM00343            4.4E-4     408    424   IPR001878    Zinc finger, CCHC-type
PROSITE profile  PS50158            10.823     409    424   IPR001878    Zinc finger, CCHC-type
Pfam             PF08284            2.4E-19    492    611   IPR013242    Retroviral aspartyl protease
CDD              cd00303            6.78E-15   511    592   No hit       No description
SuperFamily      SSF56672           6.56E-172  665    1094  No hit       No description
Gene3D           G3DSA:3.10.10.10   2.1E-35    690    831   No hit       No description
PROSITE profile  PS50878            13.508     722    901   IPR000477    Reverse transcriptase domain
CDD              cd01647            1.44E-92   725    901   No hit       No description
Pfam             PF00078            2.8E-30    741    900   IPR000477    Reverse transcriptase domain
Gene3D           G3DSA:3.30.70.270  2.5E-5     832    909   No hit       No description
CDD              cd09274            2.74E-58   995    1110  No hit       No description
Gene3D           G3DSA:3.30.420.10  1.3E-30    1273   1430  IPR012337    Ribonuclease H-like domain
SuperFamily      SSF53098           7.33E-43   1274   1433  IPR012337    Ribonuclease H-like domain
PROSITE profile  PS50994            20.331     1274   1437  IPR001584    Integrase, catalytic core
Pfam             PF00665            1.7E-12    1283   1394  IPR001584    Integrase, catalytic core
SuperFamily      SSF54160           1.32E-5    1546   1626  IPR016197    Chromo domain-like
Pfam             PF00385            7.4E-7     1583   1630  IPR023780    Chromo domain
Pfam             PF03634            1.3E-29    2067   2160  IPR005333    Transcription factor, TCP
PROSITE profile  PS51369            30.013     2069   2127  IPR017887    Transcription factor TCP subgroup
Gene Ontology
GO Term     GO Category         GO Description
GO:0015074  Biological Process  DNA integration
GO:0003676  Molecular Function  nucleic acid binding
GO:0008270  Molecular Function  zinc ion binding
Sequence
Protein Sequence    Length: 2260 aa
MVRTRATTTS TPAPAGQGAS EPATGAVARG RAAARGRGRG RGRTSSRGRG QAPAPSGTRA  60
VTPPPTEEVV IEGEEGENEQ VQNEGLPPQP TPEMINQVLA YLSGLSDQGQ TPPVFSAPAP  120
QVPEVQQATA AAPRMDAPLE IGTFPRLTTG PIMTSDQHEL FSKFLKLKPP VFKGAESEDA  180
YDFLVDCHEL LHKMGIVERF GVEFVTYQFQ GNAKMWWRSH VECQPTEAPP MTWASFSSLF  240
MEKYIPRTLR DRRRDEFLSL EQGRMTVTAY EAKFRALSRY ATQLCFSSQE RIRRFVKGLR  300
SDLRISALQV AATAKSFQEV VDFVIEVEGV KPDDFTMAST SKRFRKGGEF NGSYSRGQGS  360
GGYSVRPIQS SLQTVVGGPP QTGQHFSEVG GYSQTSSFSQ RPMLDSRECY GCGETGHIRR  420
NCPKQSYRPP IARGRGGHGR GRYSGGRGGR GNGGHQNGRG DGQTGATTTP HGRGNGQTGD  480
RAHCYAFPGR SEAETSDVVI TGNLLVCDCM ASVLFDPGST FSYVSSSFAT GLNLHCELLD  540
MPIRVSTPVG ESVIVEKVYR SCLVTFMGSN TRVDLVILEM VDFDVILGMT WLSPNFAILD  600
CNAKTVTLAK PGTDPLVWEG NYTSTPVRII SFLRAKKMVS KGCLAFLAHL RDDTTQVPSI  660
ESVSVVREFL DVFPADLPGM PPDRDIDFCI DLEPGTRPIS IPPYRMAPAE LRELKAQLQE  720
LLGKGFIRPS ASPWGAPVLF VKKKDGSLRM CIDYRQLNKV TVKNRYPLPR IDDLFDQLQG  780
ACVFSKIDLR SGYHQLKIRA ADVPKTAFRT RYGHYEFLVM SFGLTNAPAA FMSLMNGILK  840
PYLDLFVIVF IDDILIYSKS KKEHEEHLRM VLEMLREKKL YAKFSKCEFW LDSVSFLGHV  900
VSKDGVMVDP SKIEVVKNWV RPTNVSEIRS FVGLASYYRR FVKGFSSIAS QLTNLTKQNV  960
PFVWSDECEE SFQKLKTLLT TAPILTLPVE GKNFIVYCDA SYSGLGAVLM QEKNVIAYAS  1020
RQLKVHERNY PTHDLELAAV VFALKQWRHY LYGVKCEVYT DHRSLQYVFT QKDLNLRQRR  1080
WMELLKDYDV TILYHPGKAN VVADALSRKA GSMGSLAHLQ ASRRPLAREV QTLANDLMRL  1140
EVNEKGGFLA SVEARSSFLD KIKGKQFNDE KLIRIRDKVL RGEAKEAIID EEGVLRIKGR  1200
VCVPRVDDLI NTILTEAHSS RYSIHPGATK MYRDLKQHFW WSRMKRDIVD FVAKCPNCQQ  1260
VKYEHQRPGG TLQRMPIPEW KWERIAMDFV VGLPKTLGKF DSIWVIVDRL TKSAHFIPVK  1320
VTYNAEKLAK LYISEIVRLH GVPLSIISDR GTQFTSKFWR TLHAELGTRL DLSTAFHPQT  1380
DGQSERTIQV LEDMLRACVI EFGGHWDNFL PLAEFSYNNS YHSSIDMAPF EALYGRRCRS  1440
PIGWFDAFEV RPWGTDLLRD SLEKVKSIQE KLLAAQSRQK EYADRKVRDL EFMEGEQVLL  1500
KVSPMKGVMR FGKRGKLSPR YIGPFEVLKR VGEVAYELAL PPGLSGVHPV FHVSMLKRYH  1560
GDGNYIIRWD SVLLDENLSY EEEPVAILDR EIRKLRSREI ASIKVQWKNR PVEEATWEKE  1620
ADMQERYPHL FTDSGTPFCP CFSSCDRSRT NDGTDRHRRD GPSQTLGKNL VSELCDGAAG  1680
RTVAGTTARH RLRNPRLGRI SVKKVSKNRN AEDCAKELTH KAIHDTSFHR RSIGVGRRLE  1740
VQASGSNSGN SYQLWSYGRT DSPSTDRRIV LANFGLDQRV VLVPEVEEKL KLNLTLSRMN  1800
DSHNLANVGD EINRQPPPPV DYQVIVENPE DKIRGYDVVE EVNGELVHNS WKAAEIPQKL  1860
TPIPRPPPPF MQRLVKNMMD GKYRGFITLL KQLSINVPLI EAPEQMFGYA MFMKDMVTTK  1920
RSISFDDDDQ MQHCSSIAIR SPVQNKEDPV GFGLPKAHCN APTDGRSNGE EADFVIIDCE  1980
FQVSDEEDGS EEEEEEEEEV FQENDIGNLQ SHYQNQQMPQ SLCKKPEKWA NFTVSEQELN  2040
KGTRRMKPKR AKTDVIEGHG GRIIRATGRK DRHSKVSTAK GPKDRRVRLS PNTAIQFYDV  2100
QDRLGYDRPS KAIDWLIKEA KAAIDALGEF PNNFHSTKLN PKKMQYSFDQ EQSPEFSQEN  2160
RGVPNSECGV QDKQQEVNYD IPNLFSSSDG LKIPFLSDLQ SYPHGHFLNF QSLQDDTILS  2220
SGNHHQGSFF TTTSVNHFPS VLSQIRCFLT GNPFSPVSSL
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4ol8_A  4e-89   665          1108       34         475      Reverse transcriptase/ribonuclease H
4ol8_B  4e-89   665          1108       34         475      Reverse transcriptase/ribonuclease H
4ol8_E  4e-89   665          1108       34         475      Reverse transcriptase/ribonuclease H
4ol8_F  4e-89   665          1108       34         475      Reverse transcriptase/ribonuclease H
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    30     39    RAAARGRGRG
2    2043   2049  RRMKPKR
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  HG975451  0.0      HG975451.1 Solanum pennellii chromosome ch12, complete genome.
Annotation -- Protein
Source  Hit ID              E-value  Description
Refseq  XP_027771432.1      0.0      LOW QUALITY PROTEIN: uncharacterized protein LOC114076513
TrEMBL  Q6F2D6              0.0      Q6F2D6_SOLDE; Putative polyprotein, identical
STRING  Solyc11g020560.1.1  0.0      (Solanum lycopersicum)
Orthologous Group
Lineage   Orthologous Group ID  Taxa Number  Gene Number
Asterids  OGEA41122145
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT2G31070.1  3e-33    TCP domain protein 10