PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cre03.g149400.t1.1
Organism Chlamydomonas reinhardtii
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family Nin-like
Protein Properties Length: 2124 aa    MW: 209767 Da    pI: 5.0179
Description Nin-like family protein
Gene Model
Gene Model ID         Type     Source   Coding Sequence
Cre03.g149400.t1.1    genome   JGI      View CDS
Signature Domain
No.   Domain   Score   E-value   Start   End    HMM Start   HMM End
1     RWP-RK   64      2.5e-20   1881    1930   3           52
              RWP-RK    3 keisledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRkiksl 52  
                           +++ + l+  ++lpi++AA  L++++TvLK+ CR++ I+RWP+Rk++s+
  Cre03.g149400.t1.1 1881 RQLTKQSLKDVYHLPINEAAAALNIGVTVLKKYCRKFSIPRWPYRKLNSV 1930
                          58999*******************************************97 PP

Protein Features
Database          Entry ID   E-value   Start   End    InterPro ID   Description
PROSITE profile   PS51519    16.153    1865    1950   IPR003035     RWP-RK domain
Pfam              PF02042    3.4E-19   1882    1930   IPR003035     RWP-RK domain
Sequence
Protein Sequence    Length: 2124 aa
MDLDVPDLLA DFNPGAGLVV PIASVGQALL PLARNLPKEE LENVDVTGLD ALLSSDPSAL  60
NDVTFQRSSP SAPTPQATPT APSTSARPGS STTLAAGASV HQPYLREQQR QTLPHLPSLS  120
GFASAGAGQQ PACPAGSHLG PAAALGGTQA CLSKISDPGC GKQTTSNRSS LSQMDEDGGD  180
DAGAPEHEAQ DVRNGDSGRG DGGGGGHMYA SEPPRGPEPM TLISGCSETA SGHRIDGGEA  240
PSVSHSPSGA GGGASAPGAA GGASGATRPE QAETAPSRMS ANHQLPGADP TAGASPRNGS  300
GIAAADGDAE GSGDGDGDEE EEEEDDEELG DALAGAPAEG AVSGSRPAKA ARSKAQPRGR  360
RPKAAKQDGG GEGGSDSEDE DGGESRHLGR RVIEYAEKIV EQAWAANQLS DVPHTATLDC  420
NELPGIGFKK AGRILATAAV RLLVRGEEAQ KKGTLRTLLR SAENAGILAQ QGGPAGAAGI  480
AAGGGAARGT AGGADMGASA AAAAAAAAAV ASMPNLASMD PETLRVCLAA AAAAAAAAAG  540
TPSATSGVTQ QPQTHAQANA IVNLASELQQ HGLGQWAVQA AAEVAMKAAA AVLQQQQQHA  600
AAAAAADPRG AGSSVDAGPS SLNLLTAAAV GPPDMRHFGG FAAAPSHAAA GGNMATLYSA  660
QGNPRVEPAA APGPFQQMVL EAALADDCNP DGGAGAGLTT GGELECTLSG PRSSTDELLQ  720
AVFDGMGDPG GARALTGNSG GGPSGGSNVV AGVAEPRKSD GAAWDLLGQM FDSYCGGGAP  780
AAATDTQASV SLPHRQHHQP LQLPLPHVSL QHPALAMIAA AEAAGPTGGF RLNGGSITML  840
DAWVDAQNDE HVDAAVVAAM VAGDDDVPRL HDTLGPVVGG AATAACADGA GLSTASRQHQ  900
HQHPQQQAAP TPTLLAPMPV RGGGGAGVSA LADVSPGDHS PGSDGGGLMA PPQPRPPICI  960
SPPPEDLDPS ELRDDFPDLP LTASMFRISN MSIADGPPAP GSARPSASAF GANPFGQFGA  1020
AGGVGGGGTP NRLSNALSIL RAGSAGCMDM LMSNDFMDAL AAADPLLAAE VSAGGAAGGG  1080
GRASHADKFL MSIDALPPVP PLLPPSLIPP GYANALVMTT GDAGAGLSNS GPLSAAAGAA  1140
GYRRHQQGME AHAALERVAE EQPDVDAEGT EEPEEEWGQG RRPRRGAASR AVTARSGAPR  1200
EVLQEGEDEG EDDGGTVAAA PWAARAAGSS RGVPSARRQL VPASPMSSPM MTSPMRSPTA  1260
PARTFVEGPT AAAAVAAAGA IMAPQCRAVA GVLGGGTTSI ALGGNGHRGA PPQAAPGPSS  1320
FSAPFGGGGG GSSLSGNSQG GPVTARALFM MDGAGGLAGG GGGAGVTFAD GLGGDCAPSL  1380
MPPPPPFQHG HGHGHHNLFA DPRYGAAAHD LEAMNRQQLS APAVLEHPGA AMLCAGAGDA  1440
TATSTSGVSG GGCGGGVGGG SVVSSAADPS GGLFAGPAAM LASLTAAAAA GGGVMGMAAS  1500
APQPQVPVGL VSMGTGHSSG SQLYASAASV APGGGGGLWA EQPQVQLQQH TLHTPAPGMQ  1560
QHAHSQAAPD VLSLQQLHSR DKASPLRSAS FSRGSVGSST RSPRRAHLNA AIQAVVKAAS  1620
ASPSPSPLAR GSDGRVIRRG AGAAAAAGFA GSMESPLAGS AAAAAGGNSR AGSVAPQLPP  1680
LPTLPQLPQL PQLSPVPQYA PHSDLGDGAA HHGLTAEMGL AAYDGVTAAG VADGAAAAAE  1740
EAKEASTLAA AIAAATTGGG SVDGFTRKKG GAKAAGGGRA GRGAKRGGAG SADIGSGSDD  1800
MDAGSAPSGG GAAPTAERPS GVGRLGAAAP SGRGSQMDSG DEESGGGGGG DEEGDGGGVS  1860
GGGEGVSVVN LRKVTQDGAP RQLTKQSLKD VYHLPINEAA AALNIGVTVL KKYCRKFSIP  1920
RWPYRKLNSV NKLMETFERY KRDALLGGNV TGGEECEVVL QSLGKMKVEL YEDPDKDIDE  1980
RIKKLRQANF KVEYRARQDT TGKQQQPQQQ QHQDEFQQLL QQQLQQHLMQ QQQQQMFLPQ  2040
QQQQQQQQQE LPGAQGQQML MPVGDGFGMH GGMPQGAAGM LPAMPPPFLL PPFNPAALPS  2100
QFLPEGSAQG FGGGCPAANP TNF*
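
The Start and End values in the Signature Domain table (1881 and 1930) are positions in the protein sequence above. The sketch below is one way to recover the domain residues from the formatted block. It assumes the block uses ten-residue groups with a running position counter at the end of each row, and that coordinates are 1-based and inclusive. The helper names `clean_sequence_block` and `slice_region` are hypothetical and are not part of PlantTFDB.

```python
import re


def clean_sequence_block(text: str) -> str:
    """Drop position counters, whitespace and a trailing '*' from a formatted block."""
    residues = re.sub(r"[^A-Za-z*]", "", text)
    return residues.rstrip("*").upper()


def slice_region(seq: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the coordinate of seq[0]; set it when only part of the
    protein is supplied, as in the excerpt below.
    """
    return seq[start - first_pos : end - first_pos + 1]


# The two rows covering residues 1861-1980, copied from the record above.
excerpt = """
GGGEGVSVVN LRKVTQDGAP RQLTKQSLKD VYHLPINEAA AALNIGVTVL KKYCRKFSIP  1920
RWPYRKLNSV NKLMETFERY KRDALLGGNV TGGEECEVVL QSLGKMKVEL YEDPDKDIDE  1980
"""

fragment = clean_sequence_block(excerpt)
domain = slice_region(fragment, 1881, 1930, first_pos=1861)
print(domain)
```

The printed residues can be compared against the Cre03.g149400.t1.1 row of the RWP-RK alignment in the Signature Domain section.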
Nuclear Localization Signal
NLS
No.   Start   End    Sequence
1     1448    1460   SGGGCGGGVGGGS
Expression -- UniGene
UniGene ID   E-value   Expressed in
Cre.18461    0.0       normal conditions
Cis-element
Source        Link
PlantRegMap   Cre03.g149400.t1.1
Annotation -- Protein
Source   Hit ID           E-value   Description
Refseq   XP_001695568.1   0.0       RWP-RK transcription factor
TrEMBL   A0A2K3DVL1       0.0       A0A2K3DVL1_CHLRE; Uncharacterized protein
STRING   EDP01355         0.0       (Chlamydomonas reinhardtii)
Orthologous Group
Lineage                Orthologous Group ID   Taxa Number   Gene Number
Chlorophytae           OGCP841650
Representative plant   OGRP5671777
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT5G66990.1   2e-22     RWP-RK domain-containing protein