PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Cre01.g004600.t1.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family Nin-like
Protein Properties Length: 3177aa    MW: 318711 Da    PI: 9.3928
Description Nin-like family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cre01.g004600.t1.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1RWP-RK59.18.6e-19954752
              RWP-RK  7 ledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRkiksl 52
                        le+l++ F+l+ +dA+ +L+++++ LK+iCRq+GI+RWPhRk++sl
  Cre01.g004600.t1.1  9 LEQLRELFHLSARDACITLNISQSRLKKICRQHGISRWPHRKLASL 54
                        689****************************************997 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PROSITE profilePS5151915.516174IPR003035RWP-RK domain
PfamPF020423.2E-18954IPR003035RWP-RK domain
Sequence ? help Back to Top
Protein Sequence    Length: 3177 aa     Download sequence    Send to blast
MATVGEPPLE QLRELFHLSA RDACITLNIS QSRLKKICRQ HGISRWPHRK LASLDSLRQQ  60
LKGDSGLTTQ DRAVLLARLE AQVAAVIAEP DRPLEKLFED MRQTSYKVRY HARNKLRRGT  120
GAAGEGADDG DAAGTASGGD ASSPAGRALS TERGGGRRGR RGAASPRRGP QRRAASSGPA  180
SGSRLRRAGR RRGSDWGGSS GEEEAASDST EAASDDEEQA AGGAAAGGYS ERDDDEDDEG  240
DDGEDSDEDD EDDEEDEDWV AGGRAGKAGG RQARADMAAA ATAGGSAGRA DRLRRRATGL  300
QGLRVPAAAS PATPRSRRRR RSAAPLDSGD TEPSPGGRSA AAAAPLATTA CGGGDTNSSD  360
SAGRSLEQRQ PAAREGAGDA AGRPPTAPQP QLSGRSRTVA SRSASARAGA PAGLPGFSPV  420
RDAATPSSHT GPAAASRGRL HSGGHGHGHA HGRAAKPERG GAAATLSAFA AAERAVAGAS  480
AAGAAGGGGG GGGGGGSAVK QEREQSAGQL VGPGGGGGVA GAAGRGADGA AARDDARAAA  540
GDVAVEAGAG TGGSSQLDGS SQWGMKRARS APQPQPLGAA GVAGTTSSHQ TTSSDRSQHQ  600
QQQAPLPPLQ ARTSREWPPM DMMPSGWQQP YRQPMPLDAT RTTPATATTA TTAAGGGADV  660
GSGRHQDRRL DHPSHLQQAP QHPHPQEPLH QRAQRPASSP TASMGGPATK QQAHGSYGAA  720
APLHHTAAQS HSHSYSHAPP TATTAPPAEP HPFFSPAAAA AAAAAAGPGI ESSLGLGSSV  780
PRSGALEHGV EDWEWRWREE GQQRHQRALL QQQQQQQEYM PPAPHHHMPH EQQQRHQQLL  840
QSQSQSQSQS QSQPQSQQQR HDLHREQLPP LPPQPEALSR QQQEQRQQEH FELQQLQLLQ  900
QQQQSQQLQQ PQQHLMPSSS SFQRDWIPPP QPDPQPLPSW RQSEPLHAEP QPRASRSQQQ  960
PQQLQSPGHV QQGQPLQHGG ASGSAPTYRG GGFGSELDGG WHWQQQQQQQ QQQQQLLQQQ  1020
QAASLSHQLQ EREQQELQRQ QAQARHHMQL QQQQQGAPPS SSWHGQYPPL HQRPDQAHAQ  1080
YPHLQQPQEH QHPQQERYPP LHQQTQQQLQ QQQQQPQLQQ QQQAQWPGAD PRLLAPGRQP  1140
LPPPPSWSAR QGPGSAPPST SAGHPTLQQQ LYPPMYQQQT QQQPQPQQQN AGQQPAYHHH  1200
PLLSPHSPQY QQRMQASAPV GSGGAHRQSQ SIIPGPPSAL LSLPSPASNE PSPRDFAYPS  1260
PLPPHHQPPH QPHPHHHSLS GSGFDSEGGA DAAAAARRPS NQSQSFDQQH QAAAIASGTL  1320
RNWSSSSSAS SLLVWGSAWP PGSGLGGMGP GGMGPGGVGS GGMGPGPLGP GGSNPHGAAG  1380
SGGGSAYGAA GFRSGLHSGG GVGGAGGAGF GGSAGGGYGW SSGGSSYSGG GFAHAMGLPS  1440
SGGGAAAASG TGGDGGGGDE NRMLHEGLRG RSAYVSDAGD RRDEAVAEAA AAAAATVAAG  1500
PGGGRMGGRD AAGTRGGGGG TGSGAATTAA VAAAAAAAAA VAAVAATSGA ADHETRHRSP  1560
ALQPPTHHGA PTDLLSSRHH ARHSHVLPPA GMNYAAYLPP PLSLPPALDS PHQPPYSASH  1620
GPAHGRLNPA AYSRVVSTGF PGIHPGSLPG SSSPSPSGGG GGGGANAGGG RVGGGSGTAA  1680
RRRGGDRAAS PLGSYGSGAA GGGGGIAAGV KAEDRGPSGG SGGDGGFGGD DTRDVEPHLG  1740
HVAFHGGNAG QQQTAAYRRT SAPAAAAAAT ADPRAQSASG ERDSGGGVAV DLRRVPYTGR  1800
LSEPVMRAPL RALLGLGSDL ADEGEGEAEL AASMAAAARD EERFRMLSTG SSTVAIFAPE  1860
AQPQPHQQTQ HQQPHQPSRP VEGRPHHGYG LPRLQQGAPE GRVASASAAD RGGRRTSGDM  1920
ERGGNGGRRP HTVPDARGWA GPADLLLACQ ETDAIAQAEA ADGVRGAVGH LHQATYQAAA  1980
SLHVAQQPRG GAAYEEPPPR RQPHQAAGAG GPSPVCSADL TTPTAAHQRP QQLRPHAAAP  2040
PSATSSRHQV SRRAAAAEAA VVTATPVAAT GSSGRPGPAA GPSPAAAPVT AAAASGAAFL  2100
SPVASVAAGS GVAGGPATSS ASSGRRSSGG GSSSSGGRTL LVWTGSQPRW GQVLPPLQHQ  2160
SQPQQQQPQQ QQQPQQQPEL QSPSMRPPQL QPLPLMPSSA PLRTGPQRRI SGAGAAGAEV  2220
TTGTATASGP GQESTSAAAA GAAGALPPMP PLPASLLAAL AAGGGAGGDL SPDAYPVTAL  2280
QPPAVGAADR GTDAGQREQD RPQGRRASPP PAATDPANAT TAGGVPLSIL EYLRHGSGSE  2340
PPLDLEALED LDMDLDLDFQ TPQAGDAAAA LALMRPWLTS TPGLGPTPME ISPRAALPAT  2400
RRISAAGTEA PAPQRQQAPP ATGQPTCEDA AGSTAAGGGT AAAPAARMTS AGQAAASTAL  2460
SLSASAAGPS RPPHATCAAA DATTFASPVA AAAADAARQH EFASPAPHAA AATAAAPAAV  2520
SPCTPAAAVI PGSGAGGAAG AGAPASPPPA PVKRPPPARL FLHTVRRHSG TGEGPAAEPT  2580
AARAHAPAAH AEGGGMDAPP LPHHQSQQQH QSQQHPQHHH QQASPYGGAA AHMYAAWCSS  2640
PAATPTRHNA SGGGAAAAAA GSTGGAAFAA APAPPQAPRA AAAADADTRS VSVTGGLGGS  2700
GSGGSAHGGG TGGLGGVSPR SRAALLVDVA EPLGPYGPQQ PTTATSTAAA AVAAGEAGGA  2760
SGRQLQPAPA PPTAASRYPW NMGWGAGASG AAGRENTPLR QLPPFPPPQP ELAALPLHRA  2820
QATGTVATGT AGPGEQGPQG YQQRSGGGDT TEAHVPPPLL QPPPRATSPT GYAVASYRRD  2880
SEGGGVIPVV SPPPQLMPQS VRPPPRLAPL VAPTAAAGPP QPPPLTSPPA PRKPAKQPAL  2940
MAPGPPAGQR RSSGGSVDTG GSAGGAAGRN SPASLAGAGT GAAGPLPKLP PHVLERPLPP  3000
RMQVHMPAPA PSPAAVQGPV GATAHSGAAA PGSRLLAAAM SPVGGTNAGG AAAGASPPPL  3060
LKVTIGGRPG SLQSAIGSFS VQPTQLPQPP QLPALPQSQH QSQRQHQVAE QGRFGMPGAA  3120
GWAVPGAAVG AAGSHLPQLH QHQHAAAGIL DGDTQSAGGT GGGGVSSKRK REGDSE*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1290295DRLRRR
216681674GGRVGGG
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Cre.125580.0normal conditions
Cis-element ? help Back to Top
SourceLink
PlantRegMapCre01.g004600.t1.1
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_001702133.10.0RWP-RK transcription factor
TrEMBLA0A2K3E4Z70.0A0A2K3E4Z7_CHLRE; Uncharacterized protein
STRINGEDO972220.0(Chlamydomonas reinhardtii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
ChlorophytaeOGCP841650
Representative plantOGRP5671777
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G53040.11e-11RWP-RK domain-containing protein