PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cre04.g218050.t2.1
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Chlamydomonadaceae; Chlamydomonas
Family Nin-like
Protein Properties Length: 1840 aa    MW: 178771 Da    pI: 7.1684
Description Nin-like family protein
Gene Model
Gene Model ID       Type    Source
Cre04.g218050.t2.1  genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End  HMM Start  HMM End
1    RWP-RK  61.7   1.3e-19  136    184  4          52
              RWP-RK   4 eisledlskyFslpikdAAkeLgvclTvLKriCRqyGIkRWPhRkiksl 52 
                          + l++l++ ++lp k+AA++Lgv+l+ LKr CR +++ RWPhRk++sl
  Cre04.g218050.t2.1 136 ALPLSELQQVYHLPSKEAARQLGVSLSRLKRSCRAHNVLRWPHRKLASL 184
                         57799*****************************************997 PP
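
The Start and End columns of the domain table can be checked against the protein sequence by slicing it with 1-based, inclusive coordinates. The sketch below is illustrative only: `slice_region` and `fragment_start` are hypothetical names, and the fragment is residues 121-190 copied from the protein sequence listed in this record rather than the full 1840 aa protein.

```python
def slice_region(sequence: str, start: int, end: int, first_pos: int = 1) -> str:
    """Return residues start..end (1-based, inclusive).

    first_pos is the position of sequence[0] in the full protein, so a
    fragment can be sliced with full-length coordinates.
    """
    lo = start - first_pos
    hi = end - first_pos + 1
    if lo < 0 or hi > len(sequence) or lo >= hi:
        raise ValueError("coordinates fall outside the supplied sequence")
    return sequence[lo:hi]


# Residues 121-190 of Cre04.g218050.t2.1, copied from this record.
fragment_start = 121
fragment = (
    "NSASGAGNVGDALDDALPLSELQQVYHLPSKEAARQLGVSLSRLKRSCRA"
    "HNVLRWPHRKLASLHNLRDT"
)

# RWP-RK hit reported at positions 136-184.
domain = slice_region(fragment, 136, 184, first_pos=fragment_start)
print(domain)
print(len(domain))
```

The slice reproduces the target line of the alignment above, which suggests the domain coordinates on this page are 1-based and inclusive.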

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End  InterPro ID  Description
PROSITE profile  PS51519   16.893   119    204  IPR003035    RWP-RK domain
Pfam             PF02042   3.1E-19  137    184  IPR003035    RWP-RK domain
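
The three RWP-RK predictions in this record (signature domain 136-184, PROSITE profile 119-204, Pfam 137-184) can be compared as inclusive intervals. A minimal sketch, assuming all three use the same 1-based numbering; `overlap_length` is a hypothetical helper, not part of any PlantTFDB tool.

```python
def overlap_length(a: tuple, b: tuple) -> int:
    """Number of positions shared by two inclusive (start, end) intervals."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Coordinates copied from the tables in this record.
signature = (136, 184)
prosite = (119, 204)
pfam = (137, 184)

# A hit lies entirely inside another when the overlap equals its own length.
pfam_len = pfam[1] - pfam[0] + 1
pfam_inside_prosite = overlap_length(pfam, prosite) == pfam_len
signature_pfam_overlap = overlap_length(signature, pfam)
print(pfam_inside_prosite, signature_pfam_overlap)
```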
Sequence
Protein Sequence    Length: 1840 aa
MQRIGPGSWQ SQPSHDAPLG RDYPLHGFSL PPDEGPHRPG PGAGHFHHPH QHHGHGTDSH  60
LGVRHAPLKA AAGASGRGPS QALHAFPNAL LEYHQKQEPM SDDDDNDGDG EGAGRGSGGP  120
NSASGAGNVG DALDDALPLS ELQQVYHLPS KEAARQLGVS LSRLKRSCRA HNVLRWPHRK  180
LASLHNLRDT IRNDRNMKPP DKERIQAQIS TELAAVIANP DHTIDTRLDE IRQAKYKLKY  240
YHKTKHIHAA ASAARRKRAS TGADGPEPSA NSAGGSGSGA GGAAGGGQGG YGAASVALTL  300
PGGGIDVTSP AASMLTGGGG GPMPSASVIT TATGVTTTEV MGSGGGPRLS YTSGAGASGL  360
GGGGLVGGGG GARLSGSGLA WPPQAPGGNG GGGGGGGGTP WQPSSGSWQQ QPQPWSGVSA  420
FSGGGGGGFG GGHGGRGGVQ GNDIVALGDT GGGNGGDGGR GGGGDPMSRA GSLTSGAPLP  480
QLPPGGRYTG GSNSSVQLPH PPPLSLGPHF GSAPSQSQSH TGALLQLPPA LPPTVQPPQH  540
SQHQQQHYPP IHSGVRPHPH SPQQPSLQQQ QQQQQYAAGG GSGSGMRLQR SLHGDASGGG  600
PASGDFGGGG GMYGGANNVL RRPSYGVSDE TPGTGSGFGF QHQAQQEQQH NHNQHMHMQN  660
QQQQHSPPAS VVYGHSGGRP HSQPHPHQDP SLQQYPEGPA SGPSSGPLSG QPWHQQPQMQ  720
QQQQQMQAPN QTQNQPSLLQ LHQQQQQQQH AAAAAAAANA AAVARHMSGD LDTAMEVERA  780
AGAGAGTAGA SGAGASTMSH GVIDTACAPV GQQGQGQEAA AAQVARHTSA SGAAPAAGGR  840
VGQWSTAPQL NRQTSAPGPR HSYGGHVSHY PDIGSSSGGA HGAGAAASPP EFLFTTAGAT  900
AGAPGSGHHP GVPGDMQPAP GGRVRMHSVD GAGFGISGGA GHFGSGAGHF GSGAGHYGSP  960
GPYGGAHPYG HSGGMGSYGM PGSGGHGPGG GGGYGSFGHG PGGGGAPAGG FGGGRERFYS  1020
VDGSSFSFPA TAPNSTGTGY PSLPASRQSA AGSSFAAGGG GGGGHSFHAG PPPALPSSAP  1080
AYGRAPYDSG VQTPGMSSVY GTALTPVAAA PPPGHGTPDA GGADLAPAPT LTTLLQPRRS  1140
DGGAGGGSGL GSLPEGSAVA HQQPHQHAHQ HHMLHPLPGC SPTGGPGGGG GGGGGGMSHA  1200
EAVRQHLLAP RDRAGSGRLT GTGSGSIGTN SGGGSAFALA GRSSIGRDGG AAGVGGGVGL  1260
SPAEIAAQLR ASAVSGGGPP AFDRRSDSGE VATATAGAEP LLLPSAGSPA TAVDGSGGGM  1320
GRRPSMGSGG GVTADDGSSA LAVAMAAAAA DVNAAASRMQ QVQQQFYSRS VTTGAPAAAP  1380
FMQRSTSSGG GAATPGATAL PAALPPGPAT GAPGLGFGMG MPPASADARG MAAAAAALGD  1440
GPSWMELGSV NSGQMPSFQP HAPPPPQRPQ PQEHPFPQQL HPGQPQAAAD DAQLRYQYQQ  1500
HQHQQHSHGS GSGGGAALPP GCHTVTTGRY GSPPDQPPAP PPTAMATAGS PPSSTAGTYP  1560
SPGGGSPDLS TTPPAGGSGA GASGGVAAMQ ADGGQPRAQA AAAAPPSRAN RQPYLDDLAA  1620
ALYDTLDDTD FGGGRRGGGG GGGRGRRGGA TLVDDGDDDT DGDYDDDQDQ PGPAGRGGRG  1680
VAAAAGGATG TAGTTPMRMG SSAELVSGSH QPQQQHQSPS PDQMQHQQLQ DGSGSDGGGA  1740
AAAAAAGGAV TVAAAGAVAA AAAAAAGGGD GGLTGGAPPG ANSGAIPAAL LLSSGTIMST  1800
GGGGGTLASL DWMLMDAQNT PPGGANGQLL PPPPSPPRQ*
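
The sequence above is laid out in blocks of ten residues with a running position counter at the end of each line and a terminal `*`. To use it elsewhere it usually has to be joined into one string. A minimal sketch, assuming that layout; `parse_blocked_sequence` is a hypothetical helper and the example input is only the first two lines of the listing.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join 10-residue blocks into one string.

    Purely numeric tokens (the running position counters) are skipped and a
    trailing '*' (stop marker) is dropped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isdigit():
                continue  # position counter such as 60 or 120
            residues.append(token.rstrip("*"))
    return "".join(residues)


# First two lines of the listing in this record.
example = """MQRIGPGSWQ SQPSHDAPLG RDYPLHGFSL PPDEGPHRPG PGAGHFHHPH QHHGHGTDSH  60
LGVRHAPLKA AAGASGRGPS QALHAFPNAL LEYHQKQEPM SDDDDNDGDG EGAGRGSGGP  120"""

seq = parse_blocked_sequence(example)
print(len(seq))
```

Applied to the whole listing, the position counters and the 39 residues on the last line suggest 1839 residues followed by `*`, so the stated length of 1840 appears to count the stop marker.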
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1632   1640  GGRRGGGGG
2    1635   1643  RGGGGGGGR
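
The NLS coordinates can be cross-checked by searching for each motif in the protein sequence. The sketch below uses residues 1621-1650 copied from the sequence in this record; `find_motif` is a hypothetical helper. In this fragment both motifs are found one position later than the Start and End values listed above, which would be consistent with zero-based coordinates in the NLS table. That reading is an inference from the data shown here, not a documented convention.

```python
def find_motif(sequence: str, motif: str, first_pos: int = 1) -> list:
    """Return 1-based inclusive (start, end) pairs for every occurrence.

    Overlapping occurrences are reported. first_pos is the position of
    sequence[0] in the full protein.
    """
    hits = []
    i = sequence.find(motif)
    while i != -1:
        hits.append((i + first_pos, i + first_pos + len(motif) - 1))
        i = sequence.find(motif, i + 1)
    return hits


# Residues 1621-1650 of Cre04.g218050.t2.1, copied from this record.
fragment = "ALYDTLDDTDFGGGRRGGGGGGGRGRRGGA"

nls1 = find_motif(fragment, "GGRRGGGGG", first_pos=1621)
nls2 = find_motif(fragment, "RGGGGGGGR", first_pos=1621)
print(nls1, nls2)
```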
Cis-element
Source       Link
PlantRegMap  Cre04.g218050.t2.1
Annotation -- Protein
Source  Hit ID      E-value  Description
TrEMBL  A0A2K3DU19  0.0      A0A2K3DU19_CHLRE; Uncharacterized protein
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G53040.1  6e-11    RWP-RK domain-containing protein