PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Seita.2G112400.1.p
Common NameSETIT_033066mg
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Panicodae; Paniceae; Cenchrinae; Setaria
Family HD-ZIP
Protein Properties Length: 736aa    MW: 79319.1 Da    PI: 6.8618
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Seita.2G112400.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox50.72.9e-1654103655
                         S--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHH CS
            Homeobox   6 tftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRake 55 
                         ++ +eq++eL+++++++ +p++++r+ L +k+gL+ rqV++WFqN+Ra++
  Seita.2G112400.1.p  54 RHAAEQIQELKAAYKQCDHPDEKARRALGAKIGLEARQVRFWFQNQRAQM 103
                         56789*******************************************86 PP

2START108.31.3e-342154483205
                         HHHHHHHHHHHHHHC-TT-EEEE..........EXCCTTEEEEEEESSS.SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EE CS
               START   3 aeeaaqelvkkalaeepgWvkss..........esengdevlqkfeeskvdsgealrasgvvdmvlallveellddkeqWdetla....ka 79 
                         ae+ +++++ +a+ +ep+Wv+ +          ++ + ++ l + +++  +  ea r+ gvv +++a lv +l d + +W+et++     a
  Seita.2G112400.1.p 215 AEASMDQFLMLATKGEPLWVLTPdgevlncqvyQKRTFPVLLGTCPDG--FVREATREAGVVRGAAAHLVGILTDAN-RWSETFPgvvaGA 302
                         6899*******************9866666666666666666666553..68*************************.*******999633 PP

                         EEEEEECTT......EEEEEEEEXXTTXX-.SSXEEEEEEEEEEE.TTS-EEEEEEEEE-......TTS--...-TTSEE.....-EESSE CS
               START  80 etlevissg......galqlmvaelqalsp.lvpRdfvfvRyirqlgagdwvivdvSvds......eqkppe..sssvvR.....aellpS 150
                         +  +v++ g      g +qlm+a+l + sp l++R + f+Ry++   +g w+++dvSvd        +k+    +s v+      ++llpS
  Seita.2G112400.1.p 303 TAGNVVNVGvfaphnGLIQLMNANLWVQSPrLRNRSVNFLRYSKLTAEGHWAVMDVSVDAilglegSRKVDLnvDSDVMQawhkgCRLLPS 393
                         334455555777889**************95667************************963333222333336667777644444799*** PP

                         EEEEEEECTCEEEEEEEE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
               START 151 giliepksnghskvtwvehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                         g+l+e+++ng++kvtwv h+++++ +++ ++++l +sg a ga++w+a lqrqce
  Seita.2G112400.1.p 394 GCLVEDMGNGYCKVTWVVHAEYDETTVPTMFKPLFRSGKAHGAQRWLASLQRQCE 448
                         ******************************************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466893.25E-1341103IPR009057Homeodomain-like
PROSITE profilePS5007113.5446106IPR001356Homeobox domain
Gene3DG3DSA:1.10.10.603.6E-1546100IPR009057Homeodomain-like
SMARTSM003891.0E-1249110IPR001356Homeobox domain
CDDcd000862.52E-1250103No hitNo description
PfamPF000467.4E-1454103IPR001356Homeobox domain
PROSITE profilePS5084829.161204452IPR002913START domain
SuperFamilySSF559617.33E-20211449No hitNo description
SMARTSM002342.3E-10213449IPR002913START domain
PfamPF018524.5E-27215448IPR002913START domain
CDDcd088754.81E-74215448No hitNo description
SuperFamilySSF559615.5E-7470663No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
GO:0008289Molecular Functionlipid binding
Sequence ? help Back to Top
Protein Sequence    Length: 736 aa     Download sequence    Send to blast
MDNEGNQQRS NNEDVLHSLL GPEDQMNIDS ANVDDDNDDV YIPEPEAPSK RPKRHAAEQI  60
QELKAAYKQC DHPDEKARRA LGAKIGLEAR QVRFWFQNQR AQMQAKAQVQ DTKAVQQEHA  120
ALMAENASLR HAMLTMSCVA CSGGTAPAEP SPEKRRLLAE NARLRDEHMR ATALLHKILL  180
EAEAPPPSGR PVPAASTHSM LSAAVEGRGA LLRHAEASMD QFLMLATKGE PLWVLTPDGE  240
VLNCQVYQKR TFPVLLGTCP DGFVREATRE AGVVRGAAAH LVGILTDANR WSETFPGVVA  300
GATAGNVVNV GVFAPHNGLI QLMNANLWVQ SPRLRNRSVN FLRYSKLTAE GHWAVMDVSV  360
DAILGLEGSR KVDLNVDSDV MQAWHKGCRL LPSGCLVEDM GNGYCKVTWV VHAEYDETTV  420
PTMFKPLFRS GKAHGAQRWL ASLQRQCEYL SALHSSHVPR SSHNTGTISP MGRRGIMELA  480
QRMMASFYSA VSGPVTQPSS SIDEWHGSNG TGAERIAAAV RMVTWKKAGS MDGEPAGLVL  540
SSSTTVWLPN TPPQLVFEYL CNGQCRGEWD TFANGAAVEE LGSVATGHLD GNAVSVLCPN  600
VTDGINSKML ILQEARTDAS CSIVVYAPVE EERMRAVMNG GDHASVDFLL PSGFAVLPDG  660
HGKARRHAPC TSEAPVGDGN TAGALLTVAC QALLPGSPSG DHTARAFDDV GRLLCRAIKK  720
IKAAVKAKIV VASLM*
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008651755.10.0PREDICTED: uncharacterized protein LOC100383907 isoform X2
TrEMBLK4A2G80.0K4A2G8_SETIT; Uncharacterized protein
STRINGSi033066m0.0(Setaria italica)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP83241742
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G00730.11e-143HD-ZIP family protein