PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID EMT14894
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Pooideae; Triticodae; Triticeae; Triticinae; Aegilops
Family G2-like
Protein Properties Length: 2213aa    MW: 233112 Da    PI: 8.6793
Description G2-like family protein
Gene Model
Gene Model ID Type Source Coding Sequence
EMT14894genomeBGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1G2-like97.59.5e-3119522006155
   G2-like    1 kprlrWtpeLHerFveaveqLGGsekAtPktilelmkvkgLtlehvkSHLQkYRl 55  
                kprl+WtpeLHerF +av++LGG++kAtPk+i+++m+++gLtl+h+kSHLQk+Rl
  EMT14894 1952 KPRLKWTPELHERFADAVKKLGGPDKATPKAIMRVMGIPGLTLYHLKSHLQKFRL 2006
                79****************************************************8 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1172897.06E-1321270No hitNo description
SuperFamilySSF1172897.06E-13335495No hitNo description
PROSITE profilePS5129410.74219492009IPR017930Myb domain
Gene3DG3DSA:1.10.10.609.7E-2919502007IPR009057Homeodomain-like
SuperFamilySSF466893.4E-1519512007IPR009057Homeodomain-like
TIGRFAMsTIGR015572.3E-2219522007IPR006447Myb domain, plants
PfamPF002491.2E-819542004IPR001005SANT/Myb domain
PfamPF143796.6E-820502073IPR025756MYB-CC type transcription factor, LHEQLE-containing domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009793Biological Processembryo development ending in seed dormancy
GO:0010070Biological Processzygote asymmetric cell division
GO:0005634Cellular Componentnucleus
GO:0005739Cellular Componentmitochondrion
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 2213 aa     Download sequence    Send to blast
MAAPRELDLS DEVEGEMDGT TDFVFRLVGD PIPVLPPASA PLPLFDLQSP PARPLAVSDR  60
HAAVFLAHPN GFIAARTKAL IEASKEAREK GKASTRCAQD CCVTDVPLPG VTLLALSRDQ  120
SVLAACTGSE IQFFSATSLL TDKDIKPSSS CSMGRSGTVK DFKWLDNAYI VLSNGGLLSH  180
GSLGQGLKDI MENVDAVCHL SEIYVVDCSK DGNHIAVARK NSLRILSPDL KETCCMALLF  240
QSWPDSDSEG TDIKVDSIGW VRDDSIVVGC VRLNEESNEE GYLVQVIRSG GDTFFEERKH  300
PLSHTVPAAK EISGLLDVVI ISILSLTCRL FVMKTNLLKW QLSLVTPWEQ SPLVNYLHSS  360
SKPVVFSYDV FGGIMDDILP SGVGPNLLLG YLHRWDLLVV TNKKSTDDHI SLLKWPSKTD  420
EERTVVYLEM VEDKYSPRID LQENGDDNVI LGFGVENVSL FQKITVLVGP EQKEVAPQHL  480
LLYLTSEGKL IIYYLARISD PSDLPQTSLS TIEDSNVNKQ ISPATASNKD LTPSVTSSMA  540
KSLLAGPGAS SAPAEKDQHG SGDAKSSFPI SNSKDIAAGS SLLISSDKKP LDTKQVNTAS  600
PFAPPSSSAP TGNMKPGMPF SFSTGNNVGL NSTGSKGSSE PVSSWQPNTS GSFVSSQLGK  660
GGFDSAKPLG AFGGSQNATK SGGSLSFKSS VFSSDGSVPV KTAERDGASS FGSYPAQTSY  720
TTERKVLGSS AGLSSVPSLS ISPNKPVGAS SAGFGAGNLE VPPVSRGSPL PQQIIGKSPN  780
NRNHMSADSK NFKLGTMFDT QQDLSKKIYS INDMTEELDT LLSYIEKDGG FRDACMTLQQ  840
RPLSVLEGDL QNLLELLQVF KNKVEEQCSK AEDLRNKMFQ VSARQAYMKG ILTQSSDTQY  900
WDIWNRQNLS PEFEAKRQNI LKANQNLTNQ LVELERHFNN LEMNKFGETG RVASSRRAVY  960
SNKSRSSQTQ LSSVYNALNS QLAAAEQLSE CLSKQISALN ISSPSKKRGA VTKELFESIG  1020
LAHMADAAKF SGGTPSKLIQ RFPSTKEHTK GMLGPSKSAE PETARRRRES LDMSLASLEP  1080
QKTTVKRIAQ QQRLKISSDL PFRSNKKMFD SQMAAISQET FGGSPSSSIV ESYTSRVRSP  1140
IEVLDEKTKP SGPQGNSLFK WVKEPAGSSQ GSEQKHLDLS GRMKSADQSS KLTPSSPASF  1200
SYTQKDARDR TSTPNVASLG AMHTVPKSNT LTFKTNIAPK TNANTRPDMS PSVSSPMPVK  1260
TLSGDSGAGF TLTTKNRYSD QDVPSFGSMK GFGISPQNTG VNKPSLSSEP SKPVVLHGKT  1320
FQVSGVSDTM QNSAKASPQV AFSPTSQSSS FPIMSGASSS AASLLSTMQA SAAKTSDVSS  1380
PTVSSTLPPQ ESAPKTHPTV PEGTVSCSLP SIPTPVKESL PDLNKNASKP EVVTPEVTGT  1440
TVSVSATLTG VPTSESKTAL LPVTNSSLSS NPPSIPVPKV VPGATESAVV TSTRKDAGPS  1500
NLSSDEDEMD EERPSASADL NLGALSGFGL GSAPSSSPKK SNPFGSAFGT SESKSSSSPF  1560
TLTTSPGQLF RPASLSIPSS QPAQPSQSSS SSTFSSAFSS GLGGFGQSAQ VGSGQQSGFG  1620
QSAQIGAGQQ AGFGQPAQIQ SGFGQPAQVG AAQQSGFGQP AQIGGGQQSG FGQPAQLGAQ  1680
QALGSVLGSF GQSRQLGGGF GGFASSSSGA FASAPSSNSG FAGAAAGGGF LRLFISVGGG  1740
FAAAATGGGF ASLASKSGGG FAAAASSSGG FGGAAQGGGF GGAAQGGGFG GAAQGGGFGG  1800
GLAIAGYLVS KRYLERNWSL RLPGAYTGMK NKSIRCQALV NKQRPKLNAE GIRQVLGTEH  1860
MHARTPHSIT GTPTEVEPSQ PTVKLRRRLQ SSHPAFLWEM YHQQQQQFHD HRQHMSSRPS  1920
LSPENKFFMK GQGGAGAGGG GDAGLILSTD AKPRLKWTPE LHERFADAVK KLGGPDKATP  1980
KAIMRVMGIP GLTLYHLKSH LQKFRLSKNL QAQANAVHAK NVYGFGTATD KACEGRGSPA  2040
DHLNRETNTS RHLQLRIEAQ GKYLHSVLEK AQEALGKQHV VAGLEAAEPT QRLPELASSV  2100
RRGLLQNDGS ADDSCLTASE DILSMGLSAS ATRRGCGAPF ETSASASREE DGECYLFLGK  2160
PEGRREVRRD GCSGGAAFGT AAELDLSIGV VAASSRRRPD GGERLDLNGS GWN
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
6j4r_A1e-1819522008157Protein PHOSPHATE STARVATION RESPONSE 1
6j4r_B1e-1819522008157Protein PHOSPHATE STARVATION RESPONSE 1
6j4r_C1e-1819522008157Protein PHOSPHATE STARVATION RESPONSE 1
6j4r_D1e-1819522008157Protein PHOSPHATE STARVATION RESPONSE 1
Search in ModeBase
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK3625980.0AK362598.1 Hordeum vulgare subsp. vulgare mRNA for predicted protein, complete cds, clone: NIASHv2008K07.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_020161187.10.0nuclear pore complex protein NUP214
TrEMBLN1QX040.0N1QX04_AEGTA; Myb family transcription factor APL
STRINGEMT148940.0(Aegilops tauschii)
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G04030.35e-49G2-like family protein