PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Glyma.20G166800.1.p
Common Name GLYMA_20G166800, LOC100799041
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family Trihelix
Protein Properties Length: 631 aa    MW: 71231.9 Da    pI: 5.7514
Description Trihelix family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Glyma.20G166800.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  93.5   2.1e-29  68     151  1          86
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          rW++qe+laL+++r++m+  +r+++lk+plWeev++k++e g++rs+k+Ckek+en+ k++k++ke++++++  + +t+++fdql+
  Glyma.20G166800.1.p  68 RWPRQETLALLKIRSDMDAVFRDSSLKGPLWEEVARKLSELGYHRSAKKCKEKFENVYKYHKRTKESRSGKH--EGKTYKFFDQLQ 151
                          8********************************************************************964..4557******98 PP

2    trihelix  100.4  1.5e-31  444    529  1          87
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                          rW+k ev+aLi++r+++e +++++  k+p We++s+ m + g++rs+k+Ckekwen+nk++kk+ke++k+r  e+s+tcpyf++lea
  Glyma.20G166800.1.p 444 RWPKTEVHALIRLRTSLEAKYQENGPKAPFWEDISAGMLRLGYNRSAKRCKEKWENINKYFKKVKESNKQR-REDSKTCPYFHELEA 529
                          8*********************************************************************8.78999********85 PP

Protein Features
3D Structure
Database         Entry ID  E-value   Start  End  InterPro ID  Description
SMART            SM00717   0.009     65     127  IPR001005    SANT/Myb domain
CDD              cd12203   1.77E-23  67     132  No hit       No description
PROSITE profile  PS50090   7.05      67     125  IPR017877    Myb-like domain
Pfam             PF13837   4.0E-20   67     153  No hit       No description
PROSITE profile  PS50090   7.224     437    501  IPR017877    Myb-like domain
SMART            SM00717   0.14      441    503  IPR001005    SANT/Myb domain
Pfam             PF13837   1.8E-21   443    530  No hit       No description
CDD              cd12203   1.00E-25  444    508  No hit       No description
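
The eight database hits above appear to fall into two regions of the protein, consistent with the two trihelix signature domains. A minimal Python sketch of that grouping follows. The coordinates are read from the table above, and the names `HITS` and `merge_intervals` are illustrative and not part of PlantTFDB.

```python
# Feature hits as (start, end) pairs, 1-based inclusive,
# transcribed from the Protein Features table (an assumption of this sketch).
HITS = [
    (65, 127),   # SMART SM00717
    (67, 132),   # CDD cd12203
    (67, 125),   # PROSITE profile PS50090
    (67, 153),   # Pfam PF13837
    (437, 501),  # PROSITE profile PS50090
    (441, 503),  # SMART SM00717
    (443, 530),  # Pfam PF13837
    (444, 508),  # CDD cd12203
]


def merge_intervals(intervals):
    """Merge overlapping (start, end) intervals into sorted, disjoint regions."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend its end if needed.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


if __name__ == "__main__":
    for start, end in merge_intervals(HITS):
        print(f"region {start}-{end} ({end - start + 1} aa)")
```

Under these assumptions the hits collapse to two regions, 65-153 and 437-530, each of which contains one of the signature domains (68-151 and 444-529).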
Sequence
Protein Sequence    Length: 631 aa
MELIGDTTTV METSSGEAVA AHDGGEVIMM DANSGEEENN NKGEEGEEEE EGDNKINSNN  60
NSLCGGNRWP RQETLALLKI RSDMDAVFRD SSLKGPLWEE VARKLSELGY HRSAKKCKEK  120
FENVYKYHKR TKESRSGKHE GKTYKFFDQL QALENQFTVS YSPKPQPTLA TTTNIITLPP  180
PTRPSDTTAI SYVTTTVPST NPTIISPSPQ PPTHATTTTT ITSPTVATNP KNPPQSNNNS  240
NIPNYSLLNM NNLFSTTSTS SSTASDEDLE EKYRKKRKWK DYFRRLTRQV LAKQEEMQKR  300
FLEAIDNRER EQVAQQEAWR IQEMARINRE HELLVQERST AAAKNAAVIA FLQQLSGQHQ  360
NSTTTKAGAN FLQQPLPQQV QPPPQQAPQP LMMSNNNNIE IQKMNNGHSV VAAATPTTVV  420
AATAIATTAV TTTPSSLSSL SSSRWPKTEV HALIRLRTSL EAKYQENGPK APFWEDISAG  480
MLRLGYNRSA KRCKEKWENI NKYFKKVKES NKQRREDSKT CPYFHELEAL YKEKSKTTQN  540
PFGASFHNMK PHEMMEPLMV QPEQQWRPPT QYEQGAAKEN NNSERKEREE EEEEEDDDEN  600
EEGDLESVED EGGNRYEIAT NKLSSVDTVE *
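
The domain coordinates in this record are 1-based and inclusive, so they can be checked against the sequence block directly. The following Python sketch strips the position counters and the trailing `*` from the block and slices out the two trihelix domains. The helper names `parse_numbered_sequence` and `domain_slice` are hypothetical, not a PlantTFDB API. The block as printed contains 630 residue letters, so the stated length of 631 appears to count the terminal `*`.

```python
import re

# Sequence block copied from this record, including position counters and '*'.
RECORD_SEQUENCE = """\
MELIGDTTTV METSSGEAVA AHDGGEVIMM DANSGEEENN NKGEEGEEEE EGDNKINSNN  60
NSLCGGNRWP RQETLALLKI RSDMDAVFRD SSLKGPLWEE VARKLSELGY HRSAKKCKEK  120
FENVYKYHKR TKESRSGKHE GKTYKFFDQL QALENQFTVS YSPKPQPTLA TTTNIITLPP  180
PTRPSDTTAI SYVTTTVPST NPTIISPSPQ PPTHATTTTT ITSPTVATNP KNPPQSNNNS  240
NIPNYSLLNM NNLFSTTSTS SSTASDEDLE EKYRKKRKWK DYFRRLTRQV LAKQEEMQKR  300
FLEAIDNRER EQVAQQEAWR IQEMARINRE HELLVQERST AAAKNAAVIA FLQQLSGQHQ  360
NSTTTKAGAN FLQQPLPQQV QPPPQQAPQP LMMSNNNNIE IQKMNNGHSV VAAATPTTVV  420
AATAIATTAV TTTPSSLSSL SSSRWPKTEV HALIRLRTSL EAKYQENGPK APFWEDISAG  480
MLRLGYNRSA KRCKEKWENI NKYFKKVKES NKQRREDSKT CPYFHELEAL YKEKSKTTQN  540
PFGASFHNMK PHEMMEPLMV QPEQQWRPPT QYEQGAAKEN NNSERKEREE EEEEEDDDEN  600
EEGDLESVED EGGNRYEIAT NKLSSVDTVE *
"""


def parse_numbered_sequence(text):
    """Drop whitespace, position counters, and the trailing stop symbol."""
    residues = re.sub(r"[^A-Za-z*]", "", text)
    return residues.rstrip("*").upper()


def domain_slice(seq, start, end):
    """Return residues start..end, using 1-based inclusive coordinates."""
    return seq[start - 1:end]


if __name__ == "__main__":
    seq = parse_numbered_sequence(RECORD_SEQUENCE)
    print(len(seq))
    # Coordinates taken from the Signature Domain table of this record.
    for start, end in [(68, 151), (444, 529)]:
        print(start, end, domain_slice(seq, start, end))
```

Both slices begin with `RWP` and match the query lines of the two alignments shown under Signature Domain.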
Expression -- UniGene
UniGene ID  E-value  Expressed in
Gma.30276   0.0      cotyledon | leaf
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Glyma.20G166800.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID           E-value  Description
Refseq     XP_006606164.1   0.0      trihelix transcription factor GT-2
Swissprot  Q39117           1e-143   TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     I1NH22           0.0      I1NH22_SOYBN; Uncharacterized protein
STRING     GLYMA20G30650.1  0.0      (Glycine max)
Orthologous Group
Lineage               Orthologous Group ID  Taxa Number  Gene Number
Fabids                OGEF37334181
Representative plant  OGRP6631573
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  6e-50    Trihelix family protein