PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Glyma.20G166600.1.p
Common Name GLYMA_20G166600, LOC100817001
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family Trihelix
Protein Properties Length: 645aa    MW: 71720.9 Da    PI: 5.338
Description Trihelix family protein
Gene Model
Gene Model ID        Type    Source  Coding Sequence
Glyma.20G166600.1.p  genome  JGI     View CDS
Signature Domain
No.  Domain    Score  E-value  Start  End  HMM Start  HMM End
1    trihelix  97     1.7e-30  65     148  1          86
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                          rW++qe+laL+++r++m+ ++r+++ k+plWeevs+km+e g++rs+k+Ckek+en+ k++k++keg++++  ++ +t+++fdql+
  Glyma.20G166600.1.p  65 RWPRQETLALLRIRSDMDVAFRDASVKGPLWEEVSRKMAELGYHRSSKKCKEKFENVYKYHKRTKEGRSGK--QDGKTYRFFDQLQ 148
                          8********************************************************************95..67778******98 PP

2    trihelix  106.2  2.3e-33  459    544  1          87
             trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                          rW+k ev+aLi++r++m+e+++++  k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++k+r +e+s+tcpyf+ql+a
  Glyma.20G166600.1.p 459 RWPKVEVQALIKLRTSMDEKYQENGPKGPLWEEISASMKKLGYNRNAKRCKEKWENINKYFKKVKESNKRR-PEDSKTCPYFHQLDA 544
                          8********************************************************************97.99***********85 PP
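
The alignments above pair the trihelix HMM consensus (top line) with the query protein (third line); the middle line repeats a letter wherever the two agree. As a rough illustration, the agreement can be counted directly from the two aligned strings. This is a minimal sketch in Python: the function name `consensus_identity` is hypothetical and not part of PlantTFDB or HMMER, and treating `-` and `.` as gap characters is an assumption about the alignment format.

```python
def consensus_identity(consensus: str, query: str) -> tuple:
    """Return (identical, compared) over aligned columns, skipping gap columns.

    Letters are compared case-insensitively; '-' and '.' are assumed to be gaps.
    """
    if len(consensus) != len(query):
        raise ValueError("aligned strings must have equal length")
    identical = 0
    compared = 0
    for c, q in zip(consensus, query):
        if c in "-." or q in "-.":
            continue  # gap column: nothing to compare
        compared += 1
        if c.upper() == q.upper():
            identical += 1
    return identical, compared


# First ten aligned columns of domain 1, copied from the listing above.
hmm_fragment = "rWtkqevlaL"
query_fragment = "RWPRQETLAL"
identical, compared = consensus_identity(hmm_fragment, query_fragment)
print(f"{identical}/{compared} columns identical")
```

Passing the full consensus and query lines of either domain works the same way, provided both strings are copied with the same length.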

Protein Features
Database         Entry ID          E-value   Start  End  InterPro ID  Description
SMART            SM00717           7.1E-4    62     124  IPR001005    SANT/Myb domain
Pfam             PF13837           5.6E-20   64     150  No hit       No description
CDD              cd12203           4.24E-24  64     129  No hit       No description
PROSITE profile  PS50090           6.888     64     122  IPR017877    Myb-like domain
SMART            SM00717           2.7E-4    456    518  IPR001005    SANT/Myb domain
Pfam             PF13837           5.0E-22   458    545  No hit       No description
PROSITE profile  PS50090           7.201     458    516  IPR017877    Myb-like domain
Gene3D           G3DSA:1.10.10.60  1.7E-4    458    515  IPR009057    Homeodomain-like
CDD              cd12203           1.20E-28  458    523  No hit       No description
Gene Ontology
GO Term     GO Category         GO Description
GO:0003677  Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 645 aa
MLGDSALLGG GGGEGGASAD VVAATATHDA TTTTTTGGGG GGSNNSGDDE RGRIEEGERS  60
FGGNRWPRQE TLALLRIRSD MDVAFRDASV KGPLWEEVSR KMAELGYHRS SKKCKEKFEN  120
VYKYHKRTKE GRSGKQDGKT YRFFDQLQAL ENHSPTPHSP NPSSKPLQSA PSRVVATTTA  180
SSMSLPIPTP TTTVPMQPIL SNTIPTSSVP NITVPSTTIL PITIPQPILT TPSINLTIPS  240
YPPSNPTNFP PPSNPTPPLS FPTDTFSNST SSSSTSSDET LERRRKRKRK WKDFFERLMK  300
EVIEKQEELQ KKFLEAIEKR EHDRIAREEA WRVQEMQRIN REREILAQER SIAAAKDAAV  360
MSFLQKIAEQ QNLGQALTNI NLVQPQPQLQ PQPPVQQQVT PPNIVPAPMQ QPLPVIVTQP  420
VVLPVVSQVT NMEIMKADNN NNNNNNNNCE NFLPPSSSRW PKVEVQALIK LRTSMDEKYQ  480
ENGPKGPLWE EISASMKKLG YNRNAKRCKE KWENINKYFK KVKESNKRRP EDSKTCPYFH  540
QLDALYRQKH KAEESTAAAK AESAVAPLMV QPEQQWPPQQ QDDRDITMED MENEDDEYEE  600
EREGEEEEEE DEEDEEGGGG NYEIVANKTS GGGGAAASVG ASTE*
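
The sequence above is printed in blocks of ten residues with a running position counter at the end of each line, while the domain and feature tables refer to 1-based residue positions. The sketch below shows one way to turn such a listing back into a plain string and slice out a region by those coordinates. It is a minimal sketch in Python; the helper names `parse_blocked_sequence` and `region` are hypothetical, and only the first two lines of the listing are used as sample input.

```python
import re


def parse_blocked_sequence(text: str) -> str:
    """Join a blocked protein listing into one string.

    Everything that is not a letter (spaces, newlines, position counters,
    a trailing '*') is dropped.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, using 1-based inclusive coordinates."""
    if start < 1 or end > len(seq) or start > end:
        raise ValueError("coordinates out of range")
    return seq[start - 1:end]


# First two lines of the listing above (residues 1-120).
listing = """
MLGDSALLGG GGGEGGASAD VVAATATHDA TTTTTTGGGG GGSNNSGDDE RGRIEEGERS  60
FGGNRWPRQE TLALLRIRSD MDVAFRDASV KGPLWEEVSR KMAELGYHRS SKKCKEKFEN  120
"""

seq = parse_blocked_sequence(listing)
print(len(seq))             # number of residues parsed
print(region(seq, 65, 70))  # start of the first trihelix domain
```

With the full listing as input, `region(seq, 65, 148)` would give the query side of the first domain alignment with its gap characters removed.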
Nucleic Localization Signal
NLS
No.  Start  End  Sequence
1    283    288  RRKRKR
2    283    289  RRKRKRK
3    284    289  RKRKRK
Expression -- UniGene
UniGene ID  E-value  Expressed in
Gma.49888   0.0      cotyledon| flower| leaf| pod
Functional Description
Source   Description
UniProt  Probable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Cis-element
Source       Link
PlantRegMap  Glyma.20G166600.1.p
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID           E-value  Description
Refseq     XP_003556152.3   0.0      trihelix transcription factor GT-2
Swissprot  Q39117           1e-145   TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBL     A0A0R0EMD9       0.0      A0A0R0EMD9_SOYBN; Uncharacterized protein
STRING     GLYMA20G30640.1  0.0      (Glycine max)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids  OGEF37334181
Representative plant  OGRP6631573
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G76880.1  2e-52    Trihelix family protein