PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.002G231500.1
Common NameB456_002G231500, LOC105786253
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 476aa    MW: 53739.3 Da    PI: 9.1907
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.002G231500.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix91.11.2e-2859143187
            trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqlea 87 
                         rW++qe+laL+++r++m+ ++r+++ k+plWeevs+k++e g++rs+k+Ckek+en+ k++k++k+g++++   + +t+++ dqlea
  Gorai.002G231500.1  59 RWPRQETLALLKLRSDMDVTFREASVKGPLWEEVSRKLAELGYHRSAKKCKEKFENVYKYHKRTKDGRSGK--ADGKTYRFCDQLEA 143
                         8********************************************************************96..56668******985 PP

2trihelix102.62.9e-32316400186
            trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkmrergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                         rW+k e+ aLi++r+++++++++++ k+plWee+s+ m++ g++r++k+Ckekwen+nk++kk+ke++k+r + +s+tcpyf+ql+
  Gorai.002G231500.1 316 RWPKVEIEALIKIRTSLDSKYQDNSPKGPLWEEISNGMKKLGYNRNAKRCKEKWENINKYFKKVKESNKQR-PVDSKTCPYFHQLD 400
                         8*********************************************************************8.9999********97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SMARTSM007177.6E-556118IPR001005SANT/Myb domain
PROSITE profilePS500907.03958116IPR017877Myb-like domain
CDDcd122032.07E-2358123No hitNo description
PfamPF138374.1E-1858143No hitNo description
SMARTSM007170.0034313375IPR001005SANT/Myb domain
CDDcd122033.31E-27315380No hitNo description
PROSITE profilePS500906.98315373IPR017877Myb-like domain
PfamPF138373.8E-22315401No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 476 aa     Download sequence    Send to blast
MLGGGGTTAS VSSGGGCNGS NEAAAPVAVF DTNDGNSNNS GKDDRSKVDE GDRSFGGNRW  60
PRQETLALLK LRSDMDVTFR EASVKGPLWE EVSRKLAELG YHRSAKKCKE KFENVYKYHK  120
RTKDGRSGKA DGKTYRFCDQ LEAFQNQPSI QWPPPPPVAA AATINQSISA VQMSNSTSSS  180
TSSDLELQGR KKRKRKWKDF FERLMKEVIQ KQQVMQKTFL EAIEKHERER IVRDEAWKVQ  240
EMSRLNRERE ILAQERSIAA AKDAAIMAFL QKLSEKQNLG QSQNSPLPPP AVVPAAVAPP  300
PDNGNQIQTH TPSSSRWPKV EIEALIKIRT SLDSKYQDNS PKGPLWEEIS NGMKKLGYNR  360
NAKRCKEKWE NINKYFKKVK ESNKQRPVDS KTCPYFHQLD VLYREKNKHD CSSKSNPLMV  420
RPEKQWPPPL EPHQQHHDTI MEDMMESDQN DDEEEDEGGS YELVASKPVS MGTAE*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1189194RKKRKR
2189195RKKRKRK
3190195KKRKRK
Functional Description ? help Back to Top
Source Description
UniProtProbable transcription factor that binds specific DNA sequence. {ECO:0000250}.
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJQ0130920.0JQ013092.1 Gossypium hirsutum trihelix transcription factor (GT7) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012468066.10.0PREDICTED: trihelix transcription factor GT-2-like
SwissprotQ391173e-83TGT2_ARATH; Trihelix transcription factor GT-2
TrEMBLA0A0D2MEV10.0A0A0D2MEV1_GOSRA; Uncharacterized protein
STRINGGorai.002G231500.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM48492553
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G76890.27e-73Trihelix family protein
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]