PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Gorai.011G216400.1
Common NameB456_011G216400, LOC105777244
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium
Family Trihelix
Protein Properties Length: 373aa    MW: 40798.3 Da    PI: 10.0345
Description Trihelix family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Gorai.011G216400.1genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1trihelix51.23.3e-1650133186
            trihelix   1 rWtkqevlaLiearremeerlrrgklkkplWeevskkm....rergferspkqCkekwenlnkrykkikegekkrtsessstcpyfdqle 86 
                         +W++  v  L+ea+++++   +r+klk+++We+v++++      ++  ++++qCk+k+e+++kry+ + +++++      s++p++ +l+
  Gorai.011G216400.1  50 EWSEGAVSSLLEAYENKWVLRNRAKLKGHDWEDVARYVsaraNCTKSPKTQTQCKNKIESMKKRYRSESATAEG------SSWPLYPRLD 133
                         5*************************************844444455556679****************99997......4699999986 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
PfamPF138371.3E-1948133No hitNo description
Sequence ? help Back to Top
Protein Sequence    Length: 373 aa     Download sequence    Send to blast
MDKETNNQEN PSLLSNNNTN KEDCSPKKHP GSSTVTGGGG GSNDRLKRDE WSEGAVSSLL  60
EAYENKWVLR NRAKLKGHDW EDVARYVSAR ANCTKSPKTQ TQCKNKIESM KKRYRSESAT  120
AEGSSWPLYP RLDLLLRGNA AAAAAAAAPS PPPSLPLPPP PQQLHLSAVV QPQPPGPLFT  180
NLPLTLPEAS TLVVLQQQQQ PPPPPLPPAA PPPALAPQGL GTAQNSHGSN GFEKIPKDDG  240
AGTKVSDHLS DKVAIETDSS TPGLYSDKEK LRSKKLKMKT MENKKKKRRK KEEYREIGES  300
IRILAEVVLK SEESRMETLR EIEKMRIEAE TKRGEMELKR TEIIANTQLE IAKLFAGSSN  360
KGIDPSLRIG RS*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1283288KKKKRR
2283290KKKKRRKK
3284288KKKRR
4284289KKKRRK
5284290KKKRRKK
6285289KKRRK
7285290KKRRKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX9642361e-114JX964236.1 Gossypium hirsutum clone NBRI_TRANS-505 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_012455847.10.0PREDICTED: trihelix transcription factor ASIL2-like
TrEMBLA0A0D2UWK60.0A0A0D2UWK6_GOSRA; Uncharacterized protein
STRINGGorai.011G216400.10.0(Gossypium raimondii)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM98592533
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G54390.11e-92sequence-specific DNA binding transcription factors
Publications ? help Back to Top
  1. Paterson AH, et al.
    Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres.
    Nature, 2012. 492(7429): p. 423-7
    [PMID:23257886]