PlantTFDB
Plant Transcription Factor Database
v5.0
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.001G166100.4.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 1145aa    MW: 129481 Da    PI: 9.4774
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.001G166100.4.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B350.63.6e-1634122599
                           -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                    B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                           l+p  +++++ +v+pk+f++++ gk   s  ++le+++  s++v +    + + +v++ GW +Fv+++++ke+D ++F++++ s+f  +
  Sobic.001G166100.4.p  34 LSPMTASSKHSMVVPKRFLKHFAGK--LSGIIKLESPNRGSYDVGI--IEHCNNVVFRHGWGQFVESHHIKENDYLLFRHVEGSCF--K 116
                           56677888999********888777..6679***************..9*******************************998999..9 PP

                           EEEE-S CS
                    B3  94 vkvfrk 99 
                           v +f++
  Sobic.001G166100.4.p 117 VLIFDS 122
                           999875 PP

2B338.81.6e-122563351696
                           EE--HHH.HTT---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEE..E-SS.SEE..EEEE CS
                    B3  16 lvlpkkfaeehggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFk..ldgr.sefelvvkv 96 
                           l ++k +a +h   +++s+ +tl+   ++++W  k++ ++k+ +++l++ W +Fv++n+++egD+++F   + gr s+f  +v +
  Sobic.001G166100.4.p 256 LAISKGYALAHF--PRKSMNVTLQRpGKSKKWHPKFC-KRKDAQMLLKGQWMDFVRDNHVQEGDICIFLptMAGRrSTF--TVYL 335
                           789999999997..56899******5566*******4.444445899999******************94333443444..6655 PP

3B357.52.5e-18431523597
                           -..-HHHHT.T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE. CS
                    B3   5 ltpsdvlks.grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefe 91 
                           +++s+v+ + + lv+ k++a+eh  +  es+ +tle + gr+W+ +l +r  +++++lt+ W++Fv++n+ +  D+++F+ + + ++f+
  Sobic.001G166100.4.p 431 MKKSNVNHLrSDLVICKDYAAEHFPQ--ESQFITLERPGGRKWRTRLYVRPDGRAFMLTTRWQNFVHDNHFQKDDICLFQPMPNeKGFR 517
                           444555444456***********855..6678***************88999999*************************996669999 PP

                           .EEEEE CS
                    B3  92 lvvkvf 97 
                            +v+++
  Sobic.001G166100.4.p 518 VMVHLL 523
                           999876 PP

4B360.14e-19643734596
                           -..-HHHHT.T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-.SSSEE. CS
                    B3   5 ltpsdvlks.grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkld.grsefe 91 
                           +++s+v+++ ++lv+ k +a++h  +  +s+ +tle ++g++W  +l +r + ++y+lt+ W++Fv++n+L+  D+++F+ + ++++f+
  Sobic.001G166100.4.p 643 MKKSNVNRLsSNLVICKGYAAQHFPQ--KSQFITLERPRGKKWCSRLHVRPHERAYMLTTRWQNFVRDNQLRKDDICLFQPMpSEKGFR 729
                           556666665566***********855..5668***************99*******************************884559998 PP

                           .EEEE CS
                    B3  92 lvvkv 96 
                            +v++
  Sobic.001G166100.4.p 730 VMVHL 734
                           88876 PP

5B353.83.4e-17859946998
                           HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE..EEEE CS
                    B3   9 dvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefelvvkv 96 
                             l+s +lv+ k +a++h  +  es+ +tle + g++W  kl +r  +++y+l++ Wk+Fv++n+L+e D+++F+ + + ++f+ + ++
  Sobic.001G166100.4.p 859 KRLNS-NLVICKGYAAQHFPQ--ESQFITLECPGGKRWHPKLHVRPDGRGYMLSTQWKNFVRDNRLREDDICLFQPMPSeKGFRVMAHL 944
                           34444.5***********855..6678***************99*******************************88444999888877 PP

                           E- CS
                    B3  97 fr 98 
                           +r
  Sobic.001G166100.4.p 945 LR 946
                           76 PP

6B331.14.1e-10106811402199
                            HH.HTT..---..--SEEEEEE.TTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                    B3   21 kfaeeh..ggkkeesktltled.esgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99  
                            + a+e+  +gk  +   ltl+   +gr W+  l   ++   ++ t+ W+eFv++ gL++gD+++F+ ++ ++  ++v+++r+
  Sobic.001G166100.4.p 1068 SDAAEYlpDGK--Q--SLTLRWqGQGRAWRTDL--HNRL--MLATGEWREFVRDSGLEDGDICLFEPMK-ERLAMLVHIIRS 1140
                            55666654333..4..4555554699*******..4443..556677*******************887.788899999885 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF1019366.47E-2228130IPR015300DNA-binding pseudobarrel domain
Gene3DG3DSA:2.40.330.104.2E-2028125IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.98430123IPR003340B3 DNA binding domain
CDDcd100176.49E-2031121No hitNo description
SMARTSM010194.3E-1533123IPR003340B3 DNA binding domain
PfamPF023623.1E-1434122IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.2E-18232338IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.94E-17233337IPR015300DNA-binding pseudobarrel domain
CDDcd100173.02E-17240336No hitNo description
SMARTSM010192.2E-4240339IPR003340B3 DNA binding domain
PROSITE profilePS5086311.194241339IPR003340B3 DNA binding domain
PfamPF023624.7E-10256336IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.106.2E-21419521IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019365.49E-21421525IPR015300DNA-binding pseudobarrel domain
CDDcd100177.76E-19425521No hitNo description
SMARTSM010199.3E-7427526IPR003340B3 DNA binding domain
PROSITE profilePS5086311.476428526IPR003340B3 DNA binding domain
PfamPF023627.6E-17430523IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.106.1E-22631733IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.35E-20634734IPR015300DNA-binding pseudobarrel domain
CDDcd100177.70E-18637733No hitNo description
SMARTSM010195.4E-7639739IPR003340B3 DNA binding domain
PROSITE profilePS5086311.293640738IPR003340B3 DNA binding domain
PfamPF023629.0E-18642734IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.7E-20841945IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019364.12E-20843947IPR015300DNA-binding pseudobarrel domain
CDDcd100172.77E-17847940No hitNo description
SMARTSM010190.0043849941IPR003340B3 DNA binding domain
PROSITE profilePS5086311.265850948IPR003340B3 DNA binding domain
PfamPF023626.6E-16861946IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.1E-1310361141IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.92E-1110411141IPR015300DNA-binding pseudobarrel domain
SMARTSM010190.003710471142IPR003340B3 DNA binding domain
PROSITE profilePS5086312.05510481141IPR003340B3 DNA binding domain
CDDcd100173.96E-910701139No hitNo description
PfamPF023624.8E-810771140IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1145 aa     Download sequence    Send to blast
MAGSGASSIQ KPCDACKRYL DHLDGKNQNV RSFLSPMTAS SKHSMVVPKR FLKHFAGKLS  60
GIIKLESPNR GSYDVGIIEH CNNVVFRHGW GQFVESHHIK ENDYLLFRHV EGSCFKVLIF  120
DSDGCEKVFP CAGIRSVEYV DISSSSHHET TESLASERFV RCQKGSSCHR GKTAKMAAAF  180
SSSEESGENI PSKNKSSELD DLQTPLRQHY VLSQRSYLSE AQEERVIALI QEIQPESTAF  240
IAVMCKSHVQ PPCPYLAISK GYALAHFPRK SMNVTLQRPG KSKKWHPKFC KRKDAQMLLK  300
GQWMDFVRDN HVQEGDICIF LPTMAGRRST FTVYLIQATT TCSRGGSGKR GSLSRQKETA  360
KKAATSSLYE DSGGEDSLSG YESIQLDHFK AFSKRNYVLS AWCHLTAEQE EKIVALVKKV  420
QPEIPFLVVQ MKKSNVNHLR SDLVICKDYA AEHFPQESQF ITLERPGGRK WRTRLYVRPD  480
GRAFMLTTRW QNFVHDNHFQ KDDICLFQPM PNEKGFRVMV HLLHEPSTRS SSLCRHVHGL  540
NSHINRGVTP TAHVHEKSGS ERDSLSCQKE TTKKAGTSSL HEESAGEDSL SGHESIQSDH  600
VKAFSERNYV LSARCHLTAE QEEEIITLVK KVQPAIPFLV IQMKKSNVNR LSSNLVICKG  660
YAAQHFPQKS QFITLERPRG KKWCSRLHVR PHERAYMLTT RWQNFVRDNQ LRKDDICLFQ  720
PMPSEKGFRV MVHLLCEPRT RSSSLGGHAH GLNSHIKRVT STAHVHEKSG SERGSLSCQK  780
ETANKARTSS LYEESEEGTL SGYESTQLDH VKAFSERYYV LSARCHLTAE QKEKIVALVK  840
KVQPEIPVLV VKMKKINVKR LNSNLVICKG YAAQHFPQES QFITLECPGG KRWHPKLHVR  900
PDGRGYMLST QWKNFVRDNR LREDDICLFQ PMPSEKGFRV MAHLLRERST RSSSSDGHVH  960
GLHSHIERGL ASTAHVHEKS GSENSGLLDL HKRQPVQQGH QVLNDCGGAS SSKPPLYVVL  1020
GGTCLTPAQD KVVQEKAMAI KAEVSIFVAT MNKKILGYNN EAFILDFSDA AEYLPDGKQS  1080
LTLRWQGQGR AWRTDLHNRL MLATGEWREF VRDSGLEDGD ICLFEPMKER LAMLVHIIRS  1140
KQYS*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.001G166100.4.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021313439.10.0uncharacterized protein LOC8062507 isoform X1
RefseqXP_021313446.10.0uncharacterized protein LOC8062507 isoform X1
TrEMBLA0A1Z5S6040.0A0A1Z5S604_SORBI; Uncharacterized protein
STRINGSb01g014595.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP58531172
Representative plantOGRP1136337
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.16e-19B3 family protein