PlantTFDB
Plant Transcription Factor Database
PlantRegMap/PlantTFDB v5.0
Previous version: v3.0 v4.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Sobic.001G166900.1.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; PACMAD clade; Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum
Family B3
Protein Properties Length: 722aa    MW: 81627.8 Da    PI: 8.8485
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Sobic.001G166900.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B3573.6e-1834122599
                           -..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..E CS
                    B3   5 ltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelv 93 
                           l+p  +++++ +v+pk+f++++ gk   s t++l +++  s++v++  + + +++v++ GW++Fv+++++ke+D ++F++++ s+f   
  Sobic.001G166900.1.p  34 LSPMTASSKHSMVVPKRFLKHFAGK--LSGTIKLDSPNRGSYDVEV--TEHCNKVVFRHGWRQFVESHDIKENDYLLFRHVEGSCF--E 116
                           56677888999********888877..6779***************..********************************998999..8 PP

                           EEEE-S CS
                    B3  94 vkvfrk 99 
                           v +f++
  Sobic.001G166900.1.p 117 VLIFDT 122
                           888875 PP

2B343.84.5e-142543351596
                           -EE--HHH.HTT---..--SEEEEEE.TTS-EEEEEE..EEETTE.EEE-TTHHHHHHHHT--TT-EEEEE..E-SS.SEE..EEEE CS
                    B3  15 rlvlpkkfaeehggkkeesktltled.esgrsWevkliyrkksgr.yvltkGWkeFvkangLkegDfvvFk..ldgr.sefelvvkv 96 
                            lv++k +a +h   +++s+ +tl++  ++++W  k+ y++k+g+ y+l++ W +Fv++n+++egD+++F   + gr s+f  +v +
  Sobic.001G166900.1.p 254 YLVISKGYALAHF--PHKSTNVTLQTpGKSKKWHPKF-YKRKDGQlYMLKEQWMDFVRDNHVQEGDICLFLptMAGRrSTF--TVYL 335
                           589******9997..457789*****5566*******.65555556************************94333443444..6655 PP

3B368.41e-21431522697
                           ..-HHHHT.T-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SS.SEE.. CS
                    B3   6 tpsdvlks.grlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgr.sefel 92 
                           ++s+v+ + + lv+ k++a++h   +ees+ +tle + g++W+ kl +r  +++y+lt+GW++Fv++n+L+e D+++F+ + + ++f+ 
  Sobic.001G166900.1.p 431 KKSNVNHLrSDLVICKRYAAQHF--PEESQSITLERQGGKKWRSKLHVRPDGSGYLLTTGWQNFVRDNHLQEDDICLFQPMPSkKGFRV 517
                           44444443455***********8..456778***************99***999*************************8855599999 PP

                           EEEEE CS
                    B3  93 vvkvf 97 
                           +v+++
  Sobic.001G166900.1.p 518 MVHLL 522
                           88875 PP

4B332.41.7e-106577173499
                           SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
                    B3  34 ktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                           + ltl+ ++g+ W  +l   ++   ++ t+ W +Fv++ngL+++D+++F+ +++++  ++v+++r+
  Sobic.001G166900.1.p 657 QSLTLR-QQGKAWHTNL--HNRL--MLATGEWHQFVRDNGLEDRDICLFEPMKNERLAMLVHIIRS 717
                           346655.59********..4443..556667*********************99*********996 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.103.2E-2128125IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.35E-2229129IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086312.8330123IPR003340B3 DNA binding domain
CDDcd100177.55E-2131121No hitNo description
SMARTSM010191.3E-1333123IPR003340B3 DNA binding domain
PfamPF023625.3E-1634121IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.105.4E-20231338IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019367.46E-19234331IPR015300DNA-binding pseudobarrel domain
SMARTSM010191.5E-6239339IPR003340B3 DNA binding domain
CDDcd100179.01E-20239336No hitNo description
PROSITE profilePS5086310.447240339IPR003340B3 DNA binding domain
PfamPF023621.0E-11254336IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.104.5E-24419523IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019362.75E-23420523IPR015300DNA-binding pseudobarrel domain
CDDcd100176.47E-21424520No hitNo description
SMARTSM010192.3E-7426525IPR003340B3 DNA binding domain
PROSITE profilePS5086313.197427525IPR003340B3 DNA binding domain
PfamPF023627.0E-21429521IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.107.2E-14617717IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.75E-11621719IPR015300DNA-binding pseudobarrel domain
CDDcd100172.48E-8624716No hitNo description
SMARTSM010190.12625718IPR003340B3 DNA binding domain
PfamPF023628.5E-9655717IPR003340B3 DNA binding domain
PROSITE profilePS5086310.87659718IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 722 aa     Download sequence    Send to blast
MARRGASRIQ TPCDACKRYL DHLDEKNKNV RSFLSPMTAS SKHSMVVPKR FLKHFAGKLS  60
GTIKLDSPNR GSYDVEVTEH CNKVVFRHGW RQFVESHDIK ENDYLLFRHV EGSCFEVLIF  120
DTDGCEKMFS CAGIRSVDYV NISSDCHHET TESSASERFV RCQKGSSCHH GKIAKKVAAF  180
SSSESGEDIP SENKSSESDD LQTPLRQHYV LSRRNYLSKA QEERVIALIQ EIQPESTAFV  240
AVMRKSHVQP PCPYLVISKG YALAHFPHKS TNVTLQTPGK SKKWHPKFYK RKDGQLYMLK  300
EQWMDFVRDN HVQEGDICLF LPTMAGRRST FTVYLIQATS TCSRGGSGKR GSLSHQKEIA  360
KKAAVSSLYE ESGEDSLSGY ESIQSDHVKA FSECNYVLSA RCHLTVEQEK KIVGLFKKVQ  420
PEIPFLVIQM KKSNVNHLRS DLVICKRYAA QHFPEESQSI TLERQGGKKW RSKLHVRPDG  480
SGYLLTTGWQ NFVRDNHLQE DDICLFQPMP SKKGFRVMVH LLHDPSTRSS SSGGHAHGLN  540
SHVKRRVTST AHVHEKSGSE NSGSLDLHKC RSVQQVHQVF SDCEVVPSSS MPPLYVVLGG  600
TCLTPAQDKV VQEKAMAIKA EVSIFVATMN DKIVGYHYEP VILDLSDAAQ YLPDGKQSLT  660
LRQQGKAWHT NLHNRLMLAT GEWHQFVRDN GLEDRDICLF EPMKNERLAM LVHIIRSQQY  720
S*
Cis-element ? help Back to Top
SourceLink
PlantRegMapSobic.001G166900.1.p
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_021318281.10.0B3 domain-containing protein Os03g0619600 isoform X2
TrEMBLA0A1B6QJC20.0A0A1B6QJC2_SORBI; Uncharacterized protein
STRINGSb01g014595.10.0(Sorghum bicolor)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP58531172
Representative plantOGRP1136337
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G33280.12e-20B3 family protein