PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID 350906
Common NameARALYDRAFT_350906
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis
Family B3
Protein Properties Length: 309aa    MW: 35654 Da    PI: 10.1049
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
350906genomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1B3634.9e-2019112198
             EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTE.EEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE- CS
      B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgr.yvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfr 98 
             ffk+l ++d +++ + ++p + +++++  k +s++++l+ ++g sW+vk+  +k++   y+  +GW++Fv++ngL  ++f++F+++ +++f  +v++f+
  350906  19 FFKILRREDFSSEMMRMIPHHLIRSISD-KSSSFKMVLRVPWGSSWQVKI--SKNPIFhYMEDRGWNQFVNDNGLGLNEFLTFTHEANMCF--NVTIFE 112
             99************************64.5799*****************..9***9966777**********************999***..999987 PP

2B346.47e-152293061597
             -EE--HHH.HTT---..--SEEEEEETTS.-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE CS
      B3  15 rlvlpkkfaeehggkkeesktltledesg.rsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvf 97 
              l +pkkf++ h+ +  e+k+ ++ +++g +sWev +    ++ +  +++GW++ +k+ gL +gD+++F+l++   +e++vkv 
  350906 229 FLGIPKKFVDMHMPS--ETKMFKIHHPRGkKSWEVWY--VVNDVQSRFSGGWSRLAKELGLVVGDVCTFELIK--PTEMCVKVS 306
             368*******99855..66799999*99989******..556666669**********************987..666788876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:2.40.330.104.7E-2113115IPR015300DNA-binding pseudobarrel domain
SuperFamilySSF1019361.67E-2014118IPR015300DNA-binding pseudobarrel domain
CDDcd100171.62E-1817111No hitNo description
SMARTSM010191.1E-2419114IPR003340B3 DNA binding domain
PfamPF023623.8E-1819113IPR003340B3 DNA binding domain
PROSITE profilePS5086312.98519114IPR003340B3 DNA binding domain
SuperFamilySSF1019366.28E-20215305IPR015300DNA-binding pseudobarrel domain
PROSITE profilePS5086311.758215308IPR003340B3 DNA binding domain
Gene3DG3DSA:2.40.330.101.0E-18215306IPR015300DNA-binding pseudobarrel domain
CDDcd100172.37E-16216307No hitNo description
SMARTSM010192.9E-18218308IPR003340B3 DNA binding domain
PfamPF023621.5E-11229305IPR003340B3 DNA binding domain
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009553Biological Processembryo sac development
GO:0009567Biological Processdouble fertilization forming a zygote and endosperm
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 309 aa     Download sequence    Send to blast
MVRNGVFGQV MEERDNPAFF KILRREDFSS EMMRMIPHHL IRSISDKSSS FKMVLRVPWG  60
SSWQVKISKN PIFHYMEDRG WNQFVNDNGL GLNEFLTFTH EANMCFNVTI FEADGTEMLR  120
PRQPSTIASS SGRNKREEKK SIYIDVKKEE EIESWSESSY AGHKTAESTS GRLKQKQELN  180
LRKKEADKTE KSKKRKKKKV DTVCNDSEAG TSSLVPEFKL TIKKSHLLFL GIPKKFVDMH  240
MPSETKMFKI HHPRGKKSWE VWYVVNDVQS RFSGGWSRLA KELGLVVGDV CTFELIKPTE  300
MCVKVSKE*
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
1171183RLKQKQELNLRKK
2181198RKKEADKTEKSKKRKKKK
3192196KKRKK
4192198KKRKKKK
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAK2270350.0AK227035.1 Arabidopsis thaliana mRNA for hypothetical protein, complete cds, clone: RAFL09-55-P16.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_002873865.10.0hypothetical protein ARALYDRAFT_350906
SwissprotQ9FJG20.0Y5800_ARATH; B3 domain-containing protein At5g18000
TrEMBLD7LXQ00.0D7LXQ0_ARALL; Putative uncharacterized protein
STRINGfgenesh1_pg.C_scaffold_60014990.0(Arabidopsis lyrata)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM56521135
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G18000.11e-177VERDANDI