PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v1.0, v2.0, v3.0
Transcription Factor Information
Basic Information
TF ID WALNUT_00010949-RA
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fagales; Juglandaceae; Juglans
Family C3H
Protein Properties Length: 710 aa    MW: 84008.5 Da    pI: 6.6269
Description C3H family protein
Gene Model
Gene Model ID       Type    Source  Coding Sequence
WALNUT_00010949-RA  genome  JHU     View CDS
Signature Domain
No.  Domain   Score  E-value  Start  End  HMM Start  HMM End
1    zf-CCCH  22.5   2.1e-07  224    243  6          25
                         -SGGGGTS--TTTTT-SS-S CS
             zf-CCCH   6 CrffartGtCkyGdrCkFaH 25 
                         C+f+++tG C++G rC++ H
  WALNUT_00010949-RA 224 CPFHLKTGACRFGLRCSRVH 243
                         ******************99 PP

Protein Features
Database         Entry ID           E-value   Start  End  InterPro ID  Description
PROSITE profile  PS50103            11.779    218    246  IPR000571    Zinc finger, CCCH-type
Pfam             PF00642            2.6E-5    223    243  IPR000571    Zinc finger, CCCH-type
PRINTS           PR01848            1.6E-23   224    243  IPR009145    U2 auxiliary factor small subunit
PRINTS           PR01848            1.6E-23   243    263  IPR009145    U2 auxiliary factor small subunit
PROSITE profile  PS50102            9.098     250    342  IPR000504    RNA recognition motif domain
Gene3D           G3DSA:3.30.70.330  1.9E-23   251    342  IPR012677    Nucleotide-binding alpha-beta plait domain
CDD              cd12540            8.30E-46  251    342  No hit       No description
SMART            SM00361            0.0011    274    346  IPR003954    RNA recognition motif domain, eukaryote
PRINTS           PR01848            1.6E-23   279    294  IPR009145    U2 auxiliary factor small subunit
SuperFamily      SSF54928           3.39E-13  282    342  IPR012677    Nucleotide-binding alpha-beta plait domain
PRINTS           PR01848            1.6E-23   307    329  IPR009145    U2 auxiliary factor small subunit
PROSITE profile  PS50103            7.761     329    357  IPR000571    Zinc finger, CCCH-type
Gene Ontology
GO Term     GO Category         GO Description
GO:0000398  Biological Process  mRNA splicing, via spliceosome
GO:0089701  Cellular Component  U2AF
GO:0000166  Molecular Function  nucleotide binding
GO:0003723  Molecular Function  RNA binding
GO:0046872  Molecular Function  metal ion binding
Sequence
Protein Sequence    Length: 710 aa
MAEPVPNKAE EEDDDEAECT VKMEGLLQIM SRKEKRKAKK KMKRKQIRKE MAEKEREEEE  60
ARLNDPEEQR REGVMEEEEK ERKERERKQF EDRERAWIEA MEKKNKEEEE RRMALEEEKS  120
KRLQGENENE FNEDDDWEYV EEGPAEIIWQ GNEIIVKKKR VRVPKKDANQ QSRQENADRP  180
TSNPLPPQSE AFSSYKNASS AEHILKNVSQ QVPNFGTEQD KAHCPFHLKT GACRFGLRCS  240
RVHFYPDKSF TLLIRNMYNG PGLAWEQDEG LEYTDEEVER CYEEFYEDVH TEFLKFGEIV  300
NFKVCRNGSF HLRGNVYIQY KSLDSAALAY HSSNGRYFAG KQTCSHGTAC NFIHCFRNPG  360
GDYEWADCDK PPPKYWVKKM TALFGYYDEA GHEKQVEQEN LGHRRHSSKM MTDVDRSCSR  420
RSKSREMDFL NSSSCSGRNG SGNDVPKSTR WRQYASDDRK QMKVLGEDMY EENTNLKETN  480
HMKSRTHDTD SDGEWLDKDG DKERYRDRYH GSASKSSRSQ NGDKKHRSCG DESKGDWSDS  540
EDRETHNRHA RKRSRHSREA RLLDDCEDQE NRRREWSKRD GDRETYHGQM TKSARHHRKV  600
KYPDDHWDSK NRTHDTGVDW SDGERDRDRD RDRDRHHRKR RKSSRHQRKV ERRDDHGDSK  660
NGNHEGEWLD RDSNSDREQS RRQKGSEQDS ISDDHGDSTS RADDTGFMND
3D Structure
Structure
PDB ID  Evalue  Query Start  Query End  Hit Start  Hit End  Description
4yh8_A  6e-25   215          354        9          167      Splicing factor U2AF 23 kDa subunit
Nuclear Localization Signal
NLS
No.  Start  End  Sequence
1    35     43   RKAKKKMKR
2    38     45   KKKMKRKQ
3    624    634  RDRDRDRDRDR
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  Retrieve            -
Annotation -- Protein
Source     Hit ID                 E-value  Description
Refseq     XP_008218649.1         0.0      PREDICTED: zinc finger CCCH domain-containing protein 5
Swissprot  Q9SY74                 1e-179   C3H5_ARATH; Zinc finger CCCH domain-containing protein 5
TrEMBL     M5XNZ2                 0.0      M5XNZ2_PRUPE; Uncharacterized protein
STRING     VIT_12s0035g01790.t01  0.0      (Vitis vinifera)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF4713              24           33
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT1G10320.1  1e-125   C3H family protein