PlantTFDB
PlantRegMap/PlantTFDB v5.0
Plant Transcription Factor Database
Transcription Factor Information
Basic Information
TF ID Cla001807
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Cucurbitales; Cucurbitaceae; Benincaseae; Citrullus
Family B3
Protein Properties Length: 1202 aa    MW: 137318 Da    pI: 6.2245
Description B3 family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Cla001807 genome ICuGI View CDS
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1 B3 38 2.9e-12 1 77 17 99
               E--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
         B3 17 vlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99
               +lpkkf++++g+    s  + l+ ++g +W++    +   ++++l++GW++F +  +Lk g  +vF++dg+s+f     +f++
  Cla001807  1 MLPKKFITDYGKF--LSNSICLKLPDGLEWKLGS--KTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTF--QTCIFDQ 77
               69********844..8889***************..999999***************************99999..7777765 PP

2 B3 58 1.7e-18 211 303 1 99
                EEEE-..-HHHHTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
         B3   1 ffkvltpsdvlksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99 
                ffkv +++ + k++ l++p  f+++ +g    + + t++d +g+sW ++l  +k ++  ++++GW+ Fv+ + Lk gDf+vF++dg+ +f   v++f k
  Cla001807 211 FFKVNIHKKSYKNSVLSIPPAFMKHLNGT--FPEKATIQDHTGKSWCITL--EKLDDLLYFKNGWQTFVDYHSLKYGDFLVFQYDGHCTF--DVTIFGK 303
                99*********************666766..67789**************..9**999***************************98888..8888876 PP

3 B3 50.7 3.2e-16 928 1009 12 99
                 HTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEEE-SSSEE..EEEEE-S CS
         B3   12 ksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFkldgrsefelvvkvfrk 99  
                  +++ ++p  f++ ++g    + + +++d+s +sW v+l  +  ++ +++++GW+eFv+++ Lk gDf+vF++dg++ f   vk+f+k
  Cla001807  928 DEHNHSIPPAFVKYFNGR--IPSEAVIRDQSRQSWHVTL--EELKNVVFFKDGWQEFVESHLLKLGDFLVFQYDGSHMF--DVKIFSK 1009
                 456689******655655..5668***************..********************************998777..9999987 PP

4 B3 43.1 7.6e-14 1100 1186 12 98
                 HTT-EE--HHH.HTT---..--SEEEEEETTS-EEEEEE..EEETTEEEE-TTHHHHHHHHT--TT-EEEEE..E-SS.SEE..EEEEE- CS
         B3   12 ksgrlvlpkkfaeehggkkeesktltledesgrsWevkliyrkksgryvltkGWkeFvkangLkegDfvvFk..ldgr.sefelvvkvfr 98  
                 +++ + +pk+ + +h+++   +  l++ +e+grsW v+     ++gr+ lt+GW  F +an L+e D ++F+  ld++  ++el+vk+ r
  Cla001807 1100 SHNAIHIPKTVMVTHNIS--LKPNLVIVNERGRSWLVTA-KPISRGRFALTTGWPAFFRANSLREDDECIFEfvLDSNnLCGELKVKITR 1186
                 567799*********554..3337999************.3556666*************************776777789999999876 PP
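
The Start and End columns give 1-based, inclusive positions on Cla001807, so the query row of each alignment should contain exactly End - Start + 1 residues once the gap characters are removed. A minimal Python sketch of that check, using the query row of the first domain; the helper names `ungap` and `span_matches` are illustrative and not part of PlantTFDB:

```python
# Query row of domain 1, copied from the alignment (Cla001807, positions 1 to 77).
DOMAIN1_ROW = (
    "MLPKKFITDYGKF--LSNSICLKLPDGLEWKLGS--KTANDTVWLQNGWQQFSNHYRLKPGSLLVFRFDGNSTF--QTCIFDQ"
)


def ungap(aligned: str) -> str:
    """Remove alignment gap characters ('-' and '.') from an aligned row."""
    return aligned.replace("-", "").replace(".", "")


def span_matches(aligned: str, start: int, end: int) -> bool:
    """Check that the ungapped row length equals the 1-based inclusive span."""
    return len(ungap(aligned)) == end - start + 1


if __name__ == "__main__":
    print(span_matches(DOMAIN1_ROW, 1, 77))
```

The same check can be applied to the other three domains by pasting their query rows and coordinates.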

Protein Features
Database         Entry ID            E-value    Start  End   InterPro ID  Description
CDD              cd10017             2.95E-11   1      76    No hit       No description
PROSITE profile  PS50863             14.071     1      78    IPR003340    B3 DNA binding domain
SuperFamily      SSF101936           2.75E-16   1      77    IPR015300    DNA-binding pseudobarrel domain
SMART            SM01019             0.004      1      78    IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10   1.9E-15    2      79    IPR015300    DNA-binding pseudobarrel domain
Pfam             PF02362             9.1E-9     2      77    IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10   2.4E-6     119    178   IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936           7.46E-7    119    177   IPR015300    DNA-binding pseudobarrel domain
Gene3D           G3DSA:2.40.330.10   1.3E-21    206    306   IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936           3.53E-23   207    306   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017             4.50E-16   210    301   No hit       No description
Pfam             PF02362             9.8E-16    211    301   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863             14.889     211    304   IPR003340    B3 DNA binding domain
SMART            SM01019             1.8E-15    211    304   IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10   2.2E-10    433    517   IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936           6.08E-12   434    517   IPR015300    DNA-binding pseudobarrel domain
CDD              cd10017             4.77E-7    438    517   No hit       No description
SMART            SM01019             0.0057     439    530   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863             9.46       469    533   IPR003340    B3 DNA binding domain
PROSITE profile  PS50863             14.057     917    1010  IPR003340    B3 DNA binding domain
SMART            SM01019             2.2E-10    918    1010  IPR003340    B3 DNA binding domain
Gene3D           G3DSA:2.40.330.10   3.7E-19    920    1012  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936           1.14E-19   928    1012  IPR015300    DNA-binding pseudobarrel domain
Pfam             PF02362             1.3E-13    929    1009  IPR003340    B3 DNA binding domain
CDD              cd10017             7.81E-16   933    1008  No hit       No description
Gene3D           G3DSA:2.40.330.10   3.1E-14    1088   1186  IPR015300    DNA-binding pseudobarrel domain
SuperFamily      SSF101936           7.85E-14   1088   1173  IPR015300    DNA-binding pseudobarrel domain
PROSITE profile  PS50863             11.843     1089   1188  IPR003340    B3 DNA binding domain
SMART            SM01019             8.8E-6     1091   1188  IPR003340    B3 DNA binding domain
CDD              cd10017             1.30E-10   1100   1186  No hit       No description
Pfam             PF02362             5.1E-11    1100   1186  IPR003340    B3 DNA binding domain
Gene Ontology
GO Term      GO Category         GO Description
GO:0006355   Biological Process  regulation of transcription, DNA-templated
GO:0003677   Molecular Function  DNA binding
Sequence
Protein Sequence    Length: 1202 aa
MLPKKFITDY GKFLSNSICL KLPDGLEWKL GSKTANDTVW LQNGWQQFSN HYRLKPGSLL  60
VFRFDGNSTF QTCIFDQTCL EIQYPSNNIG KTKPDDEEFN GYRYEEVETN NHEINDLKPE  120
KIGFKIVVKK STVEGRYNML IPKHFASKHL KEEFGRIEIE NSDGESWAMS YKWSQSRNVA  180
EYVYISRTTF SPPISAKNTN VKITTPNNNL FFKVNIHKKS YKNSVLSIPP AFMKHLNGTF  240
PEKATIQDHT GKSWCITLEK LDDLLYFKNG WQTFVDYHSL KYGDFLVFQY DGHCTFDVTI  300
FGKNGCKKAV AAKDASSVPI LEAEIAEAGN SVSNLEAIVA DAGNSVSNLE AVAADAGNSV  360
SNLEVVVADA GSSVPILKVK EEPVVEEEDV EPSISHKRKR LQVGSDTVRK SKSIVASNCG  420
RAGNASNSVE QVSPRGLFFE RTMKRWSRQT IYISGRVVRD ENISLKPNIV LRDEEGTLWP  480
ATVSFTSQNR ISVTAGWSKF YTGHKLRIND KCEFEFVLER GNVEEPFHYM PFFGIENFRI  540
EPFEIIPVRI NPEVRKYIEY SQFQEHSREM TCSQFHSSSA NEEPKYIQFE HEGVDSQQHD  600
QYFQDDDLQE DILSEGVDMD ITNELPISQS QEILYLEYQP PQTDKEDNWK SANTDTTELG  660
GNSFDIKENN MDTTTELREN SFGIEENNMD ITEQLGGNLF DIEENNIKQE KQSPVTVKAT  720
RKMKKRKSRE TTSFEVQEQN EETSEIDTDQ DSRRNVVTKQ KKKISEQSKE DDGKRKTRSK  780
RVKKSRISGT PSEHDDEVDV YKFDIQPLIA TKTEMEMPYM IPFGGVKPSK EKGKSPVDQE  840
HNSDARTSYN NDYCNMKGPQ SVSNDGVKNF LFTKIVNIEE ILGSLVHDID NLKNLFSKVC  900
ENVNEAADPE KMREVLFSQK VMERGKRDEH NHSIPPAFVK YFNGRIPSEA VIRDQSRQSW  960
HVTLEELKNV VFFKDGWQEF VESHLLKLGD FLVFQYDGSH MFDVKIFSKN GCKKERVSRT  1020
GCPCAVVKVK DEPQSEHNYS TSLTRCKRSD SEVRSTDSSG TAPKSRRRST SNLEELSPSK  1080
TAEHISMESP TFELMVKRWS HNAIHIPKTV MVTHNISLKP NLVIVNERGR SWLVTAKPIS  1140
RGRFALTTGW PAFFRANSLR EDDECIFEFV LDSNNLCGEL KVKITRSLEI TQEKQATMVP  1200
EG
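
The sequence is printed in blocks of 10 residues with a running position counter at the end of each line. A small Python sketch, assuming that layout, that joins the blocks into one string and slices a region using the 1-based, inclusive coordinates from the domain tables; `parse_blocked_sequence` and `slice_region` are hypothetical helper names, and only the first two lines of the sequence are used as sample input:

```python
import re

# First two lines of the sequence block, as printed in this record.
SAMPLE = """\
MLPKKFITDY GKFLSNSICL KLPDGLEWKL GSKTANDTVW LQNGWQQFSN HYRLKPGSLL  60
VFRFDGNSTF QTCIFDQTCL EIQYPSNNIG KTKPDDEEFN GYRYEEVETN NHEINDLKPE  120
"""


def parse_blocked_sequence(text: str) -> str:
    """Join residue blocks, dropping whitespace and the numeric position counters."""
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()


def slice_region(seq: str, start: int, end: int) -> str:
    """Return residues start..end, where both positions are 1-based and inclusive."""
    if start < 1 or end > len(seq) or start > end:
        raise ValueError("region lies outside the sequence")
    return seq[start - 1:end]


if __name__ == "__main__":
    seq = parse_blocked_sequence(SAMPLE)
    print(len(seq))
    # Region of the first B3 signature domain (Start 1, End 77).
    print(slice_region(seq, 1, 77))
```

Running the parser over the whole block should give a string whose length matches the stated 1202 aa, which is a quick way to confirm that nothing was lost when copying the sequence.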
Annotation -- Nucleotide
Source   Hit ID    E-value  Description
GenBank  LN681875  1e-175   LN681875.1 Cucumis melo genomic scaffold, anchoredscaffold00007.
GenBank  LN713262  1e-175   LN713262.1 Cucumis melo genomic chromosome, chr_8.
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_008440208.1  0.0      PREDICTED: uncharacterized protein LOC103484737 isoform X2
Refseq  XP_008440213.1  0.0      PREDICTED: uncharacterized protein LOC103484737 isoform X6
TrEMBL  A0A1S3B0K0      1e-180   A0A1S3B0K0_CUCME; uncharacterized protein LOC103484737 isoform X6
TrEMBL  A0A1S3B1B6      1e-180   A0A1S3B1B6_CUCME; uncharacterized protein LOC103484737 isoform X2
STRING  XP_008440207.1  1e-180   (Cucumis melo)
Orthologous Group
Lineage  Orthologous Group ID  Taxa Number  Gene Number
Fabids   OGEF2444533
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT5G66980.1  4e-25    B3 family protein