English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Exploiting physico-chemical properties in string kernels

Toussaint, N., Widmer, C., Kohlbacher, O., & Rätsch, G. (2010). Exploiting physico-chemical properties in string kernels. BMC Bioinformatics, 11(Supplement 8): S7. doi:10.1186/1471-2105-11-S8-S7.

Item is

Basic

show hide
Genre: Conference Paper

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Toussaint, NC, Author
Widmer, C1, Author           
Kohlbacher, O, Author           
Rätsch, G1, Author           
Affiliations:
1Rätsch Group, Friedrich Miescher Laboratory, Max Planck Society, ou_3378052              

Content

show
hide
Free keywords: -
 Abstract: Background: String kernels are commonly used for the classification of biological sequences, nucleotide as well as amino acid sequences. Although string kernels are already very powerful, when it comes to amino acids they have a major short coming. They ignore an important piece of information when comparing amino acids: the physico-chemical properties such as size, hydrophobicity, or charge. This information is very valuable, especially when training data is less abundant. There have been only very few approaches so far that aim at combining these two ideas.
Results: We propose new string kernels that combine the benefits of physico-chemical descriptors for amino acids with the ones of string kernels. The benefits of the proposed kernels are assessed on two problems: MHC-peptide binding classification using position specific kernels and protein classification based on the substring spectrum of the sequences. Our experiments demonstrate that the incorporation of amino acid properties in string kernels yields improved performances compared to standard string kernels and to previously proposed non-substring kernels.
Conclusions: In summary, the proposed modifications, in particular the combination with the RBF substring kernel, consistently yield improvements without affecting the computational complexity. The proposed kernels therefore appear to be the kernels of choice for any protein sequence-based inference.

Details

show
hide
Language(s):
 Dates: 2010-10
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1186/1471-2105-11-S8-S7
PMID: 21034432
 Degree: -

Event

show
hide
Title: NIPS Workshop on Machine Learning in Computational Biology (MLCB 2019)
Place of Event: Whistler, BC, Canada
Start-/End Date: 2009-12-10 - 2009-12-11

Legal Case

show

Project information

show

Source 1

show
hide
Title: BMC Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: BioMed Central
Pages: 9 Volume / Issue: 11 (Supplement 8) Sequence Number: S7 Start / End Page: - Identifier: ISSN: 1471-2105
CoNE: https://pure.mpg.de/cone/journals/resource/111000136905000