Statistical Learning with Similarity and Dissimilarity Functions

von Luxburg, U

Local TagsRelease HistoryDetailsSummary

Statistical Learning with Similarity and Dissimilarity Functions

von Luxburg, U. (2004). Statistical Learning with Similarity and Dissimilarity Functions. Berlin, Germany: Logos Verlag.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-DA7B-F Version Permalink: https://hdl.handle.net/21.11116/0000-0003-D07E-5

Genre: Book

Files

show Files

Locators

show

hide

Locator:
https://www.logos-verlag.de/cgi-bin/engbuchmid?isbn=0767&lng=deu&id= (Abstract) Open Access status unknown

Description:
-

OA-Status:

Creators

show

hide

Creators:
von Luxburg, U^{1, 2, 3}, Author

Affiliations:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795
2Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_1497794
3Department Human Perception, Cognition and Action, Max Planck Institute for Biological Cybernetics, Max Planck Society, Spemannstrasse 38, 72076 Tübingen, DE, ou_1497797

Content

show

hide

Free keywords: -

Abstract: This work explores statistical properties of machine learning algorithms from different perspectives. Questions arising both in the fields of supervised and unsupervised learning, dealing with diverse issues such as the convergence of algorithms, the speed of convergence, generalization bounds, and how statistical properties can be used in practical machine learning applications are investigated. All topics covered have the common feature that the properties of the similarity or dissimilarity function on the data play an important role.

Learning is the process of inferring general rules from given examples. The examples are instances of some input space (pattern space), and the rules can consist of some general observation about the structure of the input space, or have the form of a functional dependancy between the input and some output space. Two types of learning problems are considered: classification and clustering. In both problems, the goal is to divide the input space into several regions such that objects within the same region "belong together" and "are different" from the objects in the other regions. The difference between the two problems is that classification is a supervised learning technique while clustering is unsupervised.

Machine learning algorithms are usually designed to deal with either similarities or dissimilarities. In general it is recommended to close an algorithm which can deal with the type of data given, but sometimes it may become necessary to convert similarities into dissimilarities or vice versa. In some situations this can be done without loosing information, especially if the similarities and distances are defined by a scalar product in an Euclidean space. If this is not the case, several heuristics can be invoked. The general idea is to transform a similarity into a dissimilarity function or vice versa by applying a monotonically decreasing function. This is according to the general intuition that a distance is small if the similarity is large, and vice versa. The connection between information theory and learning can be exploited in every-day machine learning applications.

Details

show

hide

Language(s):

Dates: Date issued: 2004

Publication Status: Issued

Pages: 166

Publishing info: Berlin, Germany : Logos Verlag

Table of Contents: -

Rev. Type: -

Identifiers: BibTex Citekey: 2836
ISBN: 978-3-8325-0767-1

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: MPI Series in Biological Cybernetics

Source Genre: Series

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 10 Sequence Number: - Start / End Page: - Identifier: -