English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Philentropy: Information Theory and Distance Quantification with R

Drost, H.-G. (2018). Philentropy: Information Theory and Distance Quantification with R. The Journal of Open Source Software, 3(26): 765. doi:10.21105/joss.00765.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Drost, H-G1, Author                 
Affiliations:
1External Organizations, ou_persistent22              

Content

show
hide
Free keywords: -
 Abstract: Comparison is a fundamental method of scientific research leading to insights about the processes that generate similarity or dissimilarity. In statistical terms comparisons between probability functions are performed to infer connections, correlations, or relationships between objects or samples (Cha 2007). Most quantification methods rely on distance or similarity measures, but the right choice for each individual application is not always clear and sometimes poorly explored. The reason for this is partly that diverse measures are either implemented in different R packages with very different notations or are not implemented at all. Thus, a comprehensive framework implementing the most common similarity and distance measures using a uniform notation is still missing. The R (R Core Team 2018) package Philentropy aims to fill this gap by implementing forty-six fundamental distance and similarity measures (Cha 2007) for comparing probability functions. These comparisons between probability functions have their foundations in a broad range of scientific disciplines from mathematics to ecology. The aim of this package is to provide a comprehensive and computationally optimized base framework for clustering, classification, statistical inference, goodness-of-fit, non-parametric statistics, information theory, and machine learning tasks that are based on comparing univariate or multivariate probability functions. All functions are written in C++ and are integrated into the R package using the Rcpp Application Programming Interface (API) (Eddelbuettel 2013). Together, this framework allows building new similarity or distance based (statistical) models and algorithms in R which are computationally efficient and scalable. The comprehensive availability of diverse metrics and measures furthermore enables a systematic assessment of choosing the most optimal similarity or distance measure for individual applications in diverse scientific disciplines. The following probability distance/similarity and information theory measures are implemented in Philentropy.

Details

show
hide
Language(s):
 Dates: 2018-05
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.21105/joss.00765
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: The Journal of Open Source Software
  Abbreviation : J. Open Source Softw.
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: 4 Volume / Issue: 3 (26) Sequence Number: 765 Start / End Page: - Identifier: ISSN: 2475-9066
CoNE: https://pure.mpg.de/cone/journals/resource/2475-9066