Help Privacy Policy Disclaimer
  Advanced SearchBrowse





Context-specific independence mixture models for cluster analysis of biological data


Georgi,  Benjamin
Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

(Any fulltext), 3MB

Supplementary Material (public)
There is no public supplementary material available

Georgi, B. (in preparation). Context-specific independence mixture models for cluster analysis of biological data.

Cite as: https://hdl.handle.net/11858/00-001M-0000-0010-7D6F-0
Clustering is a crucial first step in the exploratory analysis of biological data. This thesis is concerned with cluster analysis of biological data using mixture models. Mixture models is a class of powerful and versatile statistical models. We develop an extension to the conventional mixtures in form of the context-specific independence (CSI) framework. CSI mixtures are particularly suited for the analysis of biological data since they perform robustly in the presence of noise and uninformative features in the data. This is achieved by adapting the model complexity to the degree of variation observed in a given data set. We present a learning algorithm for CSI mixtures in a Bayesian framework. We apply CSI mixture clustering on data sets of transcription factor binding sites, protein sequences and complex disease phenotype data.