English
 
User Manual Privacy Policy Disclaimer Contact us
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Efficient Sparse-Group Bayesian Feature Selection for Gene Network Reconstruction

Steiger, E. (2018). Efficient Sparse-Group Bayesian Feature Selection for Gene Network Reconstruction. PhD Thesis, Freie Universität, Berlin.

Item is

Basic

show hide
Item Permalink: http://hdl.handle.net/21.11116/0000-0003-7469-5 Version Permalink: http://hdl.handle.net/21.11116/0000-0003-746A-4
Genre: Thesis

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Steiger, Edgar1, Author              
Affiliations:
1Gene regulation (Martin Vingron), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479639              

Content

show
hide
Free keywords: feature selection spike-and-slab networks gene network reconstruction expectation propagation Variablenselektion statistics Statistik Netzwerkrekonstruktion Bayes
 DDC: Probabilities & applied mathematics - 519
 Abstract: All the genes of an organism's genome build up an intricate network of connections between them. Many of these connections are unknown, but knowing about the structure of the network is important for e.g. medical applications. This leads to the problem of reverse engineering the (large-scale) gene regulatory network from gene expression data. Gene network reconstruction can be formulated as a problem of feature selection in a linear regression framework, and we include additional information (like co-binding of transcription factors) about the network with a grouping of features. Available methods for feature selection in the presence of grouping information have different short-comings: Lasso methods underestimate the regression coefficients and do not make good use of the grouping information, and Bayesian approaches often rely on the stochastic and slow Gibbs sampling procedure to recover the parameters, which makes them infeasible for gene network reconstruction. Here we present a Bayesian method for feature selection with grouping information (with sparsity on the between- and within group level), where the parameters are recovered by a deterministic algorithm (expectation propagation). This sparse-group framework is applied to (large-scale) gene network reconstruction from gene expression data and extended to the vector autoregressive model for time series data. We prove (on simulated and experimental data) that the Bayesian approach is the best choice for network reconstruction for three reasons: Highest number of correctly selected features, best prediction on new data and reasonable computing time. We show that a Bayesian approach to feature selection is superior to lasso methods on time series data. Results on experimental temporal data are inconclusive for the linear model. Finally we note that the presented method is very fundamental and not restricted to the reconstruction of gene regulatory networks, but can be applied to any feature selection problem with grouped features.

Details

show
hide
Language(s): eng - English
 Dates: 2018-07-20
 Publication Status: Accepted / In Press
 Pages: 115
 Publishing info: Berlin : Freie Universität
 Table of Contents: -
 Rev. Method: -
 Identifiers: -
 Degree: PhD

Event

show

Legal Case

show

Project information

show

Source

show