Association measures and prior information in the reconstruction of gene 
networks.

Ghanbari, Mahsa

Local TagsRelease HistoryDetailsSummary

Association measures and prior information in the reconstruction of gene networks.

Ghanbari, M. (2016). Association measures and prior information in the reconstruction of gene networks. PhD Thesis.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-002D-4311-9 Version Permalink: https://hdl.handle.net/11858/00-001M-0000-002D-4312-7

Genre: Thesis

Files

show Files

hide Files

:

MahsaGhanbari.pdf (Any fulltext), 3MB

View Save

File Permalink:
https://hdl.handle.net/11858/00-001M-0000-002D-4313-5

Name:
MahsaGhanbari.pdf

Description:
-

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
-

Locators

show

Creators

show

hide

Creators:
Ghanbari, Mahsa^{1, 2}, Author

Affiliations:
1Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1433547
2Freie Universität Berlin, External Organizations, Berlin, Germany, ou_131875

Content

show

hide

Free keywords: association measure; prior knowledge; gene regulatory network; distance correlation; Bayesian network

Abstract: Reconstructing gene regulatory networks (GRNs) from expression data is a challenging task that has become essential to the understanding of complex regulatory mechanisms in cells. The major issues are the usually very high ratio of the number of genes to the sample size, and the noise in the available data. In this thesis we investigate the effect of the number of samples and noise on the performance of statistical methods. The results indicate that in the case of not having many samples and/or in facing high amount of noise like the case for gene expression data, the performance of all methods decreased significantly compared to the well behaved case (many samples and no noise). Integrating biological prior knowledge to the learning process is a natural and promising way to partially compensate for the lack of reliable expression data and to increase the accuracy of network reconstruction algorithms. In this thesis, we present PriorPC, a new algorithm based on the PC algorithm that uses prior knowledge. Despite being one of the most popular methods for Bayesian network reconstruction, the PC algorithm is known to depend strongly on the order in which nodes are presented, especially for large networks. PriorPC exploits this flaw to include prior knowledge. We show on both synthetic and real data that the structural accuracy of networks obtained with PriorPC is greatly improved compared to the PC algorithm. Furthermore, PriorPC is fast and scales well for large networks which is important for its applicability to experimental data. Another challenge in GRN reconstruction is to detect (direct) nonlinear interactions between genes. A recently proposed association measure named distance correlation is a powerful method to find nonlinear relationships. In this thesis, we propose a novel approach to estimate partial distance correlation, the generalization of distance correlation which accounts for the influence of other variables and therefore it can detect direct nonlinear relationships.

Details

show

hide

Language(s): eng - English

Dates: Accepted: 2016

Publication Status: Accepted / In Press

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: -

Degree: PhD

Event

show

Legal Case

show

Project information

show

Source

show