Analyzing Gene Expression Time-Courses IEEE/ACM Transactions on Computational 
Biology and Bioinformatics

Schliep, Alexander; Costa, Ivan G.; Steinhoff, Christine; Schonhuth, Alexander

doi:ieeecomputersociety.org/10.1109/TCBB.2005.31

Local TagsRelease HistoryDetailsSummary

Analyzing Gene Expression Time-Courses IEEE/ACM Transactions on Computational Biology and Bioinformatics

Schliep, A., Costa, I. G., Steinhoff, C., & Schonhuth, A. (2005). Analyzing Gene Expression Time-Courses IEEE/ACM Transactions on Computational Biology and Bioinformatics. Special Issue on Machine Learning for Bioinformatics, 179-193. doi:ieeecomputersociety.org/10.1109/TCBB.2005.31.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0010-85FD-F Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0010-85FE-D

Genre: Journal Article

Files

show Files

Locators

show

Creators

show

hide

Creators:
Schliep, Alexander¹, Author
Costa, Ivan G.¹, Author
Steinhoff, Christine¹, Author
Schonhuth, Alexander², Author

Affiliations:
1Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1433547
2Max Planck Society, ou_persistent13

Content

show

hide

Free keywords: Index Terms- Index Terms- Mixture modeling hidden Markov models, partially supervised learning, gene expression, time-course analysis

Abstract: Measuring gene expression over time can provide important insights into basic cellular processes. Identifying groups of genes with similar expression time-courses is a crucial first step in the analysis. As biologically relevant groups frequently overlap, due to genes having several distinct roles in those cellular processes, this is a difficult problem for classical clustering methods. We use a mixture model to circumvent this principal problem, with hidden Markov models (HMMs) as effective and flexible components. We show that the ensuing estimation problem can be addressed with additional labeled data—partially supervised learning of mixtures—through a modification of the Expectation-Maximization (EM) algorithm. Good starting points for the mixture estimation are obtained through a modification to Bayesian model merging, which allows us to learn a collection of initial HMMs. We infer groups from mixtures with a simple information-theoretic decoding heuristic, which quantifies the level of ambiguity in group assignment. The effectiveness is shown with high-quality annotation data. As the HMMs we propose capture asynchronous behavior by design, the groups we find are also asynchronous. Synchronous subgroups are obtained from a novel algorithm based on Viterbi paths. We show the suitability of our HMM mixture approach on biological and simulated data and through the favorable comparison with previous approaches. A software implementing the method is freely available under the GPL from http://ghmm.org/gql.

Details

show

hide

Language(s): eng - English

Dates: Date issued: 2005-07-09

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: eDoc: 268496
DOI: ieeecomputersociety.org/10.1109/TCBB.2005.31

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: Special Issue on Machine Learning for Bioinformatics

Source Genre: Issue

Creator(s):
Ling, C.X., Editor
Noble, W.S., Editor
Yang, Q., Editor

Affiliations:
-

Publ. Info: -

Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 179 - 193 Identifier: -

Source 2

show

hide

Title: IEE/ACM Transactions on Computational Biology and Bioinformatics

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: -

Pages: - Volume / Issue: 2 (3) Sequence Number: - Start / End Page: - Identifier: -