English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Dynamic programming algorithms for two statistical problems in computational biology

Rahmann, S. (2003). Dynamic programming algorithms for two statistical problems in computational biology. Algorithms in Bioinformatics, Proceedings, 2812, 151-164. doi:10.1007/b13243.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Rahmann, Sven1, Author           
Affiliations:
1Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1433547              

Content

show
hide
Free keywords: -
 Abstract: We present dynamic programming algorithms for two exact statistical tests that frequently arise in computational biology. The first test concerns the decision whether an observed sequence stems from a given profile (also known as position specific score matrix or position weight matrix), or from an assumed background distribution. We show that the common assumption that the log-odds score has a Gaussian distribution is false for many short profiles, such as transcription factor binding sites or splice sites. We present an efficient implementation of a non-parametric method (first mentioned by Staden) to compute the exact score distribution. The second test concerns the decision whether observed category counts stem from a specified Multinomial distribution. A branch-and-bound method for computing exact p-values for this test was presented by Bejerano at a recent RECOMB conference. Our contribution is a dynamic programming approach to compute the entire distribution of the test statistic, allowing not only the computation of exact p-values for all values of the test statistic simultaneously, but also of the power function of the test. As one of several applications, we introduce p-value based sequence logos, which provide a more meaningful visual description of probabilistic sequences than conventional sequence logos do.

Details

show
hide
Language(s): eng - English
 Dates: 2003
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: eDoc: 194744
ISI: 000188697400012
DOI: 10.1007/b13243
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Algorithms in Bioinformatics, Proceedings
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 2812 Sequence Number: - Start / End Page: 151 - 164 Identifier: -