English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  SLALOM, a flexible method for the identification and statistical analysis of overlapping continuous sequence elements in sequence- and time-series data

Prytuliak, R., Pfeiffer, F., & Habermann, B. H. (2018). SLALOM, a flexible method for the identification and statistical analysis of overlapping continuous sequence elements in sequence- and time-series data. BMC Bioinformatics, 19: 24. doi:10.1186/s12859-018-2020-x.

Item is

Files

show Files
hide Files
:
Prytuliak.pdf (Publisher version), 825KB
Name:
Prytuliak.pdf
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
© The Author(s). 2018
:
3988656.zip (Supplementary material), 5MB
Name:
3988656.zip
Description:
-
OA-Status:
Visibility:
Public
MIME-Type / Checksum:
application/zip / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
License:
-

Locators

show

Creators

show
hide
 Creators:
Prytuliak, Roman1, Author           
Pfeiffer, Friedhelm1, Author           
Habermann, Bianca Hermine1, Author           
Affiliations:
1Habermann, Bianca / Computational Biology, Max Planck Institute of Biochemistry, Max Planck Society, ou_1832284              

Content

show
hide
Free keywords: PROTEIN SECONDARY STRUCTURE; PREDICTION; ACCURACY; MOTIFSBiochemistry & Molecular Biology; Biotechnology & Applied Microbiology; Mathematical & Computational Biology;
 Abstract: Background: Protein or nucleic acid sequences contain a multitude of associated annotations representing continuous sequence elements (CSEs). Comparing these CSEs is needed, whenever we want to match identical annotations or integrate distinctive ones. Currently, there is no ready-to-use software available that provides comprehensive statistical readout for comparing two annotations of the same type with each other, which can be adapted to the application logic of the scientific question. Results: We have developed a method, SLALOM (for StatisticaL Analysis of Locus Overlap Method), to perform comparative analysis of sequence annotations in a highly flexible way. SLALOM implements six major operation modes and a number of additional options that can answer a variety of statistical questions about a pair of input annotations of a given sequence collection. We demonstrate the results of SLALOM on three different examples from biology and economics and compare our method to already existing software. We discuss the importance of carefully choosing the application logic to address specific scientific questions. Conclusion: SLALOM is a highly versatile, command-line based method for comparing annotations in a collection of sequences, with a statistical read-out for performance evaluation and benchmarking of predictors and gene annotation pipelines. Abstraction from sequence content even allows SLALOM to compare other kinds of positional data including, for example, data coming from time series.

Details

show
hide
Language(s): eng - English
 Dates: 2018-01-26
 Publication Status: Published online
 Pages: 19
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: ISI: 000423722900001
DOI: 10.1186/s12859-018-2020-x
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: BMC Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: BioMed Central
Pages: - Volume / Issue: 19 Sequence Number: 24 Start / End Page: - Identifier: ISSN: 1471-2105
CoNE: https://pure.mpg.de/cone/journals/resource/111000136905000