  Needle: a fast and space-efficient prefilter for estimating the quantification of very large collections of expression experiments

Darvish, M., Seiler, E., Mehringer, S., Rahn, R., & Reinert, K. (2022). Needle: a fast and space-efficient prefilter for estimating the quantification of very large collections of expression experiments. Bioinformatics, 38(17), 4100-4108. doi:10.1093/bioinformatics/btac492.

Files

Bioinformatics_Darvish et al_2022.pdf (Publisher version), 644KB
Name:
Bioinformatics_Darvish et al_2022.pdf
Description:
-
OA-Status:
Gold
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
© The Author(s) 2022

Creators

 Creators:
Darvish, Mitra (1), Author
Seiler, Enrico (1), Author
Mehringer, Svenja (1), Author
Rahn, René (1), Author
Reinert, Knut (2), Author
Affiliations:
(1) IMPRS for Biology and Computation (Anne-Dominique Gindrat), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479666
(2) Efficient Algorithms for Omics Data (Knut Reinert), Max Planck Fellow Group, Max Planck Institute for Molecular Genetics, Max Planck Society, ou_2385698

Content

Free keywords: -
 Abstract: Motivation: The ever-growing size of sequencing data is a major bottleneck in bioinformatics as the advances of hardware development cannot keep up with the data growth. Therefore, an enormous amount of data is collected but rarely ever reused, because it is nearly impossible to find meaningful experiments in the stream of raw data.

Results: As a solution, we propose Needle, a fast and space-efficient index which can be built for thousands of experiments in <2 h and can estimate the quantification of a transcript in these experiments in seconds, thereby outperforming its competitors. The basic idea of the Needle index is to create multiple interleaved Bloom filters that each store a set of representative k-mers depending on their multiplicity in the raw data. This is then used to quantify the query.
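
Illustration: the idea in the Results paragraph (several interleaved Bloom filters, each holding the k-mers of one multiplicity range) can be pictured with the toy Python sketch below. It is not the authors' implementation. The class and function names, the thresholds (1, 4, 16), the `min_fraction` cutoff and the use of all k-mers instead of a representative subset are assumptions made only for this example.

```python
import hashlib


class InterleavedBloomFilter:
    """Toy interleaved Bloom filter: each hash slot keeps one bit per experiment."""

    def __init__(self, num_experiments, slots=4096, num_hashes=2):
        self.num_experiments = num_experiments
        self.slots = slots
        self.num_hashes = num_hashes
        # Each slot is an integer used as a bit mask over experiments.
        self.table = [0] * slots

    def _positions(self, kmer):
        # Derive num_hashes slot positions by salting one hash function.
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(
                kmer.encode(), digest_size=8, salt=bytes([seed])
            ).digest()
            yield int.from_bytes(digest, "big") % self.slots

    def insert(self, kmer, experiment):
        for pos in self._positions(kmer):
            self.table[pos] |= 1 << experiment

    def lookup(self, kmer):
        """Return a bit mask of the experiments that may contain the k-mer."""
        mask = (1 << self.num_experiments) - 1
        for pos in self._positions(kmer):
            mask &= self.table[pos]
        return mask


def kmers(sequence, k):
    return [sequence[i:i + k] for i in range(len(sequence) - k + 1)]


def build_index(experiments, thresholds):
    """experiments: list of dicts mapping k-mer -> multiplicity in the raw data."""
    filters = [InterleavedBloomFilter(len(experiments)) for _ in thresholds]
    for exp_id, counts in enumerate(experiments):
        for kmer, count in counts.items():
            # Assumption: a k-mer goes into the filter of the highest
            # threshold that its multiplicity reaches.
            levels = [i for i, t in enumerate(thresholds) if count >= t]
            if levels:
                filters[levels[-1]].insert(kmer, exp_id)
    return filters


def estimate(filters, thresholds, query, k, min_fraction=0.5):
    """Per experiment, report the highest threshold supported by the query k-mers."""
    num_experiments = filters[0].num_experiments
    query_kmers = kmers(query, k)
    estimates = [0] * num_experiments
    hits = [0] * num_experiments  # accumulated from the highest level downwards
    for level in reversed(range(len(thresholds))):
        for kmer in query_kmers:
            mask = filters[level].lookup(kmer)
            for e in range(num_experiments):
                if (mask >> e) & 1:
                    hits[e] += 1
        for e in range(num_experiments):
            if estimates[e] == 0 and hits[e] >= min_fraction * len(query_kmers):
                estimates[e] = thresholds[level]
    return estimates


K = 4
THRESHOLDS = [1, 4, 16]  # ascending, hypothetical multiplicity levels
transcript = "ACGTACGTGA"
experiments = [
    {km: 20 for km in kmers(transcript, K)},  # transcript highly abundant
    {km: 3 for km in kmers(transcript, K)},   # transcript weakly abundant
    {"TTTT": 50},                             # transcript absent
]
index = build_index(experiments, THRESHOLDS)
result = estimate(index, THRESHOLDS, transcript, K)
print(result)
```

For this toy input the sketch reports the threshold 16 for the first experiment, 1 for the second and 0 for the third. The reported values are coarse levels, not exact counts, which is the sense in which such an index estimates rather than measures the quantification.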

Details

Language(s): eng - English
 Dates: 2022-07-07, 2022-07-08, 2022-09
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1093/bioinformatics/btac492
PMID: 35801930
PMC: PMC9438961
 Degree: -

Source 1

Title: Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Oxford : Oxford University Press
Pages: -
Volume / Issue: 38 (17)
Sequence Number: -
Start / End Page: 4100 - 4108
Identifier: ISSN: 1367-4803
CoNE: https://pure.mpg.de/cone/journals/resource/954926969991