Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT
  Struo: a pipeline for building custom databases for common metagenome profilers

de la Cuesta-Zuluaga, J., Ley, R., & Youngblut, N. (2019). Struo: a pipeline for building custom databases for common metagenome profilers. Poster presented at German Conference on Bioinformatics (GCB 2019), Heidelberg, Germany.

Item is

Externe Referenzen

einblenden:
ausblenden:
Beschreibung:
-
OA-Status:
Keine Angabe

Urheber

einblenden:
ausblenden:
 Urheber:
de la Cuesta-Zuluaga, J1, Autor           
Ley, RE1, Autor           
Youngblut, ND1, Autor           
Affiliations:
1Department Microbiome Science, Max Planck Institute for Developmental Biology, Max Planck Society, ou_3375789              

Inhalt

einblenden:
ausblenden:
Schlagwörter: -
 Zusammenfassung: Background: Metagenome profiling is the most efficient method of obtaining comprehensive taxonomic and functional data from metagenomes, yet default databases accompanying metagenome profilers are not updated at a pace that reflects the rapid increase in microbial genomics data. The creation of updated comprehensive, custom databases is cumbersome due to the complexity and high computational requirements of retrieving the genomes, and configuring and executing the software. As a result, many metagenomic analyses fail to include the most up to date microbial data, missing critical insights. We address this with the development of Struo, an automatized and modular pipeline that assists in the retrieval of genomes and construction of databases for metagenome profilers.
Methods and results: Struo uses Snakemake and Conda to unify the workflow and build databases in a straight-forward, reproducible manner on Unix-based high-performance compute clusters. Currently, Struo supports Kraken2, Bracken2 and HUMANn2, and can be extended to include other tools. Publicly available or novel genomes can be used; here, we used Struo with 21,276 representative genomes of the GTDB to generate databases that broadly encompass known microbial diversity. This resulted in an increase of 25% more reads mapped from simulated and real metagenomes compared to default profiler databases.
Discussion: A carefully curated and tailored selection of genomes to be included in reference databases for metagenome profiling facilitates the exploration of microbiomes by increasing the fraction of reads mapped to a known reference. Struo empowers researchers to incorporate previously unexplored taxa in the study of hidden microbial diversity. Struo and the custom databases will be made public as open source resources.

Details

einblenden:
ausblenden:
Sprache(n):
 Datum: 2019-09
 Publikationsstatus: Online veröffentlicht
 Seiten: -
 Ort, Verlag, Ausgabe: -
 Inhaltsverzeichnis: -
 Art der Begutachtung: -
 Identifikatoren: -
 Art des Abschluß: -

Veranstaltung

einblenden:
ausblenden:
Titel: German Conference on Bioinformatics (GCB 2019)
Veranstaltungsort: Heidelberg, Germany
Start-/Enddatum: 2019-09-16 - 2019-09-19

Entscheidung

einblenden:

Projektinformation

einblenden:

Quelle 1

einblenden:
ausblenden:
Titel: German Conference on Bioinformatics (GCB 2019)
Genre der Quelle: Konferenzband
 Urheber:
Affiliations:
Ort, Verlag, Ausgabe: -
Seiten: - Band / Heft: - Artikelnummer: P 7.24 Start- / Endseite: 158 Identifikator: -