English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Digital Microbe: a genome-informed data integration framework for team science on emerging model organisms

MPS-Authors
/persons/resource/persons300200

Eren,  A. Murat
Max Planck Institute for Marine Microbiology, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

s41597-024-03778-z.pdf
(Publisher version), 31MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Veseli, I., Demers, M. A., Cooper, Z. S., Schechter, M. S., Miller, S., Weber, L., et al. (2024). Digital Microbe: a genome-informed data integration framework for team science on emerging model organisms. SCIENTIFIC DATA, 11(1): 967. doi:10.1038/s41597-024-03778-z.


Cite as: https://hdl.handle.net/21.11116/0000-0010-F1AF-9
Abstract
The remarkable pace of genomic data generation is rapidly transforming our understanding of life at the micron scale. Yet this data stream also creates challenges for team science. A single microbe can have multiple versions of genome architecture, functional gene annotations, and gene identifiers; additionally, the lack of mechanisms for collating and preserving advances in this knowledge raises barriers to community coalescence around shared datasets. "Digital Microbes" are frameworks for interoperable and reproducible collaborative science through open source, community-curated data packages built on a (pan)genomic foundation. Housed within an integrative software environment, Digital Microbes ensure real-time alignment of research efforts for collaborative teams and facilitate novel scientific insights as new layers of data are added. Here we describe two Digital Microbes: 1) the heterotrophic marine bacterium Ruegeria pomeroyi DSS-3 with > 100 transcriptomic datasets from lab and field studies, and 2) the pangenome of the cosmopolitan marine heterotroph Alteromonas containing 339 genomes. Examples demonstrate how an integrated framework collating public (pan)genome-informed data can generate novel and reproducible findings.