English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Modeling the mosaic structure of bacterial genomes to infer their evolutionary history

Sheinman, M., Arndt, P. F., & Massip, F. (2024). Modeling the mosaic structure of bacterial genomes to infer their evolutionary history. PNAS, 121(13): e2313367121. doi:10.1073/pnas.2313367121.

Item is

Files

show Files
hide Files
:
PNAS_Sheinman et al_2024.pdf (Publisher version), 3MB
Name:
PNAS_Sheinman et al_2024.pdf
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
-
Copyright Info:
© 2024, The Author(s)

Locators

show

Creators

show
hide
 Creators:
Sheinman, Michael , Author
Arndt, Peter F.1, Author                 
Massip, Florian, Author
Affiliations:
1Evolutionary Genomics (Peter Arndt), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society, ou_1479638              

Content

show
hide
Free keywords: bacterial evolution; molecular clock; mutation rate; maximal exact matches; horizontal gene transfer
 Abstract: The chronology and phylogeny of bacterial evolution are difficult to reconstruct due to a scarce fossil record. The analysis of bacterial genomes remains challenging because of large sequence divergence, the plasticity of bacterial genomes due to frequent gene loss, horizontal gene transfer, and differences in selective pressure from one locus to another. Therefore, taking advantage of the rich and rapidly accumulating genomic data requires accurate modeling of genome evolution. An important technical consideration is that loci with high effective mutation rates may diverge beyond the detection limit of the alignment algorithms used, biasing the genome-wide divergence estimates toward smaller divergences. In this article, we propose a novel method to gain insight into bacterial evolution based on statistical properties of genome comparisons. We find that the length distribution of sequence matches is shaped by the effective mutation rates of different loci, by the horizontal transfers, and by the aligner sensitivity. Based on these inputs, we build a model and show that it accounts for the empirically observed distributions, taking the Enterobacteriaceae family as an example. Our method allows to distinguish segments of vertical and horizontal origins and to estimate the time divergence and exchange rate between any pair of taxa from genome-wide alignments. Based on the estimated time divergences, we construct a time-calibrated phylogenetic tree to demonstrate the accuracy of the method.

Details

show
hide
Language(s): eng - English
 Dates: 2024-01-302024-03-22
 Publication Status: Published online
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1073/pnas.2313367121
PMID: 38517978
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: PNAS
  Other : Proceedings of the National Academy of Sciences of the United States of America
  Other : Proceedings of the National Academy of Sciences of the USA
  Abbreviation : Proc. Natl. Acad. Sci. U. S. A.
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 121 (13) Sequence Number: e2313367121 Start / End Page: - Identifier: ISSN: 0027-8424
CoNE: https://pure.mpg.de/cone/journals/resource/954925427230

Source 2

show
hide
Title: bioRxiv
Source Genre: Web Page
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: - Identifier: DOI: 10.1101/2023.09.22.558938