English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

How evolution of genomes is reflected in exact DNA sequence match statistics

MPS-Authors
/persons/resource/persons188264

Massip,  Florian
Evolutionary Genomics (Peter Arndt), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society;
UR1077, Unite Mathematiques Informatique et Genome, INRA, domaine de Vilvert, Jouy-en-Josas, France ;

/persons/resource/persons188283

Sheinman,  Michel
Evolutionary Genomics (Peter Arndt), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society;

/persons/resource/persons50074

Arndt,  Peter F.
Evolutionary Genomics (Peter Arndt), Dept. of Computational Molecular Biology (Head: Martin Vingron), Max Planck Institute for Molecular Genetics, Max Planck Society;

External Resource
Fulltext (public)

Massip.pdf
(Publisher version), 639KB

Supplementary Material (public)
There is no public supplementary material available
Citation

Massip, F., Sheinman, M., Schbath, S., & Arndt, P. F. (2015). How evolution of genomes is reflected in exact DNA sequence match statistics. Mol Biol Evol, 32(2), 524-535. doi:10.1093/molbev/msu313.


Cite as: http://hdl.handle.net/11858/00-001M-0000-002A-3AAC-A
Abstract
Genome evolution is shaped by a multitude of mutational processes, including point mutations, insertions, and deletions of DNA sequences, as well as segmental duplications. These mutational processes can leave distinctive qualitative marks in the statistical features of genomic DNA sequences. One such feature is the match length distribution (MLD) of exactly matching sequence segments within an individual genome or between the genomes of related species. These have been observed to exhibit characteristic power law decays in many species. Here, we show that simple dynamical models consisting solely of duplication and mutation processes can already explain the characteristic features of MLDs observed in genomic sequences. Surprisingly, we find that these features are largely insensitive to details of the underlying mutational processes and do not necessarily rely on the action of natural selection. Our results demonstrate how analyzing statistical features of DNA sequences can help us reveal and quantify the different mutational processes that underlie genome evolution.