English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

Quantifying and reducing spurious alignments for the analysis of ultra-short ancient DNA sequences

MPS-Authors
/persons/resource/persons72538

de Filippo,  Cesare
Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

/persons/resource/persons72846

Meyer,  Matthias
Advanced DNA Sequencing Techniques, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

/persons/resource/persons72912

Prüfer,  Kay       
Genomes, Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

de Filippo_Quantifying_BMCBio_2018.pdf
(Publisher version), 2MB

Supplementary Material (public)
There is no public supplementary material available
Citation

de Filippo, C., Meyer, M., & Prüfer, K. (2018). Quantifying and reducing spurious alignments for the analysis of ultra-short ancient DNA sequences. BMC Biology, 16: 121. doi:10.1186/s12915-018-0581-9.


Cite as: https://hdl.handle.net/21.11116/0000-0002-64F9-5
Abstract
Background: The study of ancient DNA is hampered by degradation, resulting in short DNA fragments. Advances in laboratory methods have made it possible to retrieve short DNA fragments, thereby improving access to DNA preserved in highly degraded, ancient material. However, such material contains large amounts of microbial contamination in addition to DNA fragments from the ancient organism. The resulting mixture of sequences constitutes a challenge for computational analysis, since microbial sequences are hard to distinguish from the ancient sequences of interest, especially when they are short.

Results: Here, we develop a method to quantify spurious alignments based on the presence or absence of rare variants. We find that spurious alignments are enriched for mismatches and insertion/deletion differences and lack substitution patterns typical of ancient DNA. The impact of spurious alignments can be reduced by filtering on these features and by imposing a sample-specific minimum length cutoff. We apply this approach to sequences from four ~ 430,000-year-old Sima de los Huesos hominin remains, which contain particularly short DNA fragments, and increase the amount of usable sequence data by 17–150%. This allows us to place a third specimen from the site on the Neandertal lineage.

Conclusions: Our method maximizes the sequence data amenable to genetic analysis from highly degraded ancient material and avoids pitfalls that are associated with the analysis of ultra-short DNA sequences.