English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  hapCon: Estimating contamination of ancient genomes by copying from reference haplotypes

Huang, Y., & Ringbauer, H. (2022). hapCon: Estimating contamination of ancient genomes by copying from reference haplotypes. Bioinformatics, 38(15): btac390, pp. 3768-3777. doi:10.1093/bioinformatics/btac390.

Item is

Files

show Files
hide Files
:
Huang_hapCon_Bioinf_2022.pdf (Publisher version), 5MB
Name:
Huang_hapCon_Bioinf_2022.pdf
Description:
-
OA-Status:
Hybrid
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
2022
Copyright Info:
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unre- stricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited
:
Huang_hapCon_Bioinf_Suppl_2022.pdf (Supplementary material), 11MB
Name:
Huang_hapCon_Bioinf_Suppl_2022.pdf
Description:
-
OA-Status:
Not specified
Visibility:
Public
MIME-Type / Checksum:
application/pdf / [MD5]
Technical Metadata:
Copyright Date:
2022
Copyright Info:
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unre- stricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited

Locators

show

Creators

show
hide
 Creators:
Huang, Yilei1, 2, Author           
Ringbauer, Harald1, Author                 
Affiliations:
1Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Max Planck Society, ou_3222712              
2The Leipzig School of Human Origins (IMPRS), Max Planck Institute for Evolutionary Anthropology, Max Planck Society, Deutscher Platz 6, 04103 Leipzig, DE, ou_1497688              

Content

show
hide
Free keywords: -
 Abstract: Human ancient DNA (aDNA) studies have surged in recent years, revolutionizing the study of the human past. Typically, aDNA is preserved poorly, making such data prone to contamination from other human DNA. Therefore, it is important to rule out substantial contamination before proceeding to downstream analysis. As most aDNA samples can only be sequenced to low coverages (<1× average depth), computational methods that can robustly estimate contamination in the low coverage regime are needed. However, the ultra low-coverage regime (0.1× and below) remains a challenging task for existing approaches.We present a new method to estimate contamination in aDNA for male modern humans. It utilizes a Li&Stephens haplotype copying model for haploid X chromosomes, with mismatches modeled as errors or contamination. We assessed this new approach, hapCon, on simulated and down-sampled empirical aDNA data. Our experiments demonstrate that hapCon outperforms a commonly used tool for estimating male X contamination (ANGSD), with substantially lower variance and narrower confidence intervals, especially in the low coverage regime. We found that hapCon provides useful contamination estimates for coverages as low as 0.1× for SNP capture data (1240k) and 0.02× for whole genome sequencing data, substantially extending the coverage limit of previous male X chromosome-based contamination estimation methods. Our experiments demonstrate that hapCon has little bias for contamination up to 25–30\% as long as the contaminating source is specified within continental genetic variation, and that its application range extends to human aDNA as old as ∼45 000 and various global ancestries.We make hapCon available as part of a python package (hapROH), which is available at the Python Package Index (https://pypi.org/project/hapROH) and can be installed via pip. The documentation provides example use cases as blueprints for custom applications (https://haproh.readthedocs.io/en/latest/hapCon.html). The program can analyze either BAM files or pileup files produced with samtools. An implementation of our software (hapCon) using Python and C is deposited at https://github.com/hyl317/hapROH.Supplementary data are available at Bioinformatics online.

Details

show
hide
Language(s): eng - English
 Dates: 2022-06-13
 Publication Status: Published online
 Pages: 10
 Publishing info: -
 Table of Contents: -
 Rev. Type: Peer
 Identifiers: DOI: 10.1093/bioinformatics/btac390
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Bioinformatics
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: Oxford : Oxford University Press
Pages: - Volume / Issue: 38 (15) Sequence Number: btac390 Start / End Page: 3768 - 3777 Identifier: ISSN: 1367-4803