Challenges with using primer IDs to improve accuracy of next generation 
sequencing

Brodin, J; Hedskog, C; Heddini, A; Benard, E; Neher, RA; Mild, M; Albert, J

doi:10.1371/journal.pone.0119123

Local TagsRelease HistoryDetailsSummary

Challenges with using primer IDs to improve accuracy of next generation sequencing

Brodin, J., Hedskog, C., Heddini, A., Benard, E., Neher, R., Mild, M., et al. (2015). Challenges with using primer IDs to improve accuracy of next generation sequencing. PLoS One, 10(3): e0119123. doi:10.1371/journal.pone.0119123.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-000B-963B-A Version Permalink: https://hdl.handle.net/21.11116/0000-000B-963C-9

Genre: Journal Article

Files

show Files

Locators

show

Creators

show

hide

Creators:
Brodin, J, Author
Hedskog, C, Author
Heddini, A, Author
Benard, E¹, Author
Neher, RA¹, Author
Mild, M, Author
Albert, J, Author

Affiliations:
1Research Group Evolutionary Dynamics and Biophysics, Max Planck Institute for Developmental Biology, Max Planck Society, ou_3377926

Content

show

hide

Free keywords: -

Abstract: Next generation sequencing technologies, like ultra-deep pyrosequencing (UDPS), allows detailed investigation of complex populations, like RNA viruses, but its utility is limited by errors introduced during sample preparation and sequencing. By tagging each individual cDNA molecule with barcodes, referred to as Primer IDs, before PCR and sequencing these errors could theoretically be removed. Here we evaluated the Primer ID methodology on 257,846 UDPS reads generated from a HIV-1 SG3Δenv plasmid clone and plasma samples from three HIV-infected patients. The Primer ID consisted of 11 randomized nucleotides, 4,194,304 combinations, in the primer for cDNA synthesis that introduced a unique sequence tag into each cDNA molecule. Consensus template sequences were constructed for reads with Primer IDs that were observed three or more times. Despite high numbers of input template molecules, the number of consensus template sequences was low. With 10,000 input molecules for the clone as few as 97 consensus template sequences were obtained due to highly skewed frequency of resampling. Furthermore, the number of sequenced templates was overestimated due to PCR errors in the Primer IDs. Finally, some consensus template sequences were erroneous due to hotspots for UDPS errors. The Primer ID methodology has the potential to provide highly accurate deep sequencing. However, it is important to be aware that there are remaining challenges with the methodology. In particular it is important to find ways to obtain a more even frequency of resampling of template molecules as well as to identify and remove artefactual consensus template sequences that have been generated by PCR errors in the Primer IDs.

Details

show

hide

Language(s):

Dates: Date issued: 2015-03

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: DOI: 10.1371/journal.pone.0119123
PMID: 25741706

Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show

hide

Title: PLoS One

Abbreviation : PLoS One

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: San Francisco, CA : Public Library of Science

Pages: 12 Volume / Issue: 10 (3) Sequence Number: e0119123 Start / End Page: - Identifier: ISSN: 1932-6203
CoNE: https://pure.mpg.de/cone/journals/resource/1000000000277850