Deutsch
 
Hilfe Datenschutzhinweis Impressum
  DetailsucheBrowse

Datensatz

DATENSATZ AKTIONENEXPORT

Freigegeben

Zeitschriftenartikel

Identification and quantification of chimeric sequencing reads in a highly multiplexed RAD-seq protocol

MPG-Autoren
/persons/resource/persons271201

Kucka,  M
Chan Group, Friedrich Miescher Laboratory, Max Planck Society;

/persons/resource/persons56629

Chan,  YF
Chan Group, Friedrich Miescher Laboratory, Max Planck Society;

Externe Ressourcen
Es sind keine externen Ressourcen hinterlegt
Volltexte (beschränkter Zugriff)
Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.
Volltexte (frei zugänglich)
Es sind keine frei zugänglichen Volltexte in PuRe verfügbar
Ergänzendes Material (frei zugänglich)
Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar
Zitation

Martin Cerezo, M., Raval, R., de Haro Reyes, B., Kucka, M., Chan, Y., & Bryk, J. (2022). Identification and quantification of chimeric sequencing reads in a highly multiplexed RAD-seq protocol. Molecular Ecology Resources, 22(8), 2860-2870. doi:10.1111/1755-0998.13661.


Zitierlink: https://hdl.handle.net/21.11116/0000-000A-31CE-7
Zusammenfassung
Highly multiplexed approaches have become common in genomic studies. They have improved the cost-effectiveness of genotyping hundreds of individuals by using combinatorially-barcoded adapters. These strategies, however, can potentially missassign reads to incorrect samples. Here we used a modified quaddRAD protocol to analyse the occurrence of index hopping and PCR chimeras in a series of experiments with up to a 100 multiplexed samples per sequencing lane ( 639 samples in total). We created two types of sequencing libraries: four libraries of Type A, where PCR reactions were run on individual samples before multiplexing, and three libraries of Type B, where PCRs were run on pooled samples. We used fixed pairs of inner barcodes to identify chimeric reads. Type B libraries show a higher percentage of misassigned reads (1.15%) compared to Type A libraries (0.65%). We also quantify the commonly undetectable chimeric sequences that occur whenever multiplexed groups of samples with different outer barcodes are sequenced together on a single flow cell. Our results suggest that these types of chimeric sequences represent up to 1.56% and 1.29% of reads in Type A and B libraries, respectively.We also show that increasing the number of mismatches allowed for barcode rescue to above 2 dramatically increases the number of recovered chimeric reads. We provide recommendations for developing highly-multiplexed RAD-seq protocols and analysing the resulting data to minimise the generation of chimeric sequences, allowing their quantification and a finer control on the number of PCR cycles necessary to generate enough input DNA for library preparation.