English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
 
 
DownloadE-Mail
  Merfin: improved variant filtering, assembly evaluation and polishing via k-mer validation.

Formenti, G., Rhie, A., Walenz, B., Thibaud-Nissen, F., Shafin, K., Koren, S., et al. (2022). Merfin: improved variant filtering, assembly evaluation and polishing via k-mer validation. Nature methods, 19(6), 696-704. doi:10.1038/s41592-022-01445-y.

Item is

Files

show Files

Locators

show

Creators

show
hide
 Creators:
Formenti, Giulio, Author
Rhie, Arang, Author
Walenz, Brian, Author
Thibaud-Nissen, Francoise, Author
Shafin, Kishwar, Author
Koren, Sergey, Author
Myers, Eugene W1, Author           
Jarvis, Erich D, Author
Phillippy, Adam M, Author
Affiliations:
1Max Planck Institute for Molecular Cell Biology and Genetics, Max Planck Society, ou_2340692              

Content

show
hide
Free keywords: -
 Abstract: Variant calling has been widely used for genotyping and for improving the consensus accuracy of long-read assemblies. Variant calls are commonly hard-filtered with user-defined cutoffs. However, it is impossible to define a single set of optimal cutoffs, as the calls heavily depend on the quality of the reads, the variant caller of choice and the quality of the unpolished assembly. Here, we introduce Merfin, a k-mer based variant-filtering algorithm for improved accuracy in genotyping and genome assembly polishing. Merfin evaluates each variant based on the expected k-mer multiplicity in the reads, independently of the quality of the read alignment and variant caller's internal score. Merfin increased the precision of genotyped calls in several benchmarks, improved consensus accuracy and reduced frameshift errors when applied to human and nonhuman assemblies built from Pacific Biosciences HiFi and continuous long reads or Oxford Nanopore reads, including the first complete human genome. Moreover, we introduce assembly quality and completeness metrics that account for the expected genomic copy numbers.

Details

show
hide
Language(s):
 Dates: 2022-06-01
 Publication Status: Issued
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1038/s41592-022-01445-y
Other: cbg-8332
PMID: 35361932
 Degree: -

Event

show

Legal Case

show

Project information

show

Source 1

show
hide
Title: Nature methods
  Other : Nat Methods
Source Genre: Journal
 Creator(s):
Affiliations:
Publ. Info: -
Pages: - Volume / Issue: 19 (6) Sequence Number: - Start / End Page: 696 - 704 Identifier: -