automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered 
by Machine Learning

Eilertz, Daniel; Mitterer, Michael; Büscher, Jörg Martin

doi:10.1021/acs.analchem.1c05224

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Journal Article

automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning

MPS-Authors

Eilertz, Daniel
Max Planck Institute of Immunobiology and Epigenetics, Max Planck Society;

Mitterer, Michael
Max Planck Institute of Immunobiology and Epigenetics, Max Planck Society;

/persons/resource/persons241896

Büscher, Jörg Martin
Max Planck Institute of Immunobiology and Epigenetics, Max Planck Society;

External Resource

https://pubs.acs.org/doi/10.1021/acs.analchem.1c05224
(Publisher version)

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

10.1021_acs.analchem.1c05224.pdf
(Publisher version), 4MB

Supplementary Material (public)

There is no public supplementary material available

Citation

Eilertz, D., Mitterer, M., & Büscher, J. M. (2022). automRm: An R Package for Fully Automatic LC-QQQ-MS Data Preprocessing Powered by Machine Learning. Analytical Chemistry, 94, 6163-6171. doi:10.1021/acs.analchem.1c05224.

Cite as: https://hdl.handle.net/21.11116/0000-000A-4543-D

Abstract

Preprocessing of liquid chromatography-mass spectrometry (LC-MS) raw data facilitates downstream statistical and biological data analyses. In the case of targeted LC-MS data, consistent recognition of chromatographic peaks is a main challenge, in particular, for low abundant signals. Fully automatic preprocessing is faster than manual peak review and does not depend on the individual operator. Here, we present the R package automRm for fully automatic preprocessing of LC-MS data recorded in MRM mode. Using machine learning (ML) for detection of chromatographic peaks and quality control of reported results enables the automatic recognition of complex patterns in raw data. In addition, this approach renders automRm generally applicable to a wide range of analytical methods including hydrophilic interaction liquid chromatography (HILIC), which is known for sample-to-sample variations in peak shape and retention time. We demonstrate the impact of the choice of training data set, of the applied ML algorithm, and of individual peak characteristics on automRm’s ability to correctly report chromatographic peaks. Next, we show that automRm can replicate results obtained by manual peak review on published data. Moreover, automRm outperforms alternative software solutions regarding the variation in peak integration among replicate measurements and the number of correctly reported peaks when applied to a HILIC-MS data set. The R package is freely available from gitlab (https://gitlab.gwdg.de/joerg.buescher/automrm).