English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

ROCker Models for Reliable Detection and Typing of Short-Read Sequences Carrying beta-Lactamase Genes

MPS-Authors
/persons/resource/persons256962

Orellana,  Luis H.
Department of Molecular Ecology, Max Planck Institute for Marine Microbiology, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

msystems.01281-21.pdf
(Publisher version), 3MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Zhang, S.-Y., Suttner, B., Rodriguez-R, L. M., Orellana, L. H., Conrad, R. E., Liu, F., et al. (2022). ROCker Models for Reliable Detection and Typing of Short-Read Sequences Carrying beta-Lactamase Genes. MSYSTEMS. doi:10.1128/msystems.01281-21.


Cite as: https://hdl.handle.net/21.11116/0000-000A-C198-0
Abstract
Identification of genes encoding beta-lactamases (BLs) from short-read sequences remains challenging due to the high frequency of shared amino acid functional domains and motifs in proteins encoded by BL genes and related non-BL gene sequences. Divergent BL homologs can be frequently missed during similarity searches, which has important practical consequences for monitoring antibiotic resistance. To address this limitation, we built ROCker models that targeted broad classes (e.g., class A, B, C, and D) and individual families (e.g., TEM) of BLs and challenged them with mock 150-bp- and 250-bp-read data sets of known composition. ROCker identifies most-discriminant bit score thresholds in sliding windows along the sequence of the target protein sequence and hence can account for nondiscriminative domains shared by unrelated proteins. BL ROCker models showed a 0% false-positive rate (FPR), a 0% to 4% false-negative rate (FNR), and an up-to-50-fold-higher F1 score [2 x precision x recall/(precision + recall)] compared to alternative methods, such as similarity searches using BLASTx with various e-value thresholds and BL hidden Markov models, or tools like DeepARG, ShortBRED, and AMRFinder. The ROCker models and the underlying protein sequence reference data sets and phylogenetic trees for read placement are freely available through http://enve-omics.ce.gatech.edu/data/rocker-bla. Application of these BL ROCker models to metagenomics, metatranscriptomics, and high-throughput PCR gene amplicon data should facilitate the reliable detection and quantification of BL variants encoded by environmental or clinical isolates and microbiomes and more accurate assessment of the associated public health risk, compared to the current practice.
IMPORTANCE Resistance genes encoding beta-lactamases (BLs) confer resistance to the widely prescribed antibiotic class beta-lactams. Therefore, it is important to assess the prevalence of BL genes in clinical or environmental samples for monitoring the spreading of these genes into pathogens and estimating public health risk. However, detecting BLs in short-read sequence data is technically challenging. Our ROCker model-based bioinformatics approach showcases the reliable detection and typing of BLs in complex data sets and thus contributes toward solving an important problem in antibiotic resistance surveillance. The ROCker models developed substantially expand the toolbox for monitoring antibiotic resistance in clinical or environmental settings.