Warning: file_get_contents(https://eutils.ncbi.nlm.nih.gov/entrez/eutils/elink.fcgi?dbfrom=pubmed&id=41143534&cmd=llinks): Failed to open stream: HTTP request failed! HTTP/1.1 429 Too Many Requests
in C:\Inetpub\vhosts\kidney.de\httpdocs\pget.php on line 215
ROCker models for reliable detection and typing of short-read sequences carrying mcr, erm, mph, and lnu antibiotic resistance genes #MMPMID41143534
Conrad R; Gerhardt K; Konstantinidis KT; Williams-Newkirk AJ; Huang AD
Microbiol Spectr 2025[Oct]; ? (?): e0241325 PMID41143534show ga
Quantitative monitoring of emerging antimicrobial resistance genes (ARGs) using short-read sequences remains challenging due to the high frequency of amino acid functional domains and motifs shared with related but functionally distinct (non-target) proteins. To facilitate ARG monitoring efforts using unassembled short reads, we present novel ROCker models for mcr, mph, erm, and lnu ARG families, as well as models for variants of special public health concern within these families, including mcr-1, mphA, ermB, lnuF, lnuB, and lnuG genes. For this, we curated target gene sequence sets for model training and built these models using the recently updated ROCker V2 pipeline (Gerhardt et al., in review). To validate our models, we simulated reads from the whole genome of ARG-carrying isolates spanning a range of common read lengths and used them to challenge the filtering efficacy of ROCker versus common static filtering approaches, such as similarity searches using BLASTx with various e-value thresholds or hidden Markov models. ROCker models consistently showed F1 scores up to 10x higher (31% higher on average) and lower false-positive (by 30%, on average) and false-negative (by 16%, on average) rates based on 250 bp reads compared to alternative methods. The ROCker models and all related reference materials and data are freely available through http://enve-omics.ce.gatech.edu/rocker/models, further expanding the available model collection previously developed for other genes. Their application to short-read metagenomes, metatranscriptomes, and PCR amplicon data should facilitate more accurate classification and quantification of unassembled short-read sequences for these ARG families and specific genes.IMPORTANCEAntimicrobial resistance gene families encoding erm and mph genes confer resistance to the macrolide class of antimicrobials, which are used to treat a wide range of infections. Similarly, the mcr gene family confers resistance to polymyxin E (colistin), a drug of last resort for many serious drug-resistant bacterial infections, and the lnu gene family confers resistance to lincomycin, which is reserved for patients allergic to penicillin or where bacteria have developed resistance to other antimicrobials. Assessing the prevalence of these genes in clinical or environmental samples and monitoring their spread to new pathogens are thus important for quantifying the associated public health risk. However, detecting these and other resistance genes in short-read sequence data is technically challenging. Our ROCker bioinformatic pipeline achieves reliable detection and typing of broad-range target gene sequences in complex data sets, thus contributing toward solving an important problem in ongoing surveillance efforts of antimicrobial resistance.