Health / Tue, 02 Jun 2026 EMJ

StackPred: AI-Boosted AMR Phenotype Prediction for Multiple Species and Antimicrobial Agents

BACKGROUND AND AIMSGenome-based antimicrobial susceptibility testing is emerging as a promising alternative to culture-based methods, which remain time-consuming despite being the clinical gold standard.1,2 Here, the authors present an extension of the stacked Random Forest framework, StackPredAMR (under revision), which originally covered 18 antimicrobial agents across three species. At the European Society of Clinical Microbiology and Infectious Diseases (ESCMID) 2026, the authors presented an extension of this framework, demonstrating its straightforward scalability to additional species and antimicrobial agents for multi-agent antimicrobial resistance (AMR) prediction from genomic data.3METHODSThe model was trained on BVBRC VITEK (bioMérieux, Marcy-l’Étoile, France) antimicrobial susceptibility testing data comprising more than 2,500 isolates from Escherichia coli, Klebsiella pneumoniae, Acinetobacter baumannii, and Enterobacter cloacae, covering up to 23 antimicrobial agents.4 Genomic input features were derived from the Comprehensive Antibiotic Resistance Database (CARD) annotations, and encoded as binary presence/absence of AMR genes.5 The approach builds on the original stackPredAMR architecture by combining individual Random Forest classifiers for each antimicrobial agent with a second-layer meta-model. This stacking strategy enables the model to capture cross-resistance patterns between antimicrobial agents while handling incomplete phenotype labels, a common limitation in AMR datasets.6 Compared to rule-based approaches, which rely on known resistance genes,7 machine learning methods can learn more complex patterns directly from genomic data.8RESULTSModel performance was evaluated using 5-fold cross-validation. For E. coli and K. pneumoniae, median major error and very major error rates across antimicrobial agents remained below 3%. Predictions for A. baumannii and E. cloacae, included as a proof of concept with a smaller sample size, were also successfully generated, highlighting the flexibility and extensibility of the framework (Figure 1).

BACKGROUND AND AIMS

Genome-based antimicrobial susceptibility testing is emerging as a promising alternative to culture-based methods, which remain time-consuming despite being the clinical gold standard.1,2 Here, the authors present an extension of the stacked Random Forest framework, StackPredAMR (under revision), which originally covered 18 antimicrobial agents across three species. At the European Society of Clinical Microbiology and Infectious Diseases (ESCMID) 2026, the authors presented an extension of this framework, demonstrating its straightforward scalability to additional species and antimicrobial agents for multi-agent antimicrobial resistance (AMR) prediction from genomic data.3

METHODS

The model was trained on BVBRC VITEK (bioMérieux, Marcy-l’Étoile, France) antimicrobial susceptibility testing data comprising more than 2,500 isolates from Escherichia coli, Klebsiella pneumoniae, Acinetobacter baumannii, and Enterobacter cloacae, covering up to 23 antimicrobial agents.4 Genomic input features were derived from the Comprehensive Antibiotic Resistance Database (CARD) annotations, and encoded as binary presence/absence of AMR genes.5 The approach builds on the original stackPredAMR architecture by combining individual Random Forest classifiers for each antimicrobial agent with a second-layer meta-model. This stacking strategy enables the model to capture cross-resistance patterns between antimicrobial agents while handling incomplete phenotype labels, a common limitation in AMR datasets.6 Compared to rule-based approaches, which rely on known resistance genes,7 machine learning methods can learn more complex patterns directly from genomic data.8

RESULTS

Model performance was evaluated using 5-fold cross-validation. For E. coli and K. pneumoniae, median major error and very major error rates across antimicrobial agents remained below 3%. Predictions for A. baumannii and E. cloacae, included as a proof of concept with a smaller sample size, were also successfully generated, highlighting the flexibility and extensibility of the framework (Figure 1).