1. Academic Validation
  2. Benchmarking AlphaFold-enabled molecular docking predictions for antibiotic discovery

Benchmarking AlphaFold-enabled molecular docking predictions for antibiotic discovery

  • Mol Syst Biol. 2022 Sep;18(9):e11081. doi: 10.15252/msb.202211081.
Felix Wong  # 1 2 3 Aarti Krishnan  # 1 2 3 Erica J Zheng 3 4 Hannes Stärk 5 Abigail L Manson 3 Ashlee M Earl 3 Tommi Jaakkola 5 James J Collins 1 2 3 6
Affiliations

Affiliations

  • 1 Institute for Medical Engineering & Science, Massachusetts Institute of Technology, Cambridge, MA, USA.
  • 2 Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
  • 3 Infectious Disease and Microbiome Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • 4 Program in Chemical Biology, Harvard University, Cambridge, MA, USA.
  • 5 Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA.
  • 6 Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA.
  • # Contributed equally.
Abstract

Efficient identification of drug mechanisms of action remains a challenge. Computational docking approaches have been widely used to predict drug binding targets; yet, such approaches depend on existing protein structures, and accurate structural predictions have only recently become available from AlphaFold2. Here, we combine AlphaFold2 with molecular docking simulations to predict protein-ligand interactions between 296 proteins spanning Escherichia coli's essential proteome, and 218 active Antibacterial compounds and 100 inactive compounds, respectively, pointing to widespread compound and protein promiscuity. We benchmark model performance by measuring enzymatic activity for 12 essential proteins treated with each Antibacterial compound. We confirm extensive promiscuity, but find that the average area under the receiver operating characteristic curve (auROC) is 0.48, indicating weak model performance. We demonstrate that rescoring of docking poses using machine learning-based approaches improves model performance, resulting in average auROCs as large as 0.63, and that ensembles of rescoring functions improve prediction accuracy and the ratio of true-positive rate to false-positive rate. This work indicates that advances in modeling protein-ligand interactions, particularly using machine learning-based approaches, are needed to better harness AlphaFold2 for drug discovery.

Keywords

AlphaFold2; enzymatic activity; machine learning; molecular docking; protein-ligand interactions.

Figures
Products