Back

Minisymposium

MS2H - Updating Workflows in Virtual Drug Discovery with Current Technologies

Fully booked
Monday, June 3, 2024
14:30
-
16:30
CEST
HG F 26.5

Replay

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

Session Chair

Description

Drug discovery is a difficult process that often relies on fortuitous discoveries. The number of such discoveries has stagnated for decades despite numerous technological advances. The role of HPC in the field is undergoing transformations due to the advent of large-scale machine learning models in recent years, which promise to revolutionize parts of the discovery pipeline. Traditionally, computation provides ways to sample the mutual conformational space of ligands and receptors or predicts physicochemical properties of small molecules, to name just two. Our minisymposium zooms in on the following overarching considerations: first, a mix of access to technology, computational resources, and data dictates the ease and feasibility of use and therefore the widespread adoption of modern, AI-based prediction methods. Second, it is the particular challenge of complexes of small molecules and receptors that they pose many specific problems but offer little generalizability. Third, it is difficult to analyze results objectively when the ultimate goal is simply to discover a new binder, which has limited the ability to transfer and abstract knowledge. Following these lead concerns, our minisymposium is meant to foster the exchange of technologies and to fortify the ongoing discourse on objectivity and standardization in computational drug discovery.

Presentations

14:30
-
15:00
CEST
Molecular Docking and Virtual Screening at Scale with GNINA

Molecular docking computationally predicts the conformation of a small molecule when binding to a receptor. Scoring functions are a vital piece of any molecular docking pipeline as they determine the fitness of sampled poses.We will describe the training and development of convolutional neural networks for protein-ligand scoring and how these deep learning models are integrated into the GNINA molecular docking open source software. We will describe the role high performance computing played in the training and optimzation of these networks and how high throughput docking can be performed at scale. Successful prospective evaluations of GNINA will be discussed, including recent top performance in the Critical Assessment of Computational Hit-Finding Experiments (CACHE).

David Koes (University of Pittsburgh)
With Thorsten Kurth (NVIDIA Inc.)
15:00
-
15:30
CEST
GPU-Accelerated Molecular Dynamics Simulations for Multistate Binding Affinity Calculations with RE-EDS

Computational approaches for estimating protein-ligand binding affinities are crucial in modern drug discovery. All-atom explicit-solvent molecular dynamics (MD) simulations use the foundational principles of classical mechanics and statistical thermodynamics to rigorously calculate binding free energies. They are the most accurate affinity prediction methods that are still tractable when applied to biological systems, but they are also among the most computationally demanding. To mitigate costs, efficient sampling protocols such as multistate methods have been developed, enabling calculations of relative binding free energy across multiple ligands from a single simulation. One such technique, replica-exchange EDS (RE-EDS), leverages similarities in ligand environments to achieve improved sampling performance compared to pairwise methods. Its recent GPU-accelerated implementation in the OpenMM toolkit for molecular simulation has been benchmarked on a set of solvation free energies from the FreeSolv dataset and forms the basis of a high throughput framework for free energy calculations.

Enrico Ruijsenaars (ETH Zurich)
With Thorsten Kurth (NVIDIA Inc.)
15:30
-
16:00
CEST
Leveraging Large Datasets to Assess the Potential of Machine Learning for Drug Target Prediction through Reverse Screening

Estimating protein targets of compounds based on the similarity principle is a long-standing strategy in drug discovery. Building upon prior quantification of this principle, the large-scale assessment of its predictive power was performed using an unprecedented vast external test set of more than 300’000 active small molecules against another bioactivity set of more than 500’000 compounds. It was found that machine-learning can predict the correct targets, with the highest probability among 2069 proteins, for more than 51% of the external molecules. The strong enrichment thus obtained demonstrates its usefulness in supporting phenotypic screens, polypharmacology, or repurposing. Moreover, the impact of the bioactivity knowledge available for proteins in terms of number and diversity of actives was investigated. This study advocates for the adoption of application-oriented benchmarking strategies to prevent accidental overestimation of their predictive ability, and the use of large, high-quality, non-overlapping datasets.

Vincent Zoete (University of Lausanne)
With Thorsten Kurth (NVIDIA Inc.)
16:00
-
16:30
CEST
An Integrated, HPC-Ready, Graphical Platform to Discover, Test, and Refine Small Molecule Binders

In virtual drug discovery, the reproducibility of results across different practitioners is a frequent concern, and the roles of human intervention, e.g., through visual inspection, are hard to quantify and recapitulate. This is in part a result of the nature of the task, which is to find, rather than explain, promising compounds. Nevertheless, the standardization of workflows is a concern, both in general and especially so when the results of screening molecules have a stochastic component. The abstraction of tasks through sophisticated user interfaces such as Maestro (Schrodinger) is one route toward standardization, which has the additional benefit of flattening the learning curves for beginners. In my talk, I will present a unified, open-source graphical user interface developed in our group, ACGui, which serves as a frontend for different popular docking softwares. I will explain the SQL database we use for compound management, including topics such as parameterization, conformer generation, and visualization of results. ACGui also abstracts an interface to HPC resources through SLURM to tackle large-scale computations as well as the setup of bulk molecular dynamics simulations using GROMACS to study promising hits in more detail. I will briefly present the main workflows of both of these components.

Yang Zhang (University of Zurich)
With Thorsten Kurth (NVIDIA Inc.)