- Published
- 19 September 2023
- Journal article
SILVR: Guided Diffusion for Molecule Generation
- Authors
- Source
- Journal of Chemical Information and Modeling
Full text
Abstract
Computationally generating new synthetically accessible compounds with high affinity and low toxicity is a great challenge in drug design. Machine learning models beyond conventional pharmacophoric methods have shown promise in the generation of novel small-molecule compounds but require significant tuning for a specific protein target. Here, we introduce a method called selective iterative latent variable refinement (SILVR) for conditioning an existing diffusion-based equivariant generative model without retraining. The model allows the generation of new molecules that fit into a binding site of a protein based on fragment hits. We use the SARS-CoV-2 main protease fragments from Diamond XChem that form part of the COVID Moonshot project as a reference dataset for conditioning the molecule generation. The SILVR rate controls the extent of conditioning, and we show that moderate SILVR rates make it possible to generate new molecules of similar shape to the original fragments, meaning that the new molecules fit the binding site without knowledge of the protein. We can also merge up to 3 fragments into a new molecule without affecting the quality of molecules generated by the underlying generative model. Our method is generalizable to any protein target with known fragments and any diffusion-based model for molecule generation.
Cite as
Runcie, N. & Mey, A. 2023, 'SILVR: Guided Diffusion for Molecule Generation', Journal of Chemical Information and Modeling, 63(19), pp. 5996-6005. https://doi.org/10.1021/acs.jcim.3c00667