Author: "Gilson, Michael K." - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Gilson, Michael K."' showing total 769 results

Start Over Author "Gilson, Michael K."

769 results on '"Gilson, Michael K."'

1. Technical report: Improving the properties of molecules generated by LIMO

Author: Thumuluri, Vineet, Eckmann, Peter, Gilson, Michael K., and Yu, Rose
Subjects: Computer Science - Machine Learning, Quantitative Biology - Biomolecules
Abstract: This technical report investigates variants of the Latent Inceptionism on Molecules (LIMO) framework to improve the properties of generated molecules. We conduct ablative studies of molecular representation, decoder model, and surrogate model training scheme. The experiments suggest that an autogressive Transformer decoder with GroupSELFIES achieves the best average properties for the random generation task., Comment: 9 pages, 2 figures
Published: 2024

2. MFBind: a Multi-Fidelity Approach for Evaluating Drug Compounds in Practical Generative Modeling

Author: Eckmann, Peter, Wu, Dongxia, Heinzelmann, Germano, Gilson, Michael K, and Yu, Rose
Subjects: Quantitative Biology - Biomolecules, Computer Science - Machine Learning
Abstract: Current generative models for drug discovery primarily use molecular docking to evaluate the quality of generated compounds. However, such models are often not useful in practice because even compounds with high docking scores do not consistently show experimental activity. More accurate methods for activity prediction exist, such as molecular dynamics based binding free energy calculations, but they are too computationally expensive to use in a generative model. We propose a multi-fidelity approach, Multi-Fidelity Bind (MFBind), to achieve the optimal trade-off between accuracy and computational cost. MFBind integrates docking and binding free energy simulators to train a multi-fidelity deep surrogate model with active learning. Our deep surrogate model utilizes a pretraining technique and linear prediction heads to efficiently fit small amounts of high-fidelity data. We perform extensive experiments and show that MFBind (1) outperforms other state-of-the-art single and multi-fidelity baselines in surrogate modeling, and (2) boosts the performance of generative models with markedly higher quality compounds., Comment: 9 pages, 4 figures
Published: 2024

3. A Fast, Convenient, Polarizable Electrostatic Model for Molecular Dynamics

Author: Wang, Liangyue, Schauperl, Michael, Mobley, David L, Bayly, Christopher, and Gilson, Michael K
Subjects: Chemical Sciences, Physical Chemistry, Theoretical and Computational Chemistry, Biochemistry and Cell Biology, Computer Software, Chemical Physics, Physical chemistry, Theoretical and computational chemistry
Abstract: We present an efficient polarizable electrostatic model, utilizing typed, atom-centered polarizabilities and the fast direct approximation, designed for efficient use in molecular dynamics (MD) simulations. The model provides two convenient approaches for assigning partial charges in the context of atomic polarizabilities. One is a generalization of RESP, called RESP-dPol, and the other, AM1-BCC-dPol, is an adaptation of the widely used AM1-BCC method. Both are designed to accurately replicate gas-phase quantum mechanical electrostatic potentials. Benchmarks of this polarizable electrostatic model against gas-phase dipole moments, molecular polarizabilities, bulk liquid densities, and static dielectric constants of organic liquids show good agreement with the reference values. Of note, the model yields markedly more accurate dielectric constants of organic liquids, relative to a matched nonpolarizable force field. MD simulations with this method, which is currently parametrized for molecules containing elements C, N, O, and H, run only about 3.6-fold slower than fixed charge force fields, while simulations with the self-consistent mutual polarization average 4.5-fold slower. Our results suggest that RESP-dPol and AM1-BCC-dPol afford improved accuracy relative to fixed charge force fields and are good starting points for developing general, affordable, and transferable polarizable force fields. The software implementing these approaches has been designed to utilize the force field fitting frameworks developed and maintained by the Open Force Field Initiative, setting the stage for further exploration of this approach to polarizable force field development.
Published: 2024

4. Structure-Based Experimental Datasets for Benchmarking of Protein Simulation Force Fields

Author: Cavender, Chapin E., Case, David A., Chen, Julian C. -H., Chong, Lillian T., Keedy, Daniel A., Lindorff-Larsen, Kresten, Mobley, David L., Ollila, O. H. Samuli, Oostenbrink, Chris, Robustelli, Paul, Voelz, Vincent A., Wall, Michael E., Wych, David C., and Gilson, Michael K.
Subjects: Quantitative Biology - Biomolecules, Physics - Biological Physics, Physics - Computational Physics
Abstract: This review article provides an overview of structurally oriented, experimental datasets that can be used to benchmark protein force fields, focusing on data generated by nuclear magnetic resonance (NMR) spectroscopy and room temperature (RT) protein crystallography. We discuss why these observables are useful for assessing force field accuracy, how they can be calculated from simulation trajectories, and statistical issues that arise when comparing simulations with experiment. The target audience for this article is computational researchers and trainees who develop, benchmark, or use protein force fields for molecular simulations.
Published: 2023

5. Development and Benchmarking of Open Force Field 2.0.0: The Sage Small Molecule Force Field

Author: Boothroyd, Simon, Behara, Pavan Kumar, Madin, Owen C, Hahn, David F, Jang, Hyesu, Gapsys, Vytautas, Wagner, Jeffrey R, Horton, Joshua T, Dotson, David L, Thompson, Matthew W, Maat, Jessica, Gokey, Trevor, Wang, Lee-Ping, Cole, Daniel J, Gilson, Michael K, Chodera, John D, Bayly, Christopher I, Shirts, Michael R, and Mobley, David L
Subjects: Medicinal and Biomolecular Chemistry, Chemical Sciences, Theoretical and Computational Chemistry, Benchmarking, Ligands, Proteins, Thermodynamics, Entropy, Biochemistry and Cell Biology, Computer Software, Chemical Physics, Physical chemistry, Theoretical and computational chemistry
Abstract: We introduce the Open Force Field (OpenFF) 2.0.0 small molecule force field for drug-like molecules, code-named Sage, which builds upon our previous iteration, Parsley. OpenFF force fields are based on direct chemical perception, which generalizes easily to highly diverse sets of chemistries based on substructure queries. Like the previous OpenFF iterations, the Sage generation of OpenFF force fields was validated in protein-ligand simulations to be compatible with AMBER biopolymer force fields. In this work, we detail the methodology used to develop this force field, as well as the innovations and improvements introduced since the release of Parsley 1.0.0. One particularly significant feature of Sage is a set of improved Lennard-Jones (LJ) parameters retrained against condensed phase mixture data, the first refit of LJ parameters in the OpenFF small molecule force field line. Sage also includes valence parameters refit to a larger database of quantum chemical calculations than previous versions, as well as improvements in how this fitting is performed. Force field benchmarks show improvements in general metrics of performance against quantum chemistry reference data such as root-mean-square deviations (RMSD) of optimized conformer geometries, torsion fingerprint deviations (TFD), and improved relative conformer energetics (ΔΔE). We present a variety of benchmarks for these metrics against our previous force fields as well as in some cases other small molecule force fields. Sage also demonstrates improved performance in estimating physical properties, including comparison against experimental data from various thermodynamic databases for small molecule properties such as ΔHmix, ρ(x), ΔGsolv, and ΔGtrans. Additionally, we benchmarked against protein-ligand binding free energies (ΔGbind), where Sage yields results statistically similar to previous force fields. All the data is made publicly available along with complete details on how to reproduce the training results at https://github.com/openforcefield/openff-sage.
Published: 2023

6. Development of Potent and Highly Selective Epoxyketone‐Based Plasmodium Proteasome Inhibitors

Author: Almaliti, Jehad, Fajtová, Pavla, Calla, Jaeson, LaMonte, Gregory M, Feng, Mudong, Rocamora, Frances, Ottilie, Sabine, Glukhov, Evgenia, Boura, Evzen, Suhandynata, Raymond T, Momper, Jeremiah D, Gilson, Michael K, Winzeler, Elizabeth A, Gerwick, William H, and O'Donoghue, Anthony J
Subjects: Chemical Sciences, Orphan Drug, Rare Diseases, Vector-Borne Diseases, Infectious Diseases, Malaria, 5.1 Pharmaceuticals, Development of treatments and therapeutic interventions, Infection, Good Health and Well Being, Mice, Animals, Humans, Proteasome Inhibitors, Proteasome Endopeptidase Complex, Plasmodium, Plasmodium falciparum, Antimalarials, epoxyketone, inhibition, malaria, plasmodium, proteasome, General Chemistry, Chemical sciences
Abstract: Here, we present remarkable epoxyketone-based proteasome inhibitors with low nanomolar in vitro potency for blood-stage Plasmodium falciparum and low cytotoxicity for human cells. Our best compound has more than 2,000-fold greater selectivity for erythrocytic-stage P. falciparum over HepG2 and H460 cells, which is largely driven by the accommodation of the parasite proteasome for a D-amino acid in the P3 position and the preference for a difluorobenzyl group in the P1 position. We isolated the proteasome from P. falciparum cell extracts and determined that the best compound is 171-fold more potent at inhibiting the β5 subunit of P. falciparum proteasome when compared to the same subunit of the human constitutive proteasome. These compounds also significantly reduce parasitemia in a P. berghei mouse infection model and prolong survival of animals by an average of 6 days. The current epoxyketone inhibitors are ideal starting compounds for orally bioavailable anti-malarial drugs.
Published: 2023

7. LIMO: Latent Inceptionism for Targeted Molecule Generation

Author: Eckmann, Peter, Sun, Kunyang, Zhao, Bo, Feng, Mudong, Gilson, Michael K., and Yu, Rose
Subjects: Computer Science - Machine Learning
Abstract: Generation of drug-like molecules with high binding affinity to target proteins remains a difficult and resource-intensive task in drug discovery. Existing approaches primarily employ reinforcement learning, Markov sampling, or deep generative models guided by Gaussian processes, which can be prohibitively slow when generating molecules with high binding affinity calculated by computationally-expensive physics-based methods. We present Latent Inceptionism on Molecules (LIMO), which significantly accelerates molecule generation with an inceptionism-like technique. LIMO employs a variational autoencoder-generated latent space and property prediction by two neural networks in sequence to enable faster gradient-based reverse-optimization of molecular properties. Comprehensive experiments show that LIMO performs competitively on benchmark tasks and markedly outperforms state-of-the-art techniques on the novel task of generating drug-like compounds with high binding affinity, reaching nanomolar range against two protein targets. We corroborate these docking-based results with more accurate molecular dynamics-based calculations of absolute binding free energy and show that one of our generated drug-like compounds has a predicted $K_D$ (a measure of binding affinity) of $6 \cdot 10^{-14}$ M against the human estrogen receptor, well beyond the affinities of typical early-stage drug candidates and most FDA-approved drugs to their respective targets. Code is available at https://github.com/Rose-STL-Lab/LIMO., Comment: 16 pages, 5 figures, ICML 2022
Published: 2022

8. CACHE (Critical Assessment of Computational Hit-finding Experiments): A public–private partnership benchmarking initiative to enable the development of computational methods for hit-finding

Author: Ackloo, Suzanne, Al-awar, Rima, Amaro, Rommie E, Arrowsmith, Cheryl H, Azevedo, Hatylas, Batey, Robert A, Bengio, Yoshua, Betz, Ulrich AK, Bologa, Cristian G, Chodera, John D, Cornell, Wendy D, Dunham, Ian, Ecker, Gerhard F, Edfeldt, Kristina, Edwards, Aled M, Gilson, Michael K, Gordijo, Claudia R, Hessler, Gerhard, Hillisch, Alexander, Hogner, Anders, Irwin, John J, Jansen, Johanna M, Kuhn, Daniel, Leach, Andrew R, Lee, Alpha A, Lessel, Uta, Morgan, Maxwell R, Moult, John, Muegge, Ingo, Oprea, Tudor I, Perry, Benjamin G, Riley, Patrick, Rousseaux, Sophie AL, Saikatendu, Kumar Singh, Santhakumar, Vijayaratnam, Schapira, Matthieu, Scholten, Cora, Todd, Matthew H, Vedadi, Masoud, Volkamer, Andrea, and Willson, Timothy M
Subjects: Networking and Information Technology R&D (NITRD), Bioengineering
Abstract: One aspirational goal of computational chemistry is to predict potent and drug-like binders for any protein, such that only those that bind are synthesized. In this Roadmap, we describe the launch of Critical Assessment of Computational Hit-finding Experiments (CACHE), a public benchmarking project to compare and improve small molecule hit-finding algorithms through cycles of prediction and experimental testing. Participants will predict small molecule binders for new and biologically relevant protein targets representing different prediction scenarios. Predicted compounds will be tested rigorously in an experimental hub, and all predicted binders as well as all experimental screening data, including the chemical structures of experimentally tested compounds, will be made publicly available, and not subject to any intellectual property restrictions. The ability of a range of computational approaches to find novel binders will be evaluated, compared, and openly published. CACHE will launch 3 new benchmarking exercises every year. The outcomes will be better prediction methods, new small molecule binders for target proteins of importance for fundamental biology or drug discovery, and a major technological step towards achieving the goal of Target 2035, a global initiative to identify pharmacological probes for all human proteins.
Published: 2022

9. Absolute binding free energy calculations improve enrichment of actives in virtual compound screening

Author: Feng, Mudong, Heinzelmann, Germano, and Gilson, Michael K
Subjects: Theory Of Computation, Medicinal and Biomolecular Chemistry, Chemical Sciences, Built Environment and Design, Information and Computing Sciences, Design, Prevention, Amyloid Precursor Protein Secretases, Aspartic Acid Endopeptidases, Entropy, Ligands, Molecular Docking Simulation, Protein Binding
Abstract: We determined the effectiveness of absolute binding free energy (ABFE) calculations to refine the selection of active compounds in virtual compound screening, a setting where the more commonly used relative binding free energy approach is not readily applicable. To do this, we conducted baseline docking calculations of structurally diverse compounds in the DUD-E database for three targets, BACE1, CDK2 and thrombin, followed by ABFE calculations for compounds with high docking scores. The docking calculations alone achieved solid enrichment of active compounds over decoys. Encouragingly, the ABFE calculations then improved on this baseline. Analysis of the results emphasizes the importance of establishing high quality ligand poses as starting points for ABFE calculations, a nontrivial goal when processing a library of diverse compounds without informative co-crystal structures. Overall, our results suggest that ABFE calculations can play a valuable role in the drug discovery process.
Published: 2022

10. Development and Benchmarking of Open Force Field v1.0.0the Parsley Small-Molecule Force Field

Author: Qiu, Yudong, Smith, Daniel GA, Boothroyd, Simon, Jang, Hyesu, Hahn, David F, Wagner, Jeffrey, Bannan, Caitlin C, Gokey, Trevor, Lim, Victoria T, Stern, Chaya D, Rizzi, Andrea, Tjanaka, Bryon, Tresadern, Gary, Lucas, Xavier, Shirts, Michael R, Gilson, Michael K, Chodera, John D, Bayly, Christopher I, Mobley, David L, and Wang, Lee-Ping
Subjects: Benchmarking, Ecosystem, Humans, Ligands, Molecular Conformation, Petroselinum, Theoretical and Computational Chemistry, Biochemistry and Cell Biology, Computer Software, Chemical Physics
Abstract: We present a methodology for defining and optimizing a general force field for classical molecular simulations, and we describe its use to derive the Open Force Field 1.0.0 small-molecule force field, codenamed Parsley. Rather than using traditional atom typing, our approach is built on the SMIRKS-native Open Force Field (SMIRNOFF) parameter assignment formalism, which handles increases in the diversity and specificity of the force field definition without needlessly increasing the complexity of the specification. Parameters are optimized with the ForceBalance tool, based on reference quantum chemical data that include torsion potential energy profiles, optimized gas-phase structures, and vibrational frequencies. These quantum reference data are computed and are maintained with QCArchive, an open-source and freely available distributed computing and database software ecosystem. In this initial application of the method, we present essentially a full optimization of all valence parameters and report tests of the resulting force field against compounds and data types outside the training set. These tests show improvements in optimized geometries and conformational energetics and demonstrate that Parsley's accuracy for liquid properties is similar to that of other general force fields, as is accuracy on binding free energies. We find that this initial Parsley force field affords accuracy similar to that of other general force fields when used to calculate relative binding free energies spanning 199 protein-ligand systems. Additionally, the resulting infrastructure allows us to rapidly optimize an entirely new force field with minimal human intervention.
Published: 2021

11. Enhancing water sampling of buried binding sites using nonequilibrium candidate Monte Carlo

Author: Bergazin, Teresa Danielle, Ben-Shalom, Ido Y, Lim, Nathan M, Gill, Sam C, Gilson, Michael K, and Mobley, David L
Subjects: Medicinal and Biomolecular Chemistry, Chemical Sciences, Theoretical and Computational Chemistry, Generic health relevance, Binding Sites, Ligands, Molecular Dynamics Simulation, Monte Carlo Method, Protein Binding, Protein Conformation, Proteins, Thermodynamics, Water, Molecular Dynamics simulations, Monte Carlo, NCMC, Nonequilibrium candidate Monte Carlo, Enhanced sampling, Water sampling, Buried binding sites, Buried cavity, Buried water, Major Urinary Protein, Heat Shock Protein 90, Medicinal & Biomolecular Chemistry, Medicinal and biomolecular chemistry, Theoretical and computational chemistry
Abstract: Water molecules can be found interacting with the surface and within cavities in proteins. However, water exchange between bulk and buried hydration sites can be slow compared to simulation timescales, thus leading to the inefficient sampling of the locations of water. This can pose problems for free energy calculations for computer-aided drug design. Here, we apply a hybrid method that combines nonequilibrium candidate Monte Carlo (NCMC) simulations and molecular dynamics (MD) to enhance sampling of water in specific areas of a system, such as the binding site of a protein. Our approach uses NCMC to gradually remove interactions between a selected water molecule and its environment, then translates the water to a new region, before turning the interactions back on. This approach of gradual removal of interactions, followed by a move and then reintroduction of interactions, allows the environment to relax in response to the proposed water translation, improving acceptance of moves and thereby accelerating water exchange and sampling. We validate this approach on several test systems including the ligand-bound MUP-1 and HSP90 proteins with buried crystallographic waters removed. We show that our BLUES (NCMC/MD) method enhances water sampling relative to normal MD when applied to these systems. Thus, this approach provides a strategy to improve water sampling in molecular simulations which may be useful in practical applications in drug discovery and biomolecular design.
Published: 2021

12. Enhanced Diffusion and Chemotaxis of Enzymes

Author: Feng, Mudong and Gilson, Michael K.
Subjects: Physics - Chemical Physics, Condensed Matter - Mesoscale and Nanoscale Physics, Physics - Biological Physics, Quantitative Biology - Biomolecules
Abstract: Many enzymes appear to diffuse faster in the presence of substrate and to drift either up or down a concentration gradient of their substrate. Observations of these phenomena, termed enhanced enzyme diffusion (EED) and enzyme chemotaxis, respectively, lead to a novel view of enzymes as active matter. Enzyme chemotaxis and EED may be important in biology, and they could have practical applications in biotechnology and nanotechnology. They also are of considerable biophysical interest; indeed, their physical mechanisms are still quite uncertain. This review provides an analytic summary of experimental studies of these phenomena and of the mechanisms that have been proposed to explain them, and offers a perspective of future directions for the field.
Published: 2019

13. The SAMPL6 SAMPLing challenge: assessing the reliability and efficiency of binding free energy calculations

Author: Rizzi, Andrea, Jensen, Travis, Slochower, David R, Aldeghi, Matteo, Gapsys, Vytautas, Ntekoumes, Dimitris, Bosisio, Stefano, Papadourakis, Michail, Henriksen, Niel M, de Groot, Bert L, Cournia, Zoe, Dickson, Alex, Michel, Julien, Gilson, Michael K, Shirts, Michael R, Mobley, David L, and Chodera, John D
Subjects: Affordable and Clean Energy, Bridged-Ring Compounds, Entropy, Imidazoles, Ligands, Macrocyclic Compounds, Physical Phenomena, Protein Binding, Proteins, Quantum Theory, Solvents, Thermodynamics, SAMPL6, Host-guest, Binding affinity, Free energy calculations, Cucurbit[8]uril, Octa-acid, Sampling, Host–guest, Medicinal and Biomolecular Chemistry, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry
Abstract: Approaches for computing small molecule binding free energies based on molecular simulations are now regularly being employed by academic and industry practitioners to study receptor-ligand systems and prioritize the synthesis of small molecules for ligand design. Given the variety of methods and implementations available, it is natural to ask how the convergence rates and final predictions of these methods compare. In this study, we describe the concept and results for the SAMPL6 SAMPLing challenge, the first challenge from the SAMPL series focusing on the assessment of convergence properties and reproducibility of binding free energy methodologies. We provided parameter files, partial charges, and multiple initial geometries for two octa-acid (OA) and one cucurbit[8]uril (CB8) host-guest systems. Participants submitted binding free energy predictions as a function of the number of force and energy evaluations for seven different alchemical and physical-pathway (i.e., potential of mean force and weighted ensemble of trajectories) methodologies implemented with the GROMACS, AMBER, NAMD, or OpenMM simulation engines. To rank the methods, we developed an efficiency statistic based on bias and variance of the free energy estimates. For the two small OA binders, the free energy estimates computed with alchemical and potential of mean force approaches show relatively similar variance and bias as a function of the number of energy/force evaluations, with the attach-pull-release (APR), GROMACS expanded ensemble, and NAMD double decoupling submissions obtaining the greatest efficiency. The differences between the methods increase when analyzing the CB8-quinine system, where both the guest size and correlation times for system dynamics are greater. For this system, nonequilibrium switching (GROMACS/NS-DS/SB) obtained the overall highest efficiency. Surprisingly, the results suggest that specifying force field parameters and partial charges is insufficient to generally ensure reproducibility, and we observe differences between seemingly converged predictions ranging approximately from 0.3 to 1.0 kcal/mol, even with almost identical simulations parameters and system setup (e.g., Lennard-Jones cutoff, ionic composition). Further work will be required to completely identify the exact source of these discrepancies. Among the conclusions emerging from the data, we found that Hamiltonian replica exchange-while displaying very small variance-can be affected by a slowly-decaying bias that depends on the initial population of the replicas, that bidirectional estimators are significantly more efficient than unidirectional estimators for nonequilibrium free energy calculations for systems considered, and that the Berendsen barostat introduces non-negligible artifacts in expanded ensemble simulations.
Published: 2020

14. Data-Driven Mapping of Gas-Phase Quantum Calculations to General Force Field Lennard-Jones Parameters.

Author: Kantonen, Sophie M, Muddana, Hari S, Schauperl, Michael, Henriksen, Niel M, Wang, Lee-Ping, and Gilson, Michael K
Subjects: Chemical Sciences, Physical Chemistry, Theoretical and Computational Chemistry, Biochemistry and Cell Biology, Computer Software, Chemical Physics, Physical chemistry, Theoretical and computational chemistry
Abstract: Molecular dynamics simulations are helpful tools for a range of applications, ranging from drug discovery to protein structure determination. The successful use of this technology largely depends on the potential function, or force field, used to determine the potential energy at each configuration of the system. Most force fields encode all of the relevant parameters to be used in distinct atom types, each associated with parameters for all parts of the force field, typically bond stretches, angle bends, torsions, and nonbonded terms accounting for van der Waals and electrostatic interactions. Much attention has been paid to the nonbonded parameters and their derivation, which are important in particular due to their governance of noncovalent interactions, such as protein-ligand binding. Parametrization involves adjusting the nonbonded parameters to minimize the error between simulation results and experimental properties, such as heats of vaporization and densities of neat liquids. In this setting, determining the best set of atom types is far from trivial, and the large number of parameters to be fit for the atom types in a typical force field can make it difficult to approach a true optimum. Here, we utilize a previously described Minimal Basis Iterative Stockholder (MBIS) method to carry out an atoms-in-molecules partitioning of electron densities. Information from these atomic densities is then mapped to Lennard-Jones parameters using a set of mapping parameters much smaller than the typical number of atom types in a force field. This approach is advantageous in two ways: it eliminates atom types by allowing each atom to have unique Lennard-Jones parameters, and it greatly reduces the number of parameters to be optimized. We show that this approach yields results comparable to those obtained with the typed GAFF 1.7 force field, even when trained on a relatively small amount of experimental data.
Published: 2020

15. D3R grand challenge 4: blind prediction of protein–ligand poses, affinity rankings, and relative binding free energies

Author: Parks, Conor D, Gaieb, Zied, Chiu, Michael, Yang, Huanwang, Shao, Chenghua, Walters, W Patrick, Jansen, Johanna M, McGaughey, Georgia, Lewis, Richard A, Bembenek, Scott D, Ameriks, Michael K, Mirzadegan, Tara, Burley, Stephen K, Amaro, Rommie E, and Gilson, Michael K
Subjects: Neurodegenerative, Amyloid Precursor Protein Secretases, Aspartic Acid Endopeptidases, Drug Design, Enzyme Inhibitors, Humans, Ligands, Machine Learning, Molecular Docking Simulation, Small Molecule Libraries, Thermodynamics, D3R, Docking, Scoring, Ligand ranking, Free-energy, Blinded prediction challenge, Medicinal and Biomolecular Chemistry, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry
Abstract: The Drug Design Data Resource (D3R) aims to identify best practice methods for computer aided drug design through blinded ligand pose prediction and affinity challenges. Herein, we report on the results of Grand Challenge 4 (GC4). GC4 focused on proteins beta secretase 1 and Cathepsin S, and was run in an analogous manner to prior challenges. In Stage 1, participant ability to predict the pose and affinity of BACE1 ligands were assessed. Following the completion of Stage 1, all BACE1 co-crystal structures were released, and Stage 2 tested affinity rankings with co-crystal structures. We provide an analysis of the results and discuss insights into determined best practice methods.
Published: 2020

16. Non-bonded force field model with advanced restrained electrostatic potential charges (RESP2)

Author: Schauperl, Michael, Nerenberg, Paul S, Jang, Hyesu, Wang, Lee-Ping, Bayly, Christopher I, Mobley, David L, and Gilson, Michael K
Abstract: The restrained electrostatic potential (RESP) approach is a highly regarded and widely used method of assigning partial charges to molecules for simulations. RESP uses a quantum-mechanical method that yields fortuitous overpolarization and thereby accounts only approximately for self-polarization of molecules in the condensed phase. Here we present RESP2, a next generation of this approach, where the polarity of the charges is tuned by a parameter, δ, which scales the contributions from gas- and aqueous-phase calculations. When the complete non-bonded force field model, including Lennard-Jones parameters, is optimized to liquid properties, improved accuracy is achieved, even with this reduced set of five Lennard-Jones types. We argue that RESP2 with δ≈0.6 (60% aqueous, 40% gas-phase charges) is an accurate and robust method of generating partial charges, and that a small set of Lennard-Jones types is good starting point for a systematic re-optimization of this important non-bonded term.
Published: 2020

17. Data-driven analysis of the number of Lennard–Jones types needed in a force field

Author: Schauperl, Michael, Kantonen, Sophie M, Wang, Lee-Ping, and Gilson, Michael K
Abstract: Force fields used in molecular simulations contain numerical parameters, such as Lennard-Jones (LJ) parameters, which are assigned to the atoms in a molecule based on a classification of their chemical environments. The number of classes, or types, should be no more than needed to maximize agreement with experiment, as parsimony avoids overfitting and simplifies parameter optimization. However, types have historically been crafted based largely on chemical intuition, so current force fields may contain more types than needed. In this study, we seek the minimum number of LJ parameter types needed to represent key properties of organic liquids. We find that highly competitive force field accuracy is obtained with minimalist sets of LJ types; e.g. two H types and one type apiece for C, O, and N atoms. We also find that the fitness surface has multiple minima, which can lead to local trapping of the optimizer.
Published: 2020

18. Binding Thermodynamics of Host-Guest Systems with SMIRNOFF99Frosst 1.0.5 from the Open Force Field Initiative.

Author: Slochower, David R, Henriksen, Niel M, Wang, Lee-Ping, Chodera, John D, Mobley, David L, and Gilson, Michael K
Subjects: Medicinal and Biomolecular Chemistry, Chemical Sciences, Bioengineering, Generic health relevance, Ligands, Models, Molecular, Thermodynamics, alpha-Cyclodextrins, beta-Cyclodextrins, Theoretical and Computational Chemistry, Biochemistry and Cell Biology, Computer Software, Chemical Physics, Physical chemistry, Theoretical and computational chemistry
Abstract: Designing ligands that bind their target biomolecules with high affinity and specificity is a key step in small-molecule drug discovery, but accurately predicting protein-ligand binding free energies remains challenging. Key sources of errors in the calculations include inadequate sampling of conformational space, ambiguous protonation states, and errors in force fields. Noncovalent complexes between a host molecule with a binding cavity and a druglike guest molecule have emerged as powerful model systems. As model systems, host-guest complexes reduce many of the errors in more complex protein-ligand binding systems, as their small size greatly facilitates conformational sampling, and one can choose systems that avoid ambiguities in protonation states. These features, combined with their ease of experimental characterization, make host-guest systems ideal model systems to test and ultimately optimize force fields in the context of binding thermodynamics calculations. The Open Force Field Initiative aims to create a modern, open software infrastructure for automatically generating and assessing force fields using data sets. The first force field to arise out of this effort, named SMIRNOFF99Frosst, has approximately one tenth the number of parameters, in version 1.0.5, compared to typical general small molecule force fields, such as GAFF. Here, we evaluate the accuracy of this initial force field, using free energy calculations of 43 α and β-cyclodextrin host-guest pairs for which experimental thermodynamic data are available, and compare with matched calculations using two versions of GAFF. For all three force fields, we used TIP3P water and AM1-BCC charges. The calculations are performed using the attach-pull-release (APR) method as implemented in the open source package, pAPRika. For binding free energies, the root-mean-square error of the SMIRNOFF99Frosst calculations relative to experiment is 0.9 [0.7, 1.1] kcal/mol, while the corresponding results for GAFF 1.7 and GAFF 2.1 are 0.9 [0.7, 1.1] kcal/mol and 1.7 [1.5, 1.9] kcal/mol, respectively, with 95% confidence ranges in brackets. These results suggest that SMIRNOFF99Frosst performs competitively with existing small molecule force fields and is a parsimonious starting point for optimization.
Published: 2019

19. Continuous Evaluation of Ligand Protein Predictions: A Weekly Community Challenge for Drug Docking

Author: Wagner, Jeffrey R, Churas, Christopher P, Liu, Shuai, Swift, Robert V, Chiu, Michael, Shao, Chenghua, Feher, Victoria A, Burley, Stephen K, Gilson, Michael K, and Amaro, Rommie E
Subjects: Patient Safety, Bioengineering, Generic health relevance, Binding Sites, Computational Biology, Crystallography, X-Ray, Drug Design, Ligands, Molecular Docking Simulation, Protein Binding, Protein Conformation, Proteins, Structure-Activity Relationship, CELPP, D3R, RCSB PDB, community resource, drug design data resource, drug docking, pose prediction, Biophysics
Abstract: Docking calculations can accelerate drug discovery by predicting the bound poses of ligands for a targeted protein. However, it is not clear which docking methods work best. Furthermore, predicting poses requires steps outside the docking algorithm itself, such as preparation of the protein and ligand, and it is not known which components are most in need of improvement. The Continuous Evaluation of Ligand Protein Predictions (CELPP) is a blinded prediction challenge designed to address these issues. Participants create a workflow to predict protein-ligand binding poses, which is then tasked with predicting 10-100 new protein-ligand crystal structures each week. CELPP evaluates the accuracy of each workflow's predictions and posts the scores online. The results can be used to identify the strengths and weaknesses of current approaches, help map docking problems to the algorithms most likely to overcome them, and illuminate areas of unmet need in structure-guided drug design.
Published: 2019

20. Antitumor Activity of 1,18-Octadecanedioic Acid-Paclitaxel Complexed with Human Serum Albumin

Author: Callmann, Cassandra E, LeGuyader, Clare LM, Burton, Spencer T, Thompson, Matthew P, Hennis, Robert, Barback, Christopher, Henriksen, Niel M, Chan, Warren C, Jaremko, Matt J, Yang, Jin, Garcia, Arnold, Burkart, Michael D, Gilson, Michael K, Momper, Jeremiah D, Bertin, Paul A, and Gianneschi, Nathan C
Subjects: Engineering, Chemical Sciences, Cancer, Animals, Antineoplastic Agents, Cell Line, Tumor, Cell Proliferation, Dicarboxylic Acids, Dose-Response Relationship, Drug, Humans, Mice, Mice, Nude, Models, Molecular, Molecular Structure, Neoplasms, Experimental, Paclitaxel, Prodrugs, Serum Albumin, Human, Stearic Acids, General Chemistry, Chemical sciences
Abstract: We describe the design, synthesis, and antitumor activity of an 18 carbon α,ω-dicarboxylic acid monoconjugated via an ester linkage to paclitaxel (PTX). This 1,18-octadecanedioic acid-PTX (ODDA-PTX) prodrug readily forms a noncovalent complex with human serum albumin (HSA). Preservation of the terminal carboxylic acid moiety on ODDA-PTX enables binding to HSA in the same manner as native long-chain fatty acids (LCFAs), within hydrophobic pockets, maintaining favorable electrostatic contacts between the ω-carboxylate of ODDA-PTX and positively charged amino acid residues of the protein. This carrier strategy for small molecule drugs is based on naturally evolved interactions between LCFAs and HSA, demonstrated here for PTX. ODDA-PTX shows differentiated pharmacokinetics, higher maximum tolerated doses and increased efficacy in vivo in multiple subcutaneous murine xenograft models of human cancer, as compared to two FDA-approved clinical formulations, Cremophor EL-formulated paclitaxel (crPTX) and Abraxane (nanoparticle albumin-bound (nab)-paclitaxel).
Published: 2019

21. Entropic effects enable life at extreme temperatures

Author: Kim, Young Hun, Leriche, Geoffray, Diraviyam, Karthik, Koyanagi, Takaoki, Gao, Kaifu, Onofrei, David, Patterson, Joseph, Guha, Anirvan, Gianneschi, Nathan, Holland, Gregory P, Gilson, Michael K, Mayer, Michael, Sept, David, and Yang, Jerry
Subjects: Biochemistry and Cell Biology, Physical Sciences, Biological Sciences, Generic health relevance, Adaptation, Physiological, Archaea, Calorimetry, Differential Scanning, Cell Membrane Permeability, Cryoelectron Microscopy, Entropy, Hot Temperature, Lipid Bilayers, Liposomes, Microscopy, Atomic Force, Molecular Dynamics Simulation
Abstract: Maintaining membrane integrity is a challenge at extreme temperatures. Biochemical synthesis of membrane-spanning lipids is one adaptation that organisms such as thermophilic archaea have evolved to meet this challenge and preserve vital cellular function at high temperatures. The molecular-level details of how these tethered lipids affect membrane dynamics and function, however, remain unclear. Using synthetic monolayer-forming lipids with transmembrane tethers, here, we reveal that lipid tethering makes membrane permeation an entropically controlled process that helps to limit membrane leakage at elevated temperatures relative to bilayer-forming lipid membranes. All-atom molecular dynamics simulations support a view that permeation through membranes made of tethered lipids reduces the torsional entropy of the lipids and leads to tighter lipid packing, providing a molecular interpretation for the increased transition-state entropy of leakage.
Published: 2019

22. A Thermodynamic Limit on the Role of Self-Propulsion in Enhanced Enzyme Diffusion

Author: Feng, Mudong and Gilson, Michael K
Subjects: Biochemistry and Cell Biology, Biological Sciences, Diffusion, Enzymes, Kinetics, Models, Biological, Thermodynamics, Physical Sciences, Chemical Sciences, Biophysics, Biological sciences, Chemical sciences, Physical sciences
Abstract: A number of enzymes reportedly exhibit enhanced diffusion in the presence of their substrates, with a Michaelis-Menten-like concentration dependence. Although no definite explanation of this phenomenon has emerged, a physical picture of enzyme self-propulsion using energy from the catalyzed reaction has been widely considered. Here, we present a kinematic and thermodynamic analysis of enzyme self-propulsion that is independent of any specific propulsion mechanism. Using this theory, along with biophysical data compiled for all enzymes so far shown to undergo enhanced diffusion, we show that the propulsion speed required to generate experimental levels of enhanced diffusion exceeds the speeds of well-known active biomolecules, such as myosin, by several orders of magnitude. Furthermore, the minimal power dissipation required to account for enzyme enhanced diffusion by self-propulsion markedly exceeds the chemical power available from enzyme-catalyzed reactions. Alternative explanations for the observation of enhanced enzyme diffusion therefore merit stronger consideration.
Published: 2019

23. Toward Learned Chemical Perception of Force Field Typing Rules.

Author: Zanette, Camila, Bannan, Caitlin C, Bayly, Christopher I, Fass, Josh, Gilson, Michael K, Shirts, Michael R, Chodera, John D, and Mobley, David L
Subjects: Chemical Sciences, Theoretical and Computational Chemistry, Bioengineering, Humans, Molecular Dynamics Simulation, Monte Carlo Method, Quantum Theory, Biochemistry and Cell Biology, Computer Software, Chemical Physics, Physical chemistry, Theoretical and computational chemistry
Abstract: Molecular mechanics force fields define how the energy and forces in a molecular system are computed from its atomic positions, thus enabling the study of such systems through computational methods like molecular dynamics and Monte Carlo simulations. Despite progress toward automated force field parametrization, considerable human expertise is required to develop or extend force fields. In particular, human input has long been required to define atom types, which encode chemically unique environments that determine which parameters will be assigned. However, relying on humans to establish atom types is suboptimal. Human-created atom types are often developed without statistical justification, leading to over- or under-fitting of data. Human-created types are also difficult to extend in a systematic and consistent manner when new chemistries must be modeled or new data becomes available. Finally, human effort is not scalable when force fields must be generated for new (bio)polymers, compound classes, or materials. To remedy these deficiencies, our long-term goal is to replace human specification of atom types with an automated approach, based on rigorous statistics and driven by experimental and/or quantum chemical reference data. In this work, we describe novel methods that automate the discovery of appropriate chemical perception: SMARTY allows for the creation of atom types, while SMIRKY goes further by automating the creation of fragment (nonbonded, bonds, angles, and torsions) types. These approaches enable the creation of move sets in atom or fragment type space, which are used within a Monte Carlo optimization approach. We demonstrate the power of these new methods by automating the rediscovery of human defined atom types (SMARTY) or fragment types (SMIRKY) in existing small molecule force fields. We assess these approaches using several molecular data sets, including one which covers a diverse subset of the DrugBank database.
Published: 2019

24. D3R Grand Challenge 3: blind prediction of protein–ligand poses and affinity rankings

Author: Gaieb, Zied, Parks, Conor D, Chiu, Michael, Yang, Huanwang, Shao, Chenghua, Walters, W Patrick, Lambert, Millard H, Nevins, Neysa, Bembenek, Scott D, Ameriks, Michael K, Mirzadegan, Tara, Burley, Stephen K, Amaro, Rommie E, and Gilson, Michael K
Subjects: Generic health relevance, Binding Sites, Cathepsins, Computer-Aided Design, Crystallography, X-Ray, Databases, Protein, Drug Design, Ligands, Molecular Docking Simulation, Protein Binding, Protein Conformation, Protein Kinase Inhibitors, Protein Kinases, Thermodynamics, D3R, Drug Design Data Resource, Docking, Scoring, Ligand ranking, Blinded prediction challenge, Medicinal and Biomolecular Chemistry, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry
Abstract: The Drug Design Data Resource aims to test and advance the state of the art in protein-ligand modeling by holding community-wide blinded, prediction challenges. Here, we report on our third major round, Grand Challenge 3 (GC3). Held 2017-2018, GC3 centered on the protein Cathepsin S and the kinases VEGFR2, JAK2, p38-α, TIE2, and ABL1, and included both pose-prediction and affinity-ranking components. GC3 was structured much like the prior challenges GC2015 and GC2. First, Stage 1 tested pose prediction and affinity ranking methods; then all available crystal structures were released, and Stage 2 tested only affinity rankings, now in the context of the available structures. Unique to GC3 was the addition of a Stage 1b self-docking subchallenge, in which the protein coordinates from all of the cocrystal structures used in the cross-docking challenge were released, and participants were asked to predict the pose of CatS ligands using these newly released structures. We provide an overview of the outcomes and discuss insights into trends and best-practices.
Published: 2019

25. Discovering de novo peptide substrates for enzymes using machine learning.

Author: Tallorin, Lorillee, Wang, JiaLei, Kim, Woojoo E, Sahu, Swagat, Kosa, Nicolas M, Yang, Pu, Thompson, Matthew, Gilson, Michael K, Frazier, Peter I, Burkart, Michael D, and Gianneschi, Nathan C
Subjects: Transferases (Other Substituted Phosphate Groups), Peptides, Bacterial Proteins, Recombinant Proteins, Bayes Theorem, Amino Acid Sequence, Protein Binding, Substrate Specificity, Machine Learning, Transferases
Abstract: The discovery of peptide substrates for enzymes with exclusive, selective activities is a central goal in chemical biology. In this paper, we develop a hybrid computational and biochemical method to rapidly optimize peptides for specific, orthogonal biochemical functions. The method is an iterative machine learning process by which experimental data is deposited into a mathematical algorithm that selects potential peptide substrates to be tested experimentally. Once tested, the algorithm uses the experimental data to refine future selections. This process is repeated until a suitable set of de novo peptide substrates are discovered. We employed this technology to discover orthogonal peptide substrates for 4'-phosphopantetheinyl transferase, an enzyme class that covalently modifies proteins. In this manner, we have demonstrated that machine learning can be leveraged to guide peptide optimization for specific biochemical functions not immediately accessible by biological screening techniques, such as phage display and random mutagenesis.
Published: 2018

26. Escaping Atom Types in Force Fields Using Direct Chemical Perception.

Author: Mobley, David L, Bannan, Caitlin C, Rizzi, Andrea, Bayly, Christopher I, Chodera, John D, Lim, Victoria T, Lim, Nathan M, Beauchamp, Kyle A, Slochower, David R, Shirts, Michael R, Gilson, Michael K, and Eastman, Peter K
Subjects: Theoretical and Computational Chemistry, Biochemistry and Cell Biology, Computer Software, Chemical Physics
Abstract: Traditional approaches to specifying a molecular mechanics force field encode all the information needed to assign force field parameters to a given molecule into a discrete set of atom types. This is equivalent to a representation consisting of a molecular graph comprising a set of vertices, which represent atoms labeled by atom type, and unlabeled edges, which represent chemical bonds. Bond stretch, angle bend, and dihedral parameters are then assigned by looking up bonded pairs, triplets, and quartets of atom types in parameter tables to assign valence terms and using the atom types themselves to assign nonbonded parameters. This approach, which we call indirect chemical perception because it operates on the intermediate graph of atom-typed nodes, creates a number of technical problems. For example, atom types must be sufficiently complex to encode all necessary information about the molecular environment, making it difficult to extend force fields encoded this way. Atom typing also results in a proliferation of redundant parameters applied to chemically equivalent classes of valence terms, needlessly increasing force field complexity. Here, we describe a new approach to assigning force field parameters via direct chemical perception. Rather than working through the intermediary of the atom-typed graph, direct chemical perception operates directly on the unmodified chemical graph of the molecule to assign parameters. In particular, parameters are assigned to each type of force field term (e.g., bond stretch, angle bend, torsion, and Lennard-Jones) based on standard chemical substructure queries implemented via the industry-standard SMARTS chemical perception language, using SMIRKS extensions that permit labeling of specific atoms within a chemical pattern. We use this to implement a new force field format, called the SMIRKS Native Open Force Field (SMIRNOFF) format. We demonstrate the power and generality of this approach using examples of specific molecules that pose problems for indirect chemical perception and construct and validate a minimalist yet very general force field, SMIRNOFF99Frosst. We find that a parameter definition file only ∼300 lines long provides coverage of all but
Published: 2018

27. Overview of the SAMPL6 host–guest binding affinity prediction challenge

Author: Rizzi, Andrea, Murkli, Steven, McNeill, John N, Yao, Wei, Sullivan, Matthew, Gilson, Michael K, Chiu, Michael W, Isaacs, Lyle, Gibb, Bruce C, Mobley, David L, and Chodera, John D
Subjects: Macromolecular and Materials Chemistry, Medicinal and Biomolecular Chemistry, Organic Chemistry, Chemical Sciences, Bioengineering, Bridged-Ring Compounds, Carboxylic Acids, Computer Simulation, Cycloparaffins, Drug Design, Imidazoles, Ligands, Macrocyclic Compounds, Molecular Structure, Protein Binding, Proteins, Software, Thermodynamics, SAMPL6, Host-guest, Blind challenge, Binding affinity, Free energy, Cucurbit[8]uril, Octa-acid, Host–guest, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry, Medicinal and biomolecular chemistry, Theoretical and computational chemistry
Abstract: Accurately predicting the binding affinities of small organic molecules to biological macromolecules can greatly accelerate drug discovery by reducing the number of compounds that must be synthesized to realize desired potency and selectivity goals. Unfortunately, the process of assessing the accuracy of current computational approaches to affinity prediction against binding data to biological macromolecules is frustrated by several challenges, such as slow conformational dynamics, multiple titratable groups, and the lack of high-quality blinded datasets. Over the last several SAMPL blind challenge exercises, host-guest systems have emerged as a practical and effective way to circumvent these challenges in assessing the predictive performance of current-generation quantitative modeling tools, while still providing systems capable of possessing tight binding affinities. Here, we present an overview of the SAMPL6 host-guest binding affinity prediction challenge, which featured three supramolecular hosts: octa-acid (OA), the closely related tetra-endo-methyl-octa-acid (TEMOA), and cucurbit[8]uril (CB8), along with 21 small organic guest molecules. A total of 119 entries were received from ten participating groups employing a variety of methods that spanned from electronic structure and movable type calculations in implicit solvent to alchemical and potential of mean force strategies using empirical force fields with explicit solvent models. While empirical models tended to obtain better performance than first-principle methods, it was not possible to identify a single approach that consistently provided superior results across all host-guest systems and statistical metrics. Moreover, the accuracy of the methodologies generally displayed a substantial dependence on the system considered, emphasizing the need for host diversity in blind evaluations. Several entries exploited previous experimental measurements of similar host-guest systems in an effort to improve their physical-based predictions via some manner of rudimentary machine learning; while this strategy succeeded in reducing systematic errors, it did not correspond to an improvement in statistical correlation. Comparison to previous rounds of the host-guest binding free energy challenge highlights an overall improvement in the correlation obtained by the affinity predictions for OA and TEMOA systems, but a surprising lack of improvement regarding root mean square error over the past several challenge rounds. The data suggests that further refinement of force field parameters, as well as improved treatment of chemical effects (e.g., buffer salt conditions, protonation states), may be required to further enhance predictive accuracy.
Published: 2018

28. BAT2: an Open-Source Tool for Flexible, Automated, and Low Cost Absolute Binding Free Energy Calculations.

Author: Heinzelmann, Germano, Huggins, David J., and Gilson, Michael K.
Published: 2024
Full Text: View/download PDF

29. The Open Force Field Initiative: Open Software and Open Science for Molecular Modeling.

Author: Wang, Lily, Behara, Pavan Kumar, Thompson, Matthew W., Gokey, Trevor, Wang, Yuanqing, Wagner, Jeffrey R., Cole, Daniel J., Gilson, Michael K., Shirts, Michael R., and Mobley, David L.
Published: 2024
Full Text: View/download PDF

30. Rapid, Accurate, Ranking of Protein–Ligand Binding Affinities with VM2, the Second-Generation Mining Minima Method.

Author: Gilson, Michael K., Stewart, Lawrence E., Potter, Michael J., and Webb, Simon P.
Published: 2024
Full Text: View/download PDF

31. Ligand-Based Compound Activity Prediction via Few-Shot Learning.

Author: Eckmann, Peter, Anderson, Jake, Yu, Rose, and Gilson, Michael K.
Published: 2024
Full Text: View/download PDF

32. Structural insights into the gating of DNA passage by the topoisomerase II DNA-gate.

Author: Chen, Shin-Fu, Huang, Nan-Lan, Lin, Jung-Hsin, Wu, Chyuan-Chuan, Wang, Ying-Ren, Yu, Yu-Jen, Gilson, Michael K, and Chan, Nei-Li
Subjects: Humans, DNA Topoisomerases, Type II, DNA, Crystallography, X-Ray, Allosteric Site, Molecular Conformation, Nucleic Acid Conformation, Protein Conformation, Catalysis, Models, Molecular, Molecular Dynamics Simulation, Poly-ADP-Ribose Binding Proteins, DNA Topoisomerases, Type II, Crystallography, X-Ray, Models, Molecular
Abstract: Type IIA topoisomerases (Top2s) manipulate the handedness of DNA crossovers by introducing a transient and protein-linked double-strand break in one DNA duplex, termed the DNA-gate, whose opening allows another DNA segment to be transported through to change the DNA topology. Despite the central importance of this gate-opening event to Top2 function, the DNA-gate in all reported structures of Top2-DNA complexes is in the closed state. Here we present the crystal structure of a human Top2 DNA-gate in an open conformation, which not only reveals structural characteristics of its DNA-conducting path, but also uncovers unexpected yet functionally significant conformational changes associated with gate-opening. This structure further implicates Top2's preference for a left-handed DNA braid and allows the construction of a model representing the initial entry of another DNA duplex into the DNA-gate. Steered molecular dynamics calculations suggests the Top2-catalyzed DNA passage may be achieved by a rocker-switch-type movement of the DNA-gate.
Published: 2018

33. Motor-like Properties of Nonmotor Enzymes

Author: Slochower, David R and Gilson, Michael K
Subjects: Organic Chemistry, Chemical Sciences, HIV/AIDS, Affordable and Clean Energy, Adenosine Kinase, Cyclic AMP-Dependent Protein Kinases, HIV Protease, Molecular Dynamics Simulation, Movement, Protein Conformation, Thermodynamics, Physical Sciences, Biological Sciences, Biophysics, Biological sciences, Chemical sciences, Physical sciences
Abstract: Molecular motors are thought to generate force and directional motion via nonequilibrium switching between energy surfaces. Because all enzymes can undergo such switching, we hypothesized that the ability to generate rotary motion and torque is not unique to highly adapted biological motor proteins but is instead a common feature of enzymes. We used molecular dynamics simulations to compute energy surfaces for hundreds of torsions in three enzymes-adenosine kinase, protein kinase A, and HIV-1 protease-and used these energy surfaces within a kinetic model that accounts for intersurface switching and intrasurface probability flows. When substrate is out of equilibrium with product, we find computed torsion rotation rates up ∼140 cycles s-1, with stall torques up to ∼2 kcal mol-1 cycle-1, and power outputs up to ∼50 kcal mol-1 s-1. We argue that these enzymes are instances of a general phenomenon of directional probability flows on asymmetric energy surfaces for systems out of equilibrium. Thus, we conjecture that cyclic probability fluxes, corresponding to rotations of torsions and higher-order collective variables, exist in any chiral molecule driven between states in a nonequilibrium manner; we call this the "Asymmetry-Directionality" conjecture. This is expected to apply as well to synthetic chiral molecules switched in a nonequilibrium manner between energy surfaces by light, redox chemistry, or catalysis.
Published: 2018

34. Accounting for apparent deviations between calorimetric and van't Hoff enthalpies

Author: Kantonen, Samuel A, Henriksen, Niel M, and Gilson, Michael K
Subjects: Biochemistry and Cell Biology, Biological Sciences, Algorithms, Amantadine, Calorimetry, Energy Transfer, Hot Temperature, Kinetics, Rimantadine, Thermodynamics, beta-Cyclodextrins, ITC, van't Hoff, Binding thermodynamics, Uncertainty, Pharmacology and Pharmaceutical Sciences, Biochemistry & Molecular Biology, Biochemistry and cell biology
Abstract: BackgroundIn theory, binding enthalpies directly obtained from calorimetry (such as ITC) and the temperature dependence of the binding free energy (van't Hoff method) should agree. However, previous studies have often found them to be discrepant.MethodsExperimental binding enthalpies (both calorimetric and van't Hoff) are obtained for two host-guest pairs using ITC, and the discrepancy between the two enthalpies is examined. Modeling of artificial ITC data is also used to examine how different sources of error propagate to both types of binding enthalpies.ResultsFor the host-guest pairs examined here, good agreement, to within about 0.4kcal/mol, is obtained between the two enthalpies. Additionally, using artificial data, we find that different sources of error propagate to either enthalpy uniquely, with concentration error and heat error propagating primarily to calorimetric and van't Hoff enthalpies, respectively.ConclusionsWith modern calorimeters, good agreement between van't Hoff and calorimetric enthalpies should be achievable, barring issues due to non-ideality or unanticipated measurement pathologies. Indeed, disagreement between the two can serve as a flag for error-prone datasets. A review of the underlying theory supports the expectation that these two quantities should be in agreement.General significanceWe address and arguably resolve long-standing questions regarding the relationship between calorimetric and van't Hoff enthalpies. In addition, we show that comparison of these two quantities can be used as an internal consistency check of a calorimetry study.
Published: 2018

35. Solvation Structure and Thermodynamic Mapping (SSTMap): An Open-Source, Flexible Package for the Analysis of Water in Molecular Dynamics Trajectories

Author: Haider, Kamran, Cruz, Anthony, Ramsey, Steven, Gilson, Michael K, and Kurtzman, Tom
Subjects: Chemical Sciences, Physical Chemistry, Theoretical and Computational Chemistry, Networking and Information Technology R&D (NITRD), Biochemistry and Cell Biology, Computer Software, Chemical Physics, Physical chemistry, Theoretical and computational chemistry
Abstract: We have developed SSTMap, a software package for mapping structural and thermodynamic water properties in molecular dynamics trajectories. The package introduces automated analysis and mapping of local measures of frustration and enhancement of water structure. The thermodynamic calculations are based on Inhomogeneous Fluid Solvation Theory (IST), which is implemented using both site-based and grid-based approaches. The package also extends the applicability of solvation analysis calculations to multiple molecular dynamics (MD) simulation programs by using existing cross-platform tools for parsing MD parameter and trajectory files. SSTMap is implemented in Python and contains both command-line tools and a Python module to facilitate flexibility in setting up calculations and for automated generation of large data sets involving analysis of multiple solutes. Output is generated in formats compatible with popular Python data science packages. This tool will be used by the molecular modeling community for computational analysis of water in problems of biophysical interest such as ligand binding and protein function.
Published: 2018

36. Rapid, Accurate, Ranking of Protein-Ligand Binding Affinities with VM2, the 2nd –Generation Mining Minima Method

Author: Gilson, Michael K., primary, Stewart, Lawrence E., additional, Potter, Michael J., additional, and Webb, Simon P., additional
Published: 2024
Full Text: View/download PDF

37. Free Energy Density of a Fluid and Its Role in Solvation and Binding

Author: Gilson, Michael K, primary and Kurtzman, Tom, additional
Published: 2024
Full Text: View/download PDF

38. The Free Energy Density of a Fluid and its Role in Solvation and Binding

Author: Gilson, Michael K., primary and Kurtzman, Tom, additional
Published: 2024
Full Text: View/download PDF

39. A fast, convenient, polarizable electrostatic model for molecular dynamics

Author: Wang, Liangyue, primary, Schauperl, Michael, additional, Mobley, David L., additional, Bayly, Christopher, additional, and Gilson, Michael K., additional
Published: 2024
Full Text: View/download PDF

40. Host–guest systems for the SAMPL9 blinded prediction challenge: phenothiazine as a privileged scaffold for binding to cyclodextrins

Author: Andrade, Brenda, primary, Chen, Ashley, additional, and Gilson, Michael K., additional
Published: 2024
Full Text: View/download PDF

41. Testing inhomogeneous solvation theory in structure-based ligand discovery

Author: Balius, Trent E, Fischer, Marcus, Stein, Reed M, Adler, Thomas B, Nguyen, Crystal N, Cruz, Anthony, Gilson, Michael K, Kurtzman, Tom, and Shoichet, Brian K
Subjects: Theory Of Computation, Medicinal and Biomolecular Chemistry, Chemical Sciences, Built Environment and Design, Information and Computing Sciences, Design, Algorithms, Binding Sites, Computational Biology, Crystallography, X-Ray, Kinetics, Ligands, Molecular Docking Simulation, Molecular Structure, Protein Binding, Protein Conformation, Solutions, Solvents, Thermodynamics, Water, water, inhomogeneous solvation theory, ligand discovery, structure-based drug design, docking
Abstract: Binding-site water is often displaced upon ligand recognition, but is commonly neglected in structure-based ligand discovery. Inhomogeneous solvation theory (IST) has become popular for treating this effect, but it has not been tested in controlled experiments at atomic resolution. To do so, we turned to a grid-based version of this method, GIST, readily implemented in molecular docking. Whereas the term only improves docking modestly in retrospective ligand enrichment, it could be added without disrupting performance. We thus turned to prospective docking of large libraries to investigate GIST's impact on ligand discovery, geometry, and water structure in a model cavity site well-suited to exploring these terms. Although top-ranked docked molecules with and without the GIST term often overlapped, many ligands were meaningfully prioritized or deprioritized; some of these were selected for testing. Experimentally, 13/14 molecules prioritized by GIST did bind, whereas none of the molecules that it deprioritized were observed to bind. Nine crystal complexes were determined. In six, the ligand geometry corresponded to that predicted by GIST, for one of these the pose without the GIST term was wrong, and three crystallographic poses differed from both predictions. Notably, in one structure, an ordered water molecule with a high GIST displacement penalty was observed to stay in place. Inclusion of this water-displacement term can substantially improve the hit rates and ligand geometries from docking screens, although the magnitude of its effects can be small and its impact in drug binding sites merits further controlled studies.
Published: 2017

42. Erratum to: Lessons learned from comparing molecular dynamics engines on the SAMPL5 dataset

Author: Shirts, Michael R, Klein, Christoph, Swails, Jason M, Yin, Jian, Gilson, Michael K, Mobley, David L, Case, David A, and Zhong, Ellen D
Subjects: Medicinal and Biomolecular Chemistry, Chemical Sciences, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry, Medicinal and biomolecular chemistry, Theoretical and computational chemistry
Abstract: Electronic Supplementary Material (ESM) (referred to in the article as Supporting Information) had erroneously not been published in the original publication. The missing ESM are provided in this erratum.
Published: 2017

43. Results of the 2017 Roadmap survey of the Statistical Assessment of Modeling of Proteins and Ligands (SAMPL) challenge community

Author: Mobley, David L, Chodera, John D, and Gilson, Michael K
Published: 2017

44. Predicting Binding Free Energies

Author: Mobley, David L and Gilson, Michael K
Subjects: Networking and Information Technology R&D (NITRD), Bioengineering, Patient Safety, Benchmarking, Bridged-Ring Compounds, Computer Simulation, Drug Discovery, Imidazoles, Ligands, Models, Molecular, Muramidase, Protein Binding, Proteins, Software, Thermodynamics, binding free energy, molecular simulation, alchemical, benchmark, biomolecular interactions, binding affinity, Medicinal and Biomolecular Chemistry, Biochemistry and Cell Biology, Chemical Engineering, Biophysics
Abstract: Binding free energy calculations based on molecular simulations provide predicted affinities for biomolecular complexes. These calculations begin with a detailed description of a system, including its chemical composition and the interactions among its components. Simulations of the system are then used to compute thermodynamic information, such as binding affinities. Because of their promise for guiding molecular design, these calculations have recently begun to see widespread applications in early-stage drug discovery. However, many hurdles remain in making them a robust and reliable tool. In this review, we highlight key challenges of these calculations, discuss some examples of these challenges, and call for the designation of standard community benchmark test systems that will help the research community generate and evaluate progress. In our view, progress will require careful assessment and evaluation of new methods, force fields, and modeling innovations on well-characterized benchmark systems, and we lay out our vision for how this can be achieved.
Published: 2017

45. Evaluation and Minimization of Uncertainty in ITC Binding Measurements: Heat Error, Concentration Error, Saturation, and Stoichiometry

Author: Kantonen, Samuel A, Henriksen, Niel M, and Gilson, Michael K
Subjects: Biochemistry and Cell Biology, Biological Sciences, Digestive Diseases, Biophysical Phenomena, Calorimetry, Hot Temperature, Least-Squares Analysis, Monte Carlo Method, Protein Binding, Reproducibility of Results, Research Design, Software, Thermodynamics, Uncertainty, ITC, Error evaluation, Error minimization, Data analysis, Pharmacology and Pharmaceutical Sciences, Biochemistry & Molecular Biology, Biochemistry and cell biology
Abstract: BackgroundIsothermal titration calorimetry (ITC) is uniquely useful for characterizing binding thermodynamics, because it straightforwardly provides both the binding enthalpy and free energy. However, the precision of the results depends on the experimental setup and how thermodynamic results are obtained from the raw data.MethodsExperiments and Monte Carlo analysis are used to study how uncertainties in injection heat and concentration propagate to binding enthalpies in various scenarios. We identify regimes in which it is preferable to fix the stoichiometry parameter, N, and evaluate the reliability of uncertainties provided by the least squares method.ResultsThe noise in the injection heat is mainly proportional in character, with ~1% and ~3% uncertainty at 27C and 65C, respectively; concentration errors are ~1%. Simulations of experiments based on these uncertainties delineate how experimental design and curve fitting methods influence the uncertainty in the final results.ConclusionsIn most cases, experimental uncertainty is minimized by using more injections and by fixing N at its known value. With appropriate technique, the uncertainty in measured binding enthalpies can be kept below ~2% under many conditions, including low C values.General significanceWe quantify uncertainties in ITC data due to heat and concentration error, and identify practices to minimize these uncertainties. The resulting guidelines are important when ITC data are used quantitatively, such as to test computer simulations of binding. Reproducibility and further study are supported by free distribution of the new software developed here.
Published: 2017

46. Lessons learned from comparing molecular dynamics engines on the SAMPL5 dataset

Author: Shirts, Michael R, Klein, Christoph, Swails, Jason M, Yin, Jian, Gilson, Michael K, Mobley, David L, Case, David A, and Zhong, Ellen D
Subjects: Bioengineering, Affordable and Clean Energy, Binding Sites, Ligands, Molecular Conformation, Molecular Dynamics Simulation, Molecular Structure, Protein Binding, Proteins, Software, Solvents, Thermodynamics, Water, Molecular dynamics, Simulation validation, Molecular simulation, SAMPL5, Medicinal and Biomolecular Chemistry, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry
Abstract: We describe our efforts to prepare common starting structures and models for the SAMPL5 blind prediction challenge. We generated the starting input files and single configuration potential energies for the host-guest in the SAMPL5 blind prediction challenge for the GROMACS, AMBER, LAMMPS, DESMOND and CHARMM molecular simulation programs. All conversions were fully automated from the originally prepared AMBER input files using a combination of the ParmEd and InterMol conversion programs. We find that the energy calculations for all molecular dynamics engines for this molecular set agree to better than 0.1 % relative absolute energy for all energy components, and in most cases an order of magnitude better, when reasonable choices are made for different cutoff parameters. However, there are some surprising sources of statistically significant differences. Most importantly, different choices of Coulomb's constant between programs are one of the largest sources of discrepancies in energies. We discuss the measures required to get good agreement in the energies for equivalent starting configurations between the simulation programs, and the energy differences that occur when simulations are run with program-specific default simulation parameter values. Finally, we discuss what was required to automate this conversion and comparison.
Published: 2017

47. The SAMPL5 host–guest challenge: computing binding free energies and enthalpies from explicit solvent simulations by the attach-pull-release (APR) method

Author: Yin, Jian, Henriksen, Niel M, Slochower, David R, and Gilson, Michael K
Subjects: Chemical Sciences, Organic Chemistry, Physical Chemistry, Theoretical and Computational Chemistry, Binding Sites, Ligands, Molecular Conformation, Molecular Dynamics Simulation, Molecular Structure, Physical Phenomena, Protein Binding, Proteins, Software, Solvents, Thermodynamics, Water, beta-Cyclodextrins, SAMPL5, Binding free energy, Binding enthalpy, Host-guest, Force field, Water model, Host–guest, Medicinal and Biomolecular Chemistry, Medicinal & Biomolecular Chemistry, Medicinal and biomolecular chemistry, Theoretical and computational chemistry
Abstract: The absolute binding free energies and binding enthalpies of twelve host-guest systems in the SAMPL5 blind challenge were computed using our attach-pull-release (APR) approach. This method has previously shown good correlations between experimental and calculated binding data in retrospective studies of cucurbit[7]uril (CB7) and β-cyclodextrin (βCD) systems. In the present work, the computed binding free energies for host octa acid (OA or OAH) and tetra-endo-methyl octa-acid (TEMOA or OAMe) with guests are in good agreement with prospective experimental data, with a coefficient of determination (R2) of 0.8 and root-mean-squared error of 1.7 kcal/mol using the TIP3P water model. The binding enthalpy calculations achieve moderate correlations, with R2 of 0.5 and RMSE of 2.5 kcal/mol, for TIP3P water. Calculations using the newly developed OPC water model also show good performance. Furthermore, the present calculations semi-quantitatively capture the experimental trend of enthalpy-entropy compensation observed, and successfully predict guests with the strongest and weakest binding affinity. The most populated binding poses of all twelve systems, based on clustering analysis of 750 ns molecular dynamics (MD) trajectories, were extracted and analyzed. Computational methods using MD simulations and explicit solvent models in a rigorous statistical thermodynamic framework, like APR, can generate reasonable predictions of binding thermodynamics. Especially with continuing improvement in simulation force fields, such methods hold the promise of making substantial contributions to hit identification and lead optimization in the drug discovery process.
Published: 2017

48. Probing the orientation of inhibitor and epoxy-eicosatrienoic acid binding in the active site of soluble epoxide hydrolase

Author: Lee, Kin Sing Stephen, Henriksen, Niel M, Ng, Connie J, Yang, Jun, Jia, Weitao, Morisseau, Christophe, Andaya, Armann, Gilson, Michael K, and Hammock, Bruce D
Subjects: Biochemistry and Cell Biology, Biological Sciences, Arachidonic Acids, Carboxylic Acids, Catalysis, Catalytic Domain, Computer Simulation, Crystallization, Cytochrome P-450 Enzyme System, Epoxide Hydrolases, Epoxy Compounds, Escherichia coli, Humans, Hydrolysis, Inhibitory Concentration 50, Light, Mass Spectrometry, Molecular Dynamics Simulation, Peptides, Recombinant Proteins, Solvents, Stereoisomerism, Substrate Specificity, Trypsin, Photoaffinity tag, Photolabel, Soluble epoxide hydrolase, Epoxyeicosatrienoic acid, Computational simulation, Peptide sequencing, Biochemistry & Molecular Biology
Abstract: Soluble epoxide hydrolase (sEH) is an important therapeutic target of many diseases, such as chronic obstructive pulmonary disease (COPD) and diabetic neuropathic pain. It acts by hydrolyzing and thus regulating specific bioactive long chain polyunsaturated fatty acid epoxides (lcPUFA), like epoxyeicosatrienoic acids (EETs). To better predict which epoxides could be hydrolyzed by sEH, one needs to dissect the important factors and structural requirements that govern the binding of the substrates to sEH. This knowledge allows further exploration of the physiological role played by sEH. Unfortunately, a crystal structure of sEH with a substrate bound has not yet been reported. In this report, new photoaffinity mimics of a sEH inhibitor and EET were prepared and used in combination with peptide sequencing and computational modeling, to identify the binding orientation of different regioisomers and enantiomers of EETs into the catalytic cavity of sEH. Results indicate that the stereochemistry of the epoxide plays a crucial role in dictating the binding orientation of the substrate.
Published: 2017

49. Overview of the SAMPL5 host–guest challenge: Are we doing better?

Author: Yin, Jian, Henriksen, Niel M, Slochower, David R, Shirts, Michael R, Chiu, Michael W, Mobley, David L, and Gilson, Michael K
Subjects: Medicinal and Biomolecular Chemistry, Chemical Sciences, Clinical Research, Bioengineering, Generic health relevance, Binding Sites, Drug Design, Ligands, Molecular Dynamics Simulation, Molecular Structure, Protein Binding, Proteins, Solvents, Structure-Activity Relationship, Thermodynamics, Host-guest, Molecular recognition, Computer-aided drug design, Blind challenge, Binding affinity, Host–guest, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry, Medicinal and biomolecular chemistry, Theoretical and computational chemistry
Abstract: The ability to computationally predict protein-small molecule binding affinities with high accuracy would accelerate drug discovery and reduce its cost by eliminating rounds of trial-and-error synthesis and experimental evaluation of candidate ligands. As academic and industrial groups work toward this capability, there is an ongoing need for datasets that can be used to rigorously test new computational methods. Although protein-ligand data are clearly important for this purpose, their size and complexity make it difficult to obtain well-converged results and to troubleshoot computational methods. Host-guest systems offer a valuable alternative class of test cases, as they exemplify noncovalent molecular recognition but are far smaller and simpler. As a consequence, host-guest systems have been part of the prior two rounds of SAMPL prediction exercises, and they also figure in the present SAMPL5 round. In addition to being blinded, and thus avoiding biases that may arise in retrospective studies, the SAMPL challenges have the merit of focusing multiple researchers on a common set of molecular systems, so that methods may be compared and ideas exchanged. The present paper provides an overview of the host-guest component of SAMPL5, which centers on three different hosts, two octa-acids and a glycoluril-based molecular clip, and two different sets of guest molecules, in aqueous solution. A range of methods were applied, including electronic structure calculations with implicit solvent models; methods that combine empirical force fields with implicit solvent models; and explicit solvent free energy simulations. The most reliable methods tend to fall in the latter class, consistent with results in prior SAMPL rounds, but the level of accuracy is still below that sought for reliable computer-aided drug design. Advances in force field accuracy, modeling of protonation equilibria, electronic structure methods, and solvent models, hold promise for future improvements.
Published: 2017

50. Blind prediction of cyclohexane–water distribution coefficients from the SAMPL5 challenge

Author: Bannan, Caitlin C, Burley, Kalistyn H, Chiu, Michael, Shirts, Michael R, Gilson, Michael K, and Mobley, David L
Subjects: Medicinal and Biomolecular Chemistry, Chemical Sciences, Bioengineering, Computer Simulation, Cyclohexanes, Drug Discovery, Models, Chemical, Molecular Structure, Pharmaceutical Preparations, Small Molecule Libraries, Solubility, Solvents, Thermodynamics, Uncertainty, Water, SAMPL, Distribution coefficient, Blind challenge, Free energy, Alchemical, Molecular simulation, Theoretical and Computational Chemistry, Medicinal & Biomolecular Chemistry, Medicinal and biomolecular chemistry, Theoretical and computational chemistry
Abstract: In the recent SAMPL5 challenge, participants submitted predictions for cyclohexane/water distribution coefficients for a set of 53 small molecules. Distribution coefficients (log D) replace the hydration free energies that were a central part of the past five SAMPL challenges. A wide variety of computational methods were represented by the 76 submissions from 18 participating groups. Here, we analyze submissions by a variety of error metrics and provide details for a number of reference calculations we performed. As in the SAMPL4 challenge, we assessed the ability of participants to evaluate not just their statistical uncertainty, but their model uncertainty-how well they can predict the magnitude of their model or force field error for specific predictions. Unfortunately, this remains an area where prediction and analysis need improvement. In SAMPL4 the top performing submissions achieved a root-mean-squared error (RMSE) around 1.5 kcal/mol. If we anticipate accuracy in log D predictions to be similar to the hydration free energy predictions in SAMPL4, the expected error here would be around 1.54 log units. Only a few submissions had an RMSE below 2.5 log units in their predicted log D values. However, distribution coefficients introduced complexities not present in past SAMPL challenges, including tautomer enumeration, that are likely to be important in predicting biomolecular properties of interest to drug discovery, therefore some decrease in accuracy would be expected. Overall, the SAMPL5 distribution coefficient challenge provided great insight into the importance of modeling a variety of physical effects. We believe these types of measurements will be a promising source of data for future blind challenges, especially in view of the relatively straightforward nature of the experiments and the level of insight provided.
Published: 2016

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

769 results on '"Gilson, Michael K."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources