6 results on '"Clyde, Austin"'
Search Results
2. Integration of Computational Docking into Anti-Cancer Drug Response Prediction Models.
- Author
-
Narykov, Oleksandr, Zhu, Yitan, Brettin, Thomas, Evrard, Yvonne A., Partin, Alexander, Shukla, Maulik, Xia, Fangfang, Clyde, Austin, Vasanthakumari, Priyanka, Doroshow, James H., and Stevens, Rick L.
- Subjects
COMPUTER simulation ,DEEP learning ,CLINICAL drug trials ,ANTINEOPLASTIC agents ,MACHINE learning ,GENE expression ,PREDICTION models - Abstract
Simple Summary: Anti-cancer drug response prediction models aim to reduce the time necessary for developing a treatment for patients affected by this complex disease. Their goal is to decrease the number of required biological experiments by computationally weeding out unpromising compounds. In this work, we explore the potential gains of incorporating large-scale applications of classical virtual screening techniques like molecular docking into cutting-edge deep learning models. We demonstrate improvement in performance as well as limitations of our approach. Cancer is a heterogeneous disease in that tumors of the same histology type can respond differently to a treatment. Anti-cancer drug response prediction is of paramount importance for both drug development and patient treatment design. Although various computational methods and data have been used to develop drug response prediction models, it remains a challenging problem due to the complexities of cancer mechanisms and cancer-drug interactions. To better characterize the interaction between cancer and drugs, we investigate the feasibility of integrating computationally derived features of molecular mechanisms of action into prediction models. Specifically, we add docking scores of drug molecules and target proteins in combination with cancer gene expressions and molecular drug descriptors for building response models. The results demonstrate a marginal improvement in drug response prediction performance when adding docking scores as additional features, through tests on large drug screening data. We discuss the limitations of the current approach and provide the research community with a baseline dataset of the large-scale computational docking for anti-cancer drugs. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
3. ChemoGraph: Interactive Visual Exploration of the Chemical Space.
- Author
-
Kale, Bharat, Clyde, Austin, Sun, Maoyuan, Ramanathan, Arvind, Stevens, Rick, and Papka, Michael E.
- Subjects
- *
MACHINE learning , *SPACE exploration , *DRUG discovery , *ANALYTICAL chemistry , *DATABASES , *VISUAL analytics - Abstract
Exploratory analysis of the chemical space is an important task in the field of cheminformatics. For example, in drug discovery research, chemists investigate sets of thousands of chemical compounds in order to identify novel yet structurally similar synthetic compounds to replace natural products. Manually exploring the chemical space inhabited by all possible molecules and chemical compounds is impractical, and therefore presents a challenge. To fill this gap, we present ChemoGraph, a novel visual analytics technique for interactively exploring related chemicals. In ChemoGraph, we formalize a chemical space as a hypergraph and apply novel machine learning models to compute related chemical compounds. It uses a database to find related compounds from a known space and a machine learning model to generate new ones, which helps enlarge the known space. Moreover, ChemoGraph highlights interactive features that support users in viewing, comparing, and organizing computationally identified related chemicals. With a drug discovery usage scenario and initial expert feedback from a case study, we demonstrate the usefulness of ChemoGraph. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
4. AI-accelerated protein-ligand docking for SARS-CoV-2 is 100-fold faster with no significant change in detection.
- Author
-
Clyde, Austin, Liu, Xuefeng, Brettin, Thomas, Yoo, Hyunseung, Partin, Alexander, Babuji, Yadu, Blaiszik, Ben, Mohd-Yusof, Jamaludin, Merzky, Andre, Turilli, Matteo, Jha, Shantenu, Ramanathan, Arvind, and Stevens, Rick
- Subjects
- *
MOLECULAR docking , *CHEMICAL libraries , *MACHINE learning , *DRUG discovery , *SARS-CoV-2 - Abstract
Protein-ligand docking is a computational method for identifying drug leads. The method is capable of narrowing a vast library of compounds down to a tractable size for downstream simulation or experimental testing and is widely used in drug discovery. While there has been progress in accelerating scoring of compounds with artificial intelligence, few works have bridged these successes back to the virtual screening community in terms of utility and forward-looking development. We demonstrate the power of high-speed ML models by scoring 1 billion molecules in under a day (50 k predictions per GPU seconds). We showcase a workflow for docking utilizing surrogate AI-based models as a pre-filter to a standard docking workflow. Our workflow is ten times faster at screening a library of compounds than the standard technique, with an error rate less than 0.01% of detecting the underlying best scoring 0.1% of compounds. Our analysis of the speedup explains that another order of magnitude speedup must come from model accuracy rather than computing speed. In order to drive another order of magnitude of acceleration, we share a benchmark dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. We believe this is strong evidence for the community to begin focusing on improving the accuracy of surrogate models to improve the ability to screen massive compound libraries 100 × or even 1000 × faster than current techniques and reduce missing top hits. The technique outlined aims to be a fast drop-in replacement for docking for screening billion-scale molecular libraries. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
5. cross-study analysis of drug response prediction in cancer cell lines.
- Author
-
Xia, Fangfang, Allen, Jonathan, Balaprakash, Prasanna, Brettin, Thomas, Garcia-Cardona, Cristina, Clyde, Austin, Cohn, Judith, Doroshow, James, Duan, Xiaotian, Dubinkina, Veronika, Evrard, Yvonne, Fan, Ya Ju, Gans, Jason, He, Stewart, Lu, Pinyi, Maslov, Sergei, Partin, Alexander, Shukla, Maulik, Stahlberg, Eric, and Wozniak, Justin M
- Subjects
CELL lines ,GENERALIZABILITY theory ,MACHINE learning ,INDEPENDENT sets ,FORECASTING ,MEDICAL screening - Abstract
To enable personalized cancer treatment, machine learning models have been developed to predict drug response as a function of tumor and drug features. However, most algorithm development efforts have relied on cross-validation within a single study to assess model accuracy. While an essential first step, cross-validation within a biological data set typically provides an overly optimistic estimate of the prediction performance on independent test sets. To provide a more rigorous assessment of model generalizability between different studies, we use machine learning to analyze five publicly available cell line-based data sets: National Cancer Institute 60, ancer Therapeutics Response Portal (CTRP), Genomics of Drug Sensitivity in Cancer, Cancer Cell Line Encyclopedia and Genentech Cell Line Screening Initiative (gCSI). Based on observed experimental variability across studies, we explore estimates of prediction upper bounds. We report performance results of a variety of machine learning models, with a multitasking deep neural network achieving the best cross-study generalizability. By multiple measures, models trained on CTRP yield the most accurate predictions on the remaining testing data, and gCSI is the most predictable among the cell line data sets included in this study. With these experiments and further simulations on partial data, two lessons emerge: (1) differences in viability assays can limit model generalizability across studies and (2) drug diversity, more than tumor diversity, is crucial for raising model generalizability in preclinical screening. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
6. Pandemic drugs at pandemic speed: infrastructure for accelerating COVID-19 drug discovery with hybrid machine learning- and physics-based simulations on high-performance computers
- Author
-
Nicola Scafuri, Andre Merzky, Dieter Kranzlmüller, Shantenu Jha, Mikhail Titov, Thomas Brettin, Heng Ma, Li Tan, Sauro Succi, Xiaotian Duan, D. Wifling, Mathis Bode, Anda Trifan, G. Mathias, Austin Clyde, Y. Donon, Alexander Partin, Fangfang Xia, Matteo Turilli, Peter V. Coveney, Sofia Vallecorsa, Agastya P. Bhati, Shunzhou Wan, Dario Alfè, A Di Meglio, Arvind Ramanathan, Walter Rocchia, Roger Highfield, Rick Stevens, Bhati, Agastya P., Wan, Shunzhou, Alfè, Dario, Clyde, Austin R., Bode, Mathi, Tan, Li, Titov, Mikhail, Merzky, Andre, Turilli, Matteo, Jha, Shantenu, Highfield, Roger R., Rocchia, Walter, Scafuri, Nicola, Succi, Sauro, Kranzlmüller, Dieter, Mathias, Gerald, Wifling, David, Donon, Yann, Di Meglio, Alberto, Vallecorsa, Sofia, Heng, Ma, Trifan, Anda, Ramanathan, Arvind, Brettin, Tom, Partin, Alexander, Xia, Fangfang, Duan, Xiaotan, Stevens, Rick, and Coveney, Peter V.
- Subjects
FOS: Computer and information sciences ,2019-20 coronavirus outbreak ,Computer Science - Machine Learning ,Coronavirus disease 2019 (COVID-19) ,Computer science ,Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) ,Biomedical Engineering ,Biophysics ,FOS: Physical sciences ,Bioengineering ,01 natural sciences ,Biochemistry ,Quantitative Biology - Quantitative Methods ,Bottleneck ,Machine Learning (cs.LG) ,Biomaterials ,Computational Engineering, Finance, and Science (cs.CE) ,03 medical and health sciences ,novel drug design ,0103 physical sciences ,Pandemic ,Physics - Biological Physics ,Computer Science - Computational Engineering, Finance, and Science ,Quantitative Methods (q-bio.QM) ,Research Articles ,030304 developmental biology ,0303 health sciences ,Hybrid machine ,010304 chemical physics ,Drug discovery ,Articles ,Physics based ,artificial intelligence ,Data science ,molecular dynamics ,machine learning ,Computer Science - Distributed, Parallel, and Cluster Computing ,Biological Physics (physics.bio-ph) ,FOS: Biological sciences ,Distributed, Parallel, and Cluster Computing (cs.DC) ,free energy predictions ,Biotechnology - Abstract
The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods, in this case, developed for linear accelerators, and physics-based methods. The twoin silicomethods, each have their own advantages and limitations which, interestingly, complement each other. Here, we present an innovative infrastructural development that combines both approaches to accelerate drug discovery. The scale of the potential resulting workflow is such that it is dependent on supercomputing to achieve extremely high throughput. We have demonstrated the viability of this workflow for the study of inhibitors for four COVID-19 target proteins and our ability to perform the required large-scale calculations to identify lead antiviral compounds through repurposing on a variety of supercomputers.
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.