5 results on '"Staffan Arvidsson McShane"'
Search Results
2. Disease phenotype prediction in multiple sclerosis
- Author
-
Stephanie Herman, Staffan Arvidsson McShane, Christina Zjukovskaja, Payam Emami Khoonsari, Anders Svenningsson, Joachim Burman, Ola Spjuth, and Kim Kultima
- Subjects
Multidisciplinary - Published
- 2023
- Full Text
- View/download PDF
3. Machine Learning Strategies When Transitioning between Biological Assays
- Author
-
Ola Spjuth, Staffan Arvidsson McShane, Ernst Ahlberg, and Tobias Noeske
- Subjects
Computer science ,Process (engineering) ,General Chemical Engineering ,Molecular Conformation ,Calibration set ,Pharmacology and Toxicology ,Library and Information Sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Article ,Machine Learning ,0103 physical sciences ,Retrospective Studies ,Training set ,010304 chemical physics ,business.industry ,General Chemistry ,Farmakologi och toxikologi ,Regression ,0104 chemical sciences ,Computer Science Applications ,010404 medicinal & biomolecular chemistry ,Drug development ,Biological Assay ,Artificial intelligence ,business ,computer ,Predictive modelling - Abstract
Machine learning is widely used in drug development to predict activity in biological assays based on chemical structure. However, the process of transitioning from one experimental setup to another for the same biological endpoint has not been extensively studied. In a retrospective study, we here explore different modeling strategies of how to combine data from the old and new assays when training conformal prediction models using data from hERG and Na-v assays. We suggest to continuously monitor the validity and efficiency of models as more data is accumulated from the new assay and select a modeling strategy based on these metrics. In order to maximize the utility of data from the old assay, we propose a strategy that augments the proper training set of an inductive conformal predictor by adding data from the old assay but only having data from the new assay in the calibration set, which results in valid (well-calibrated) models with improved efficiency compared to other strategies. We study the results for varying sizes of new and old assays, allowing for discussion of different practical scenarios. We also conclude that our proposed assay transition strategy is more beneficial, and the value of data from the new assay is higher, for the harder case of regression compared to classification problems.
- Published
- 2021
4. Assessing the calibration in toxicological in vitro models with conformal prediction
- Author
-
Andrea Morger, Fredrik Svensson, Staffan Arvidsson McShane, Niharika Gauraha, Ulf Norinder, Ola Spjuth, and Andrea Volkamer
- Subjects
Computer Sciences ,Information technology ,T58.5-58.64 ,Calibration plots ,Tox21 datasets ,Chemistry ,Datavetenskap (datalogi) ,Data drifts ,Applicability domain ,Conformal prediction ,QD1-999 ,Toxicity prediction ,600 Technik, Medizin, angewandte Wissenschaften::610 Medizin und Gesundheit::610 Medizin und Gesundheit ,Research Article - Abstract
Machine learning methods are widely used in drug discovery and toxicity prediction. While showing overall good performance in cross-validation studies, their predictive power (often) drops in cases where the query samples have drifted from the training data’s descriptor space. Thus, the assumption for applying machine learning algorithms, that training and test data stem from the same distribution, might not always be fulfilled. In this work, conformal prediction is used to assess the calibration of the models. Deviations from the expected error may indicate that training and test data originate from different distributions. Exemplified on the Tox21 datasets, composed of chronologically released Tox21Train, Tox21Test and Tox21Score subsets, we observed that while internally valid models could be trained using cross-validation on Tox21Train, predictions on the external Tox21Score data resulted in higher error rates than expected. To improve the prediction on the external sets, a strategy exchanging the calibration set with more recent data, such as Tox21Test, has successfully been introduced. We conclude that conformal prediction can be used to diagnose data drifts and other issues related to model calibration. The proposed improvement strategy—exchanging the calibration data only—is convenient as it does not require retraining of the underlying model. Supplementary Information The online version contains supplementary material available at 10.1186/s13321-021-00511-5.
- Published
- 2021
5. Predicting With Confidence: Using Conformal Prediction in Drug Discovery
- Author
-
Staffan Arvidsson McShane, Ola Spjuth, Ulf Norinder, and Jonathan Alvarsson
- Subjects
Quantitative structure–activity relationship ,Computer science ,Molecular Conformation ,Pharmaceutical Science ,Quantitative Structure-Activity Relationship ,Conformal map ,Confidence ,02 engineering and technology ,Machine learning ,computer.software_genre ,030226 pharmacology & pharmacy ,Upper and lower bounds ,Set (abstract data type) ,Machine Learning ,03 medical and health sciences ,0302 clinical medicine ,Applicability domain ,Drug Discovery ,Bioinformatics (Computational Biology) ,business.industry ,QSAR ,Prediction interval ,Reproducibility of Results ,Function (mathematics) ,021001 nanoscience & nanotechnology ,Regression ,Predictive modeling ,Bioinformatik (beräkningsbiologi) ,Artificial intelligence ,Conformal prediction ,0210 nano-technology ,business ,computer ,Algorithms - Abstract
One of the challenges with predictive modeling is how to quantify the reliability of the models' predictions on new objects. In this work we give an introduction to conformal prediction, a framework that sits on top of traditional machine learning algorithms and which outputs valid confidence estimates to predictions from QSAR models in the form of prediction intervals that are specific to each predicted object. For regression, a prediction interval consists of an upper and a lower bound. For classification, a prediction interval is a set that contains none, one, or many of the potential classes. The size of the prediction interval is affected by a user-specified confidence/significance level, and by the nonconformity of the predicted object; i.e., the strangeness as defined by a nonconformity function. Conformal prediction provides a rigorous and mathematically proven framework for in silico modeling with guarantees on error rates as well as a consistent handling of the models' applicability domain intrinsically linked to the underlying machine learning model. Apart from introducing the concepts and types of conformal prediction, we also provide an example application for modeling ABC transporters using conformal prediction, as well as a discussion on general implications for drug discovery.
- Published
- 2020
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.