153 results on '"Spjuth O"'
Search Results
2. New approach methods to assess developmental and adult neurotoxicity for regulatory use: a PARC work package 5 project
- Author
-
Tal, Tamara, Myhre, O., Fritsche, E., Rüegg, J., Craenen, K., Aiello-Holden, K., Agrillo, C., Babin, P.J., Escher, Beate, Dirven, H., Hellsten, K., Dolva, K., Hessel, E., Heusinkveld, H.J., Hadzhiev, Y., Hurem, S., Jagiello, K., Judzinska, B., Klüver, Nils, Knoll-Gellida, A., Kühne, B.A., Leist, M., Lislien, M., Lyche, J.L., Müller, F., Colbourne, J.K., Neuhaus, W., Pallocca, G., Seeger, B., Scharkin, I., Scholz, Stefan, Spjuth, O., Torres-Ruiz, M., Bartmann, K., Tal, Tamara, Myhre, O., Fritsche, E., Rüegg, J., Craenen, K., Aiello-Holden, K., Agrillo, C., Babin, P.J., Escher, Beate, Dirven, H., Hellsten, K., Dolva, K., Hessel, E., Heusinkveld, H.J., Hadzhiev, Y., Hurem, S., Jagiello, K., Judzinska, B., Klüver, Nils, Knoll-Gellida, A., Kühne, B.A., Leist, M., Lislien, M., Lyche, J.L., Müller, F., Colbourne, J.K., Neuhaus, W., Pallocca, G., Seeger, B., Scharkin, I., Scholz, Stefan, Spjuth, O., Torres-Ruiz, M., and Bartmann, K.
- Abstract
In the European regulatory context, rodent in vivo studies are the predominant source of neurotoxicity information. Although they form a cornerstone of neurotoxicological assessments, they are costly and the topic of ethical debate. While the public expects chemicals and products to be safe for the developing and mature nervous systems, considerable numbers of chemicals in commerce have not, or only to a limited extent, been assessed for their potential to cause neurotoxicity. As such, there is a societal push toward the replacement of animal models with in vitro or alternative methods. New approach methods (NAMs) can contribute to the regulatory knowledge base, increase chemical safety, and modernize chemical hazard and risk assessment. Provided they reach an acceptable level of regulatory relevance and reliability, NAMs may be considered as replacements for specific in vivo studies. The European Partnership for the Assessment of Risks from Chemicals (PARC) addresses challenges to the development and implementation of NAMs in chemical risk assessment. In collaboration with regulatory agencies, Project 5.2.1e (Neurotoxicity) aims to develop and evaluate NAMs for developmental neurotoxicity (DNT) and adult neurotoxicity (ANT) and to understand the applicability domain of specific NAMs for the detection of endocrine disruption and epigenetic perturbation. To speed up assay time and reduce costs, we identify early indicators of later-onset effects. Ultimately, we will assemble second-generation developmental neurotoxicity and first-generation adult neurotoxicity test batteries, both of which aim to provide regulatory hazard and risk assessors and industry stakeholders with robust, speedy, lower-cost, and informative next-generation hazard and risk assessment tools
- Published
- 2024
3. Development of new approach methods for the identification and characterization of endocrine metabolic disruptors—a PARC project
- Author
-
Braeuning, A., Balaguer, P., Bourguet, W., Carreras-Puigvert, J., Feiertag, K., Kamstra, J.H., Knapen, D., Lichtenstein, D., Marx-Stoelting, P., Rietdijk, J., Schubert, Kristin, Spjuth, O., Stinckens, E., Thedieck, K., van den Boom, R., Vergauwen, L., von Bergen, Martin, Wewer, N., Zalko, D., Braeuning, A., Balaguer, P., Bourguet, W., Carreras-Puigvert, J., Feiertag, K., Kamstra, J.H., Knapen, D., Lichtenstein, D., Marx-Stoelting, P., Rietdijk, J., Schubert, Kristin, Spjuth, O., Stinckens, E., Thedieck, K., van den Boom, R., Vergauwen, L., von Bergen, Martin, Wewer, N., and Zalko, D.
- Abstract
In past times, the analysis of endocrine disrupting properties of chemicals has mainly been focused on (anti-)estrogenic or (anti-)androgenic properties, as well as on aspects of steroidogenesis and the modulation of thyroid signaling. More recently, disruption of energy metabolism and related signaling pathways by exogenous substances, so-called metabolism-disrupting chemicals (MDCs) have come into focus. While general effects such as body and organ weight changes are routinely monitored in animal studies, there is a clear lack of mechanistic test systems to determine and characterize the metabolism-disrupting potential of chemicals. In order to contribute to filling this gap, one of the project within EU-funded Partnership for the Assessment of Risks of Chemicals (PARC) aims at developing novel in vitro methods for the detection of endocrine metabolic disruptors. Efforts will comprise projects related to specific signaling pathways, for example, involving mTOR or xenobiotic-sensing nuclear receptors, studies on hepatocytes, adipocytes and pancreatic beta cells covering metabolic and morphological endpoints, as well as metabolism-related zebrafish-based tests as an alternative to classic rodent bioassays. This paper provides an overview of the approaches and methods of these PARC projects and how this will contribute to the improvement of the toxicological toolbox to identify substances with endocrine disrupting properties and to decipher their mechanisms of action.
- Published
- 2023
4. An Open-Source Modular Framework for Automated Pipetting and Imaging Applications
- Author
-
Ouyang, Wei, Bowman, R. W., Wang, H., Bumke, K. E., Collins, J. T., Spjuth, O., Carreras-Puigvert, J., Diederich, B., Ouyang, Wei, Bowman, R. W., Wang, H., Bumke, K. E., Collins, J. T., Spjuth, O., Carreras-Puigvert, J., and Diederich, B.
- Abstract
The number of samples in biological experiments is continuously increasing, but complex protocols and human error in many cases lead to suboptimal data quality and hence difficulties in reproducing scientific findings. Laboratory automation can alleviate many of these problems by precisely reproducing machine-readable protocols. These instruments generally require high up-front investments, and due to the lack of open application programming interfaces (APIs), they are notoriously difficult for scientists to customize and control outside of the vendor-supplied software. Here, automated, high-throughput experiments are demonstrated for interdisciplinary research in life science that can be replicated on a modest budget, using open tools to ensure reproducibility by combining the tools OpenFlexure, Opentrons, ImJoy, and UC2. This automated sample preparation and imaging pipeline can easily be replicated and established in many laboratories as well as in educational contexts through easy-to-understand algorithms and easy-to-build microscopes. Additionally, the creation of feedback loops, with later pipetting or imaging steps depending on the analysis of previously acquired images, enables the realization of fully autonomous “smart” microscopy experiments. All documents and source files are publicly available to prove the concept of smart lab automation using inexpensive, open tools. It is believed this democratizes access to the power and repeatability of automated experiments., QC 20220517
- Published
- 2022
- Full Text
- View/download PDF
5. Cell morphology descriptors and gene ontology profiles improve prediction for mitochondrial toxicity
- Author
-
Seal, S., primary, Trapotsi, M.A., additional, Puigvert, J.C., additional, Yang, H., additional, Spjuth, O., additional, and Bender, A, additional
- Published
- 2021
- Full Text
- View/download PDF
6. Assessing the calibration in toxicological in vitro models with conformal prediction
- Author
-
Morger, A., Svensson, F., Arvidsson McShane, S., Gauraha, Niharika, Norinder, U., Spjuth, O., Volkamer, A., Morger, A., Svensson, F., Arvidsson McShane, S., Gauraha, Niharika, Norinder, U., Spjuth, O., and Volkamer, A.
- Abstract
Machine learning methods are widely used in drug discovery and toxicity prediction. While showing overall good performance in cross-validation studies, their predictive power (often) drops in cases where the query samples have drifted from the training data’s descriptor space. Thus, the assumption for applying machine learning algorithms, that training and test data stem from the same distribution, might not always be fulfilled. In this work, conformal prediction is used to assess the calibration of the models. Deviations from the expected error may indicate that training and test data originate from different distributions. Exemplified on the Tox21 datasets, composed of chronologically released Tox21Train, Tox21Test and Tox21Score subsets, we observed that while internally valid models could be trained using cross-validation on Tox21Train, predictions on the external Tox21Score data resulted in higher error rates than expected. To improve the prediction on the external sets, a strategy exchanging the calibration set with more recent data, such as Tox21Test, has successfully been introduced. We conclude that conformal prediction can be used to diagnose data drifts and other issues related to model calibration. The proposed improvement strategy—exchanging the calibration data only—is convenient as it does not require retraining of the underlying model., QC 20220301
- Published
- 2021
- Full Text
- View/download PDF
7. ELIXIR and Toxicology: a community in development [version 2; peer review: 2 approved]
- Author
-
Martens, M., Stierum, R., Schymanski, E.L., Evelo, C.T., Aalizadeh, R., Aladjov, H., Arturi, K., Audouze, K., Babica, P., Berka, K., Bessems, J., Blaha, L., Bolton, E.E., Cases, M., Damalas, D.E., Dave, K., Dilger, M., Exner, T., Geerke, D.P., Grafström, R., Gray, A., Hancock, J.M., Hollert, H., Jeliazkova, N., Jennen, D., Jourdan, F., Kahlem, P., Klanova, J., Kleinjans, J., Kondic, T., Kone, B., Lynch, I., Maran, U., Martinez Cuesta, S., Ménager, H., Neumann, S., Nymark, P., Oberacher, H., Ramirez, N., Remy, S., Rocca-Serra, P., Salek, R.M., Sallach, B., Sansone, S.-A., Sanz, F., Sarimveis, H., Sarntivijai, S., Schulze, Tobias, Slobodnik, J., Spjuth, O., Tedds, J., Thomaidis, N., Weber, R.J.M., van Westen, G.J.P., Wheelock, C.E., Williams, A.J., Witters, H., Zdrazil, B., Županič, A., Willighagen, E.L., Martens, M., Stierum, R., Schymanski, E.L., Evelo, C.T., Aalizadeh, R., Aladjov, H., Arturi, K., Audouze, K., Babica, P., Berka, K., Bessems, J., Blaha, L., Bolton, E.E., Cases, M., Damalas, D.E., Dave, K., Dilger, M., Exner, T., Geerke, D.P., Grafström, R., Gray, A., Hancock, J.M., Hollert, H., Jeliazkova, N., Jennen, D., Jourdan, F., Kahlem, P., Klanova, J., Kleinjans, J., Kondic, T., Kone, B., Lynch, I., Maran, U., Martinez Cuesta, S., Ménager, H., Neumann, S., Nymark, P., Oberacher, H., Ramirez, N., Remy, S., Rocca-Serra, P., Salek, R.M., Sallach, B., Sansone, S.-A., Sanz, F., Sarimveis, H., Sarntivijai, S., Schulze, Tobias, Slobodnik, J., Spjuth, O., Tedds, J., Thomaidis, N., Weber, R.J.M., van Westen, G.J.P., Wheelock, C.E., Williams, A.J., Witters, H., Zdrazil, B., Županič, A., and Willighagen, E.L.
- Abstract
Toxicology has been an active research field for many decades, with academic, industrial and government involvement. Modern omics and computational approaches are changing the field, from merely disease-specific observational models into target-specific predictive models. Traditionally, toxicology has strong links with other fields such as biology, chemistry, pharmacology and medicine. With the rise of synthetic and new engineered materials, alongside ongoing prioritisation needs in chemical risk assessment for existing chemicals, early predictive evaluations are becoming of utmost importance to both scientific and regulatory purposes. ELIXIR is an intergovernmental organisation that brings together life science resources from across Europe. To coordinate the linkage of various life science efforts around modern predictive toxicology, the establishment of a new ELIXIR Community is seen as instrumental. In the past few years, joint efforts, building on incidental overlap, have been piloted in the context of ELIXIR. For example, the EU-ToxRisk, diXa, HeCaToS, transQST, and the nanotoxicology community have worked with the ELIXIR TeSS, Bioschemas, and Compute Platforms and activities. In 2018, a core group of interested parties wrote a proposal, outlining a sketch of what this new ELIXIR Toxicology Community would look like. A recent workshop (held September 30th to October 1st, 2020) extended this into an ELIXIR Toxicology roadmap and a shortlist of limited investment-high gain collaborations to give body to this new community. This Whitepaper outlines the results of these efforts and defines our vision of the ELIXIR Toxicology Community and how it complements other ELIXIR activities.
- Published
- 2021
8. Metabolomics - the stethoscope for the 21st century
- Author
-
Ashrafian, H, Sounderajah, V, Glen, R, Ebbels, T, Blaise, BJ, Kalra, D, Kultima, K, Spjuth, O, Tenori, L, Salek, R, Kale, N, Haug, K, Schober, D, Rocca-Serra, P, O'Donovan, C, Steinbeck, C, Cano, I, De Atauri, P, Cascante, M, and National Institute of Health Research
- Subjects
Precision medicine ,Metabolomics ,Personalised medicine ,Microbiology ,1117 Public Health and Health Services - Abstract
Metabolomics offers systematic identification and quantification of all metabolic products from the human body. This field could provide clinicians with new sets of diagnostic biomarkers for disease states in addition to quantifying treatment response to medications at an individualised level. This literature review aims to highlight the technology underpinning metabolic profiling, identify potential applications of metabolomics in clinical practice and discuss the translational challenges that the field faces. We searched PubMed, Medline and Embase for primary and secondary research articles regarding clinical applications of metabolomics. Metabolic profiling can be performed using mass spectrometry and NMR based techniques using a variety of biological samples. This is carried out in vivo or in vitro following careful sample collection, preparation and analysis. The potential clinical applications constitute disruptive innovations in their respective specialities, particularly oncology and metabolic medicine. Outstanding issues currently preventing widespread clinical use centre around scalability of data interpretation, standardisation of sample handling practice and e-infrastructure. Routine utilisation of metabolomics at a patient and population level will constitute an integral part of future healthcare provision.
- Published
- 2020
9. OpenRiskNet : an open E-infrastructure to support data sharing, knowledge integration, and in silico analysis and modeling in risk assessment
- Author
-
Exner, T., Dokler, J., Bachler, D., Kramer, S., Notredam, C., Jennen, D., Gkoutos, G., Sarimveis, H., Jacobs, M., Spjuth, O., Jennings, P., Dudgeon, T., Hardy, B., Farcal, Lucian Romeo, Bois, Frédéric Y., and Civs, Gestionnaire
- Subjects
[SDV.TOX] Life Sciences [q-bio]/Toxicology - Published
- 2018
10. The future of metabolomics in ELIXIR
- Author
-
van Rijswijk, M, Beirnaert, C, Caron, C, Cascante, M, Dominguez, V, Dunn, WB, Ebbels, TMD, Giacomoni, F, Gonzalez-Beltran, A, Hankemeier, T, Haug, K, Izquierdo-Garcia, JL, Jimenez, RC, Jourdan, F, Kale, N, Klapa, MI, Kohlbacher, O, Koort, K, Kultima, K, Le Corguillé, G, Moreno, P, Moschonas, NK, Neumann, S, O'Donovan, C, Reczko, M, Rocca-Serra, P, Rosato, A, Salek, RM, Sansone, S-A, Satagopam, V, Schober, D, Shimmo, R, Spicer, RA, Spjuth, O, Thévenot, EA, Viant, MR, Weber, RJM, Willighagen, EL, Zanetti, G, Steinbeck, C, Dutch Techcentre for Life Sciences [Utrecht], Netherlands Metabolomics Centre, Department of Mathematics and Computer Science, ADReM, University of Antwerp (UA), French Institute of Bioinformatics (ELIXIR-FR), Department of Biochemistry and Molecular Biomedicine, Faculty of Science and Technology, Biochemistry and Molecular Biology, University of Barcelona, Birmingham Metabolomics Training Centre, University of Birmingham, Department of Surgery and Cancer, Imperial College London, Unité de Nutrition Humaine (UNH), Institut National de la Recherche Agronomique (INRA)-Université Clermont Auvergne [2017-2020] (UCA [2017-2020]), MetaboHUB, Oxford e-Research Centre, University of Oxford, Leiden Academic Centre for Drug Research (LACDR), Universiteit Leiden, European Bioinformatics Institute [Hinxton] (EMBL-EBI), EMBL Heidelberg, Centro Nacional de Investigaciones Cardiovasculares, Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES), ELIXIR Hub [Cambridge], Métabolisme et Xénobiotiques (ToxAlim-MeX), ToxAlim (ToxAlim), Institut National de la Recherche Agronomique (INRA)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université de Toulouse (UT)-Ecole Nationale Vétérinaire de Toulouse (ENVT), Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National Polytechnique (Toulouse) (Toulouse INP), Université de Toulouse (UT)-Ecole d'Ingénieurs de Purpan (INP - PURPAN), Université de Toulouse (UT)-Université de Toulouse (UT)-Institut National de la Recherche Agronomique (INRA)-Université Toulouse III - Paul Sabatier (UT3), Université de Toulouse (UT)-Université de Toulouse (UT), Forth ICE-HT, Institute of Chemical Engineering Sciences, Foundation for Research and Technology - Hellas (FORTH), Max Planck Institute for Developmental Biology, Max-Planck-Gesellschaft, Department of Computer Science, Duke University [Durham], Center for Bioinformatics, University of Tübingen, The Centre of Excellence in Neural and Behavioural Sciences, Tallinn University, School of Natural Sciences and Health, Department of Medical Sciences (University of Miyasaki), University of Miyasaki, ABiMS - Informatique et bioinformatique = Analysis and Bioinformatics for Marine Science (ABIMS), Fédération de recherche de Roscoff (FR2424), Station biologique de Roscoff (SBR), Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Station biologique de Roscoff (SBR), Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS)-Sorbonne Université (SU)-Centre National de la Recherche Scientifique (CNRS), Université Pierre et Marie Curie - Paris 6 (UPMC), Centre National de la Recherche Scientifique (CNRS), Department of General Biology, Universidade Federal de Minas Gerais [Belo Horizonte] (UFMG), University of Patras, Department of Stress and Developmental Biology, Leibniz Institute of Plant Biochemistry (IPB), BSRC 'Alexander Fleming', Magnetic Resonance Center/Interuniversity Consortium of Magnetic Resonance Metalloproteins, Università degli Studi di Firenze = University of Florence (UniFI), Luxembourg Centre For Systems Biomedicine (LCSB), University of Luxembourg [Luxembourg], Université du Luxembourg (Uni.lu), Department of Pharmaceutical Biosciences, Uppsala University, Laboratoire Sciences des Données et de la Décision (LS2D), Département Métrologie Instrumentation & Information (DM2I), Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA)), Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Laboratoire d'Intégration des Systèmes et des Technologies (LIST (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay, Department of Bioinformatics, BiGCaT, Maastricht University [Maastricht], NUTRIM, Data-Intensive Computing, CRS4 Bioinformat, Friedrich-Schiller-Universität = Friedrich Schiller University Jena [Jena, Germany], ANR-11-INBS-0010,METABOHUB,Développement d'une infrastructure française distribuée pour la métabolomique dédiée à l'innovation(2011), European Project: 654241,H2020,H2020-EINFRA-2014-2,PhenoMeNal(2015), European Commission, Unité de Nutrition Humaine - Clermont Auvergne (UNH), Institut National de la Recherche Agronomique (INRA)-Université Clermont Auvergne (UCA), Leiden University, Université Toulouse III - Paul Sabatier (UT3), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National Polytechnique (Toulouse) (Toulouse INP), Université Fédérale Toulouse Midi-Pyrénées-Ecole Nationale Vétérinaire de Toulouse (ENVT), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Ecole d'Ingénieurs de Purpan (INPT - EI Purpan), Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Recherche Agronomique (INRA), ABiMS - Informatique et bioinformatique = Analysis and Bioinformatics for Marine Science (FR2424), Université Pierre et Marie Curie (Paris 6), Federal University of Minas Gerais Belo Horizonte, University of Patras [Patras], University of Florence (UNIFI), Laboratoire d'analyse des données et d'intelligence des systèmes (LADIS), Laboratoire d'Intégration des Systèmes et des Technologies (LIST), Université Paris-Saclay-Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Direction de Recherche Technologique (CEA) (DRT (CEA)), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Laboratoire d'Intégration des Systèmes et des Technologies (LIST), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA), Maastricht University, Friedrich-Schiller-Universität Jena, ANR-11-INBS-0010/11-INBS-0010,METABOHUB,Développement d’une infrastructure française distribuée pour la métabolomique dédiée à l’innovation(2011), Commission of the European Communities, European Molecular Biology Laboratory, Apollo-University Of Cambridge Repository, van Rijswijk, Merlijn [0000-0002-1067-7766], Beirnaert, Charlie [0000-0003-3007-2569], Gonzalez-Beltran, Alejandra [0000-0003-3499-8262], Haug, Kenneth [0000-0003-3168-4145], Jimenez, Rafael C [0000-0001-5404-7670], Kale, Namrata [0000-0002-4255-8104], Klapa, Maria I [0000-0002-2047-3185], Moschonas, Nicholas K [0000-0002-2556-537X], Rosato, Antonio [0000-0001-6172-0368], Salek, Reza M [0000-0001-8604-1732], Sansone, Susanna-Assunta [0000-0001-5306-5690], Schober, Daniel [0000-0001-8014-6648], Spicer, Rachel A [0000-0002-2807-8796], Spjuth, Ola [0000-0002-8083-2864], Thévenot, Etienne A [0000-0003-1019-4577], Willighagen, Egon L [0000-0001-7542-0286], Steinbeck, Christoph [0000-0001-6966-0814], Apollo - University of Cambridge Repository, RS: NUTRIM - R1 - Obesity, diabetes and cardiovascular health, Bioinformatica, RS: NUTRIM - R4 - Gene-environment interaction, University of Oxford [Oxford], Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Recherche Agronomique (INRA)-Université Toulouse III - Paul Sabatier (UT3), Università degli Studi di Firenze = University of Florence [Firenze] (UNIFI), Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Commissariat à l'énergie atomique et aux énergies alternatives (CEA)-Université Paris-Saclay-Laboratoire d'Intégration des Systèmes et des Technologies (LIST), Università degli Studi di Firenze = University of Florence [Firenze], Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Ecole Nationale Vétérinaire de Toulouse (ENVT), Université Fédérale Toulouse Midi-Pyrénées-Ecole d'Ingénieurs de Purpan (INPT - EI Purpan), and Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées
- Subjects
0301 basic medicine ,databases ,Data management ,computational workflows ,infrastructure en ligne ,Cloud computing ,Computational biology ,Biology ,General Biochemistry, Genetics and Molecular Biology ,Chemical Biology of the Cell ,analyse métabolomique ,03 medical and health sciences ,statistical analysis ,[CHIM.ANAL]Chemical Sciences/Analytical chemistry ,[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,[INFO.INFO-DL]Computer Science [cs]/Digital Libraries [cs.DL] ,Use case ,General Pharmacology, Toxicology and Pharmaceutics ,signal processing ,computer.programming_language ,bioinformatics infrastructure ,training ,030102 biochemistry & molecular biology ,General Immunology and Microbiology ,metabolomics, bioinformatics, distributed computing, cloud ,business.industry ,cloud computing ,pipeline ,Articles ,General Medicine ,Opinion Article ,Data science ,metabolomics ,Open data ,Identification (information) ,030104 developmental biology ,Workflow ,multi-omics approaches ,Elixir (programming language) ,[INFO.INFO-BI]Computer Science [cs]/Bioinformatics [q-bio.QM] ,business ,data standards ,computer ,Omics technologies - Abstract
We are grateful to the proteomics community for sharing their experiences from their meeting “The future of proteomics in ELIXIR” on March 1–2 2017 in Tübingen, allowing us to build on this and organise our workshop as a one-day event.The meeting was funded by PhenoMeNal, European Commission's Horizon2020 programme, grant agreement number 654241; Metabolomics, the youngest of the major omics technologies, is supported by an active community of researchers and infrastructure developers across Europe. To coordinate and focus efforts around infrastructure building for metabolomics within Europe, a workshop on the "Future of metabolomics in ELIXIR" was organised at Frankfurt Airport in Germany. This one-day strategic workshop involved representatives of ELIXIR Nodes, members of the PhenoMeNal consortium developing an e-infrastructure that supports workflow-based metabolomics analysis pipelines, and experts from the international metabolomics community. The workshop established metabolite identification as the critical area, where a maximal impact of computational metabolomics and data management on other fields could be achieved. In particular, the existing four ELIXIR Use Cases, where the metabolomics community - both industry and academia - would benefit most, and which could be exhaustively mapped onto the current five ELIXIR Platforms were discussed. This opinion article is a call for support for a new ELIXIR metabolomics Use Case, which aligns with and complements the existing and planned ELIXIR Platforms and Use Cases
- Published
- 2017
- Full Text
- View/download PDF
11. Exploring the usefulness of morphological profiling of cells to study toxicity mechanisms
- Author
-
Georgieva, P., primary, Schaal, W., additional, and Spjuth, O., additional
- Published
- 2018
- Full Text
- View/download PDF
12. OpenRiskNet, an open e-infrastructure to support data sharing, knowledge integration and in silico analysis and modelling in risk assessment
- Author
-
Exner, T.E., primary, Dokler, J., additional, Bachler, D., additional, Farcal, L.R., additional, Evelo, C.T., additional, Willighagen, E., additional, Jennen, D.G.J., additional, Jabocs, M., additional, Doganis, P., additional, Sarimveis, H., additional, Lynch, I., additional, Gkoutos, G., additional, Kramer, S., additional, Notredame, C., additional, Spjuth, O., additional, Jennings, P., additional, Dudgeon, T., additional, Bois, F., additional, and Hardy, B., additional
- Published
- 2018
- Full Text
- View/download PDF
13. Bioclipse-R: integrating management and visualization of life science data with statistical analysis
- Author
-
Spjuth, O., Spjuth, O., Georgiev, V., Carlsson, L., Alvarsson, J., Berg, A., Willighagen, E., Wikberg, J. E., Eklund, M., Spjuth, O., Spjuth, O., Georgiev, V., Carlsson, L., Alvarsson, J., Berg, A., Willighagen, E., Wikberg, J. E., and Eklund, M.
- Abstract
SUMMARY: Bioclipse, a graphical workbench for the life sciences, provides functionality for managing and visualizing life science data. We introduce Bioclipse-R, which integrates Bioclipse and the statistical programming language R. The synergy between Bioclipse and R is demonstrated by the construction of a decision support system for anticancer drug screening and mutagenicity prediction, which shows how Bioclipse-R can be used to perform complex tasks from within a single software system. Availability and implementation: Bioclipse-R is implemented as a set of Java plug-ins for Bioclipse based on the R-package rj. Source code and binary packages are available from https://github.com/bioclipse and http://www.bioclipse.net/bioclipse-r, respectively. CONTACT: martin.eklund@farmbio.uu.se SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
- Published
- 2013
14. Harmonising and linking biomedical and clinical data across disparate data archives to enable integrative cross-biobank research
- Author
-
Spjuth, O, Krestyaninova, M, Hastings, J, Shen, HY, Heikkinen, J, Waldenberger, M, Langhammer, A, Ladenvall, C, Esko, T, Persson, M A, Heggland, J, Dietrich, J, Ose, S, Gieger, C, Ried, JS, Peters, A, Fortier, I, de Geus, EJC, Klovins, J, Zaharenko, L, Willemsen, G, Hottenga, JJ, Litton, JE, Karvanen, J, Boomsma, DI, Groop, L, Rung, J, Palmgren, J, Pedersen, NL, McCarthy, MI, Duijn, Cornelia, Hveem, K, Metspalu, A, Ripatti, S, Prokopenko, I, Harris, JR, Spjuth, O, Krestyaninova, M, Hastings, J, Shen, HY, Heikkinen, J, Waldenberger, M, Langhammer, A, Ladenvall, C, Esko, T, Persson, M A, Heggland, J, Dietrich, J, Ose, S, Gieger, C, Ried, JS, Peters, A, Fortier, I, de Geus, EJC, Klovins, J, Zaharenko, L, Willemsen, G, Hottenga, JJ, Litton, JE, Karvanen, J, Boomsma, DI, Groop, L, Rung, J, Palmgren, J, Pedersen, NL, McCarthy, MI, Duijn, Cornelia, Hveem, K, Metspalu, A, Ripatti, S, Prokopenko, I, and Harris, JR
- Published
- 2016
15. Toward the Replacement of Animal Experiments through the Bioinformatics-driven Analysis of 'Omics' Data from Human Cell Cultures.
- Author
-
Grafstrom, R. C., Grafstrom, R. C., Nymark, P., Hongisto, V., Spjuth, O., Ceder, R., Willighagen, E., Hardy, B., Kaski, S., Kohonen, P., Grafstrom, R. C., Grafstrom, R. C., Nymark, P., Hongisto, V., Spjuth, O., Ceder, R., Willighagen, E., Hardy, B., Kaski, S., and Kohonen, P.
- Abstract
This paper outlines the work for which Roland Grafstrom and Pekka Kohonen were awarded the 2014 Lush Science Prize. The research activities of the Grafstrom laboratory have, for many years, covered cancer biology studies, as well as the development and application of toxicity-predictive in vitro models to determine chemical safety. Through the integration of in silico analyses of diverse types of genomics data (transcriptomic and proteomic), their efforts have proved to fit well into the recently-developed Adverse Outcome Pathway paradigm. Genomics analysis within state-of-the-art cancer biology research and Toxicology in the 21st Century concepts share many technological tools. A key category within the Three Rs paradigm is the Replacement of animals in toxicity testing with alternative methods, such as bioinformatics-driven analyses of data obtained from human cell cultures exposed to diverse toxicants. This work was recently expanded within the pan-European SEURAT-1 project (Safety Evaluation Ultimately Replacing Animal Testing), to replace repeat-dose toxicity testing with data-rich analyses of sophisticated cell culture models. The aims and objectives of the SEURAT project have been to guide the application, analysis, interpretation and storage of 'omics' technology-derived data within the service-oriented sub-project, ToxBank. Particularly addressing the Lush Science Prize focus on the relevance of toxicity pathways, a 'data warehouse' that is under continuous expansion, coupled with the development of novel data storage and management methods for toxicology, serve to address data integration across multiple 'omics' technologies. The prize winners' guiding principles and concepts for modern knowledge management of toxicological data are summarised. The translation of basic discovery results ranged from chemical-testing and material-testing data, to information relevant to human health and environmental safety.
- Published
- 2015
16. Interactive predictive toxicology with Bioclipse and OpenTox
- Author
-
Willighagen, E.L., Affentranger, R., Grafström, R., Hardy, B., Jeliazkova, N., Spjuth, O., Harland, L., Forster, M., Bioinformatica, and RS: NUTRIM - R4 - Gene-environment interaction
- Subjects
Engineering ,Decision support system ,business.industry ,Data mining ,Predictive toxicology ,computer.software_genre ,business ,Data science ,computer ,Complement (complexity) ,Pharmaceutical industry ,Visualization - Abstract
Computational predictive toxicology draws knowledge from many independent sources, providing a rich support tool to assess a wide variety of toxicological properties. A key example would be for it to complement alternative testing methods. The integration of Bioclipse and OpenTox permits toxicity prediction based on the analysis of chemical structures, and visualization of the substructure contributions to the toxicity prediction. In analogy of the decision support that is already in use in the pharmaceutical industry for designing new drug leads, we use this approach in two case studies in malaria research, using a combination of local and remote predictive models. This way, we find drug leads without predicted toxicity.
- Published
- 2012
17. Using iterative MapReduce for parallel virtual screening
- Author
-
Ahmed, Laeeq, Edlund, Åke, Laure, Erwin, Spjuth, O., Ahmed, Laeeq, Edlund, Åke, Laure, Erwin, and Spjuth, O.
- Abstract
Virtual Screening is a technique in chemo informatics used for Drug discovery by searching large libraries of molecule structures. Virtual Screening often uses SVM, a supervised machine learning technique used for regression and classification analysis. Virtual screening using SVM not only involves huge datasets, but it is also compute expensive with a complexity that can grow at least up to O(n2). SVM based applications most commonly use MPI, which becomes complex and impractical with large datasets. As an alternative to MPI, MapReduce, and its different implementations, have been successfully used on commodity clusters for analysis of data for problems with very large datasets. Due to the large libraries of molecule structures in virtual screening, it becomes a good candidate for MapReduce. In this paper we present a MapReduce implementation of SVM based virtual screening, using Spark, an iterative MapReduce programming model. We show that our implementation has a good scaling behaviour and opens up the possibility of using huge public cloud infrastructures efficiently for virtual screening., QC 20140619
- Published
- 2013
- Full Text
- View/download PDF
18. The Chemistry Development Kit (CDK) v2.0: atom typing, depiction, molecular formulas, and substructure searching
- Author
-
Egon Willighagen, Jw, Mayfield, Alvarsson J, Berg A, Carlsson L, Jeliazkova N, Kuhn S, Pluskal T, Rojas-Chertó M, Spjuth O, Torrance G, Ct, Evelo, Guha R, and Steinbeck C
19. Linking the Resource Description Framework to cheminformatics and proteochemometrics
- Author
-
Willighagen Egon L, Alvarsson Jonathan, Andersson Annsofie, Eklund Martin, Lampa Samuel, Lapins Maris, Spjuth Ola, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 - Abstract
Abstract Background Semantic web technologies are finding their way into the life sciences. Ontologies and semantic markup have already been used for more than a decade in molecular sciences, but have not found widespread use yet. The semantic web technology Resource Description Framework (RDF) and related methods show to be sufficiently versatile to change that situation. Results The work presented here focuses on linking RDF approaches to existing molecular chemometrics fields, including cheminformatics, QSAR modeling and proteochemometrics. Applications are presented that link RDF technologies to methods from statistics and cheminformatics, including data aggregation, visualization, chemical identification, and property prediction. They demonstrate how this can be done using various existing RDF standards and cheminformatics libraries. For example, we show how IC50 and Ki values are modeled for a number of biological targets using data from the ChEMBL database. Conclusions We have shown that existing RDF standards can suitably be integrated into existing molecular chemometrics methods. Platforms that unite these technologies, like Bioclipse, makes this even simpler and more transparent. Being able to create and share workflows that integrate data aggregation and analysis (visual and statistical) is beneficial to interoperability and reproducibility. The current work shows that RDF approaches are sufficiently powerful to support molecular chemometrics workflows.
- Published
- 2011
- Full Text
- View/download PDF
20. Computational toxicology using the OpenTox application programming interface and Bioclipse
- Author
-
Willighagen Egon L, Jeliazkova Nina, Hardy Barry, Grafström Roland C, and Spjuth Ola
- Subjects
Medicine ,Biology (General) ,QH301-705.5 ,Science (General) ,Q1-390 - Abstract
Abstract Background Toxicity is a complex phenomenon involving the potential adverse effect on a range of biological functions. Predicting toxicity involves using a combination of experimental data (endpoints) and computational methods to generate a set of predictive models. Such models rely strongly on being able to integrate information from many sources. The required integration of biological and chemical information sources requires, however, a common language to express our knowledge ontologically, and interoperating services to build reliable predictive toxicology applications. Findings This article describes progress in extending the integrative bio- and cheminformatics platform Bioclipse to interoperate with OpenTox, a semantic web framework which supports open data exchange and toxicology model building. The Bioclipse workbench environment enables functionality from OpenTox web services and easy access to OpenTox resources for evaluating toxicity properties of query molecules. Relevant cases and interfaces based on ten neurotoxins are described to demonstrate the capabilities provided to the user. The integration takes advantage of semantic web technologies, thereby providing an open and simplifying communication standard. Additionally, the use of ontologies ensures proper interoperation and reliable integration of toxicity information from both experimental and computational sources. Conclusions A novel computational toxicity assessment platform was generated from integration of two open science platforms related to toxicology: Bioclipse, that combines a rich scriptable and graphical workbench environment for integration of diverse sets of information sources, and OpenTox, a platform for interoperable toxicology data and computational services. The combination provides improved reliability and operability for handling large data sets by the use of the Open Standards from the OpenTox Application Programming Interface. This enables simultaneous access to a variety of distributed predictive toxicology databases, and algorithm and model resources, taking advantage of the Bioclipse workbench handling the technical layers.
- Published
- 2011
- Full Text
- View/download PDF
21. Open Data, Open Source and Open Standards in chemistry: The Blue Obelisk five years on
- Author
-
O'Boyle Noel M, Guha Rajarshi, Willighagen Egon L, Adams Samuel E, Alvarsson Jonathan, Bradley Jean-Claude, Filippov Igor V, Hanson Robert M, Hanwell Marcus D, Hutchison Geoffrey R, James Craig A, Jeliazkova Nina, Lang Andrew SID, Langner Karol M, Lonie David C, Lowe Daniel M, Pansanel Jérôme, Pavlov Dmitry, Spjuth Ola, Steinbeck Christoph, Tenderholt Adam L, Theisen Kevin J, and Murray-Rust Peter
- Subjects
Information technology ,T58.5-58.64 ,Chemistry ,QD1-999 - Abstract
Abstract Background The Blue Obelisk movement was established in 2005 as a response to the lack of Open Data, Open Standards and Open Source (ODOSOS) in chemistry. It aims to make it easier to carry out chemistry research by promoting interoperability between chemistry software, encouraging cooperation between Open Source developers, and developing community resources and Open Standards. Results This contribution looks back on the work carried out by the Blue Obelisk in the past 5 years and surveys progress and remaining challenges in the areas of Open Data, Open Standards, and Open Source in chemistry. Conclusions We show that the Blue Obelisk has been very successful in bringing together researchers and developers with common interests in ODOSOS, leading to development of many useful resources freely available to the chemistry community.
- Published
- 2011
- Full Text
- View/download PDF
22. Brunn: An open source laboratory information system for microplates with a graphical plate layout design process
- Author
-
Larsson Rolf, Spjuth Ola, Andersson Claes, Alvarsson Jonathan, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Compound profiling and drug screening generates large amounts of data and is generally based on microplate assays. Current information systems used for handling this are mainly commercial, closed source, expensive, and heavyweight and there is a need for a flexible lightweight open system for handling plate design, and validation and preparation of data. Results A Bioclipse plugin consisting of a client part and a relational database was constructed. A multiple-step plate layout point-and-click interface was implemented inside Bioclipse. The system contains a data validation step, where outliers can be removed, and finally a plate report with all relevant calculated data, including dose-response curves. Conclusions Brunn is capable of handling the data from microplate assays. It can create dose-response curves and calculate IC50 values. Using a system of this sort facilitates work in the laboratory. Being able to reuse already constructed plates and plate layouts by starting out from an earlier step in the plate layout design process saves time and cuts down on error sources.
- Published
- 2011
- Full Text
- View/download PDF
23. Use of historic metabolic biotransformation data as a means of anticipating metabolic sites using MetaPrint2D and Bioclipse
- Author
-
Glen Robert C, Adams Samuel, Spjuth Ola, Carlsson Lars, and Boyer Scott
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Predicting metabolic sites is important in the drug discovery process to aid in rapid compound optimisation. No interactive tool exists and most of the useful tools are quite expensive. Results Here a fast and reliable method to analyse ligands and visualise potential metabolic sites is presented which is based on annotated metabolic data, described by circular fingerprints. The method is available via the graphical workbench Bioclipse, which is equipped with advanced features in cheminformatics. Conclusions Due to the speed of predictions (less than 50 ms per molecule), scientists can get real time decision support when editing chemical structures. Bioclipse is a rich client, which means that all calculations are performed on the local computer and do not require network connection. Bioclipse and MetaPrint2D are free for all users, released under open source licenses, and available from http://www.bioclipse.net.
- Published
- 2010
- Full Text
- View/download PDF
24. Towards interoperable and reproducible QSAR analyses: Exchange of datasets
- Author
-
Spjuth Ola, Willighagen Egon L, Guha Rajarshi, Eklund Martin, and Wikberg Jarl ES
- Subjects
Information technology ,T58.5-58.64 ,Chemistry ,QD1-999 - Abstract
Abstract Background QSAR is a widely used method to relate chemical structures to responses or properties based on experimental observations. Much effort has been made to evaluate and validate the statistical modeling in QSAR, but these analyses treat the dataset as fixed. An overlooked but highly important issue is the validation of the setup of the dataset, which comprises addition of chemical structures as well as selection of descriptors and software implementations prior to calculations. This process is hampered by the lack of standards and exchange formats in the field, making it virtually impossible to reproduce and validate analyses and drastically constrain collaborations and re-use of data. Results We present a step towards standardizing QSAR analyses by defining interoperable and reproducible QSAR datasets, consisting of an open XML format (QSAR-ML) which builds on an open and extensible descriptor ontology. The ontology provides an extensible way of uniquely defining descriptors for use in QSAR experiments, and the exchange format supports multiple versioned implementations of these descriptors. Hence, a dataset described by QSAR-ML makes its setup completely reproducible. We also provide a reference implementation as a set of plugins for Bioclipse which simplifies setup of QSAR datasets, and allows for exporting in QSAR-ML as well as old-fashioned CSV formats. The implementation facilitates addition of new descriptor implementations from locally installed software and remote Web services; the latter is demonstrated with REST and XMPP Web services. Conclusions Standardized QSAR datasets open up new ways to store, query, and exchange data for subsequent analyses. QSAR-ML supports completely reproducible creation of datasets, solving the problems of defining which software components were used and their versions, and the descriptor ontology eliminates confusions regarding descriptors by defining them crisply. This makes is easy to join, extend, combine datasets and hence work collectively, but also allows for analyzing the effect descriptors have on the statistical model's performance. The presented Bioclipse plugins equip scientists with graphical tools that make QSAR-ML easily accessible for the community.
- Published
- 2010
- Full Text
- View/download PDF
25. An eScience-Bayes strategy for analyzing omics data
- Author
-
Spjuth Ola, Eklund Martin, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems. Results We present a general approach to omics data analysis that alleviates these problems. By combining eScience and Bayesian methods, we retrieve scientific information and data from multiple sources and coherently incorporate them into large models. These models improve the accuracy of predictions and offer new insights into the underlying mechanisms. This "eScience-Bayes" approach is demonstrated in two proof-of-principle applications, one for breast cancer prognosis prediction from transcriptomic data and one for protein-protein interaction studies based on proteomic data. Conclusions Bayesian statistics provide the flexibility to tailor statistical models to the complex data structures in omics biology as well as permitting coherent integration of multiple data sources. However, Bayesian methods are in general computationally demanding and require specification of possibly thousands of prior distributions. eScience can help us overcome these difficulties. The eScience-Bayes thus approach permits us to fully leverage on the advantages of Bayesian methods, resulting in models with improved predictive performance that gives more information about the underlying biological system.
- Published
- 2010
- Full Text
- View/download PDF
26. Bioclipse 2: A scriptable integration platform for the life sciences
- Author
-
Wagener Johannes, Torrance Gilleain, Mäsak Carl, Kuhn Stefan, Eklund Martin, Berg Arvid, Alvarsson Jonathan, Spjuth Ola, Willighagen Egon L, Steinbeck Christoph, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Contemporary biological research integrates neighboring scientific domains to answer complex questions in fields such as systems biology and drug discovery. This calls for tools that are intuitive to use, yet flexible to adapt to new tasks. Results Bioclipse is a free, open source workbench with advanced features for the life sciences. Version 2.0 constitutes a complete rewrite of Bioclipse, and delivers a stable, scalable integration platform for developers and an intuitive workbench for end users. All functionality is available both from the graphical user interface and from a built-in novel domain-specific language, supporting the scientist in interdisciplinary research and reproducible analyses through advanced visualization of the inputs and the results. New components for Bioclipse 2 include a rewritten editor for chemical structures, a table for multiple molecules that supports gigabyte-sized files, as well as a graphical editor for sequences and alignments. Conclusion Bioclipse 2 is equipped with advanced tools required to carry out complex analysis in the fields of bio- and cheminformatics. Developed as a Rich Client based on Eclipse, Bioclipse 2 leverages on today's powerful desktop computers for providing a responsive user interface, but also takes full advantage of the Web and networked (Web/Cloud) services for more demanding calculations or retrieval of data. The fact that Bioclipse 2 is based on an advanced and widely used service platform ensures wide extensibility, making it easy to add new algorithms, visualizations, as well as scripting commands. The intuitive tools for end users and the extensible architecture make Bioclipse 2 ideal for interdisciplinary and integrative research. Bioclipse 2 is released under the Eclipse Public License (EPL), a flexible open source license that allows additional plugins to be of any license. Bioclipse 2 is implemented in Java and supported on all major platforms; Source code and binaries are freely available at http://www.bioclipse.net.
- Published
- 2009
- Full Text
- View/download PDF
27. XMPP for cloud computing in bioinformatics supporting discovery and invocation of asynchronous web services
- Author
-
Willighagen Egon L, Spjuth Ola, Wagener Johannes, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Life sciences make heavily use of the web for both data provision and analysis. However, the increasing amount of available data and the diversity of analysis tools call for machine accessible interfaces in order to be effective. HTTP-based Web service technologies, like the Simple Object Access Protocol (SOAP) and REpresentational State Transfer (REST) services, are today the most common technologies for this in bioinformatics. However, these methods have severe drawbacks, including lack of discoverability, and the inability for services to send status notifications. Several complementary workarounds have been proposed, but the results are ad-hoc solutions of varying quality that can be difficult to use. Results We present a novel approach based on the open standard Extensible Messaging and Presence Protocol (XMPP), consisting of an extension (IO Data) to comprise discovery, asynchronous invocation, and definition of data types in the service. That XMPP cloud services are capable of asynchronous communication implies that clients do not have to poll repetitively for status, but the service sends the results back to the client upon completion. Implementations for Bioclipse and Taverna are presented, as are various XMPP cloud services in bio- and cheminformatics. Conclusion XMPP with its extensions is a powerful protocol for cloud services that demonstrate several advantages over traditional HTTP-based Web services: 1) services are discoverable without the need of an external registry, 2) asynchronous invocation eliminates the need for ad-hoc solutions like polling, and 3) input and output types defined in the service allows for generation of clients on the fly without the need of an external semantics description. The many advantages over existing technologies make XMPP a highly interesting candidate for next generation online services in bioinformatics.
- Published
- 2009
- Full Text
- View/download PDF
28. The C1C2: A framework for simultaneous model selection and assessment
- Author
-
Wikberg Jarl ES, Spjuth Ola, and Eklund Martin
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background There has been recent concern regarding the inability of predictive modeling approaches to generalize to new data. Some of the problems can be attributed to improper methods for model selection and assessment. Here, we have addressed this issue by introducing a novel and general framework, the C1C2, for simultaneous model selection and assessment. The framework relies on a partitioning of the data in order to separate model choice from model assessment in terms of used data. Since the number of conceivable models in general is vast, it was also of interest to investigate the employment of two automatic search methods, a genetic algorithm and a brute-force method, for model choice. As a demonstration, the C1C2 was applied to simulated and real-world datasets. A penalized linear model was assumed to reasonably approximate the true relation between the dependent and independent variables, thus reducing the model choice problem to a matter of variable selection and choice of penalizing parameter. We also studied the impact of assuming prior knowledge about the number of relevant variables on model choice and generalization error estimates. The results obtained with the C1C2 were compared to those obtained by employing repeated K-fold cross-validation for choosing and assessing a model. Results The C1C2 framework performed well at finding the true model in terms of choosing the correct variable subset and producing reasonable choices for the penalizing parameter, even in situations when the independent variables were highly correlated and when the number of observations was less than the number of variables. The C1C2 framework was also found to give accurate estimates of the generalization error. Prior information about the number of important independent variables improved the variable subset choice but reduced the accuracy of generalization error estimates. Using the genetic algorithm worsened the model choice but not the generalization error estimates, compared to using the brute-force method. The results obtained with repeated K-fold cross-validation were similar to those produced by the C1C2 in terms of model choice, however a lower accuracy of the generalization error estimates was observed. Conclusion The C1C2 framework was demonstrated to work well for finding the true model within a penalized linear model class and accurately assess its generalization error, even for datasets with many highly correlated independent variables, a low observation-to-variable ratio, and model assumption deviations. A complete separation of the model choice and the model assessment in terms of data used for each task improves the estimates of the generalization error.
- Published
- 2008
- Full Text
- View/download PDF
29. Proteochemometric modeling of HIV protease susceptibility
- Author
-
Prusis Peteris, Spjuth Ola, Eklund Martin, Lapins Maris, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background A major obstacle in treatment of HIV is the ability of the virus to mutate rapidly into drug-resistant variants. A method for predicting the susceptibility of mutated HIV strains to antiviral agents would provide substantial clinical benefit as well as facilitate the development of new candidate drugs. Therefore, we used proteochemometrics to model the susceptibility of HIV to protease inhibitors in current use, utilizing descriptions of the physico-chemical properties of mutated HIV proteases and 3D structural property descriptions for the protease inhibitors. The descriptions were correlated to the susceptibility data of 828 unique HIV protease variants for seven protease inhibitors in current use; the data set comprised 4792 protease-inhibitor combinations. Results The model provided excellent predictability (R2 = 0.92, Q2 = 0.87) and identified general and specific features of drug resistance. The model's predictive ability was verified by external prediction in which the susceptibilities to each one of the seven inhibitors were omitted from the data set, one inhibitor at a time, and the data for the six remaining compounds were used to create new models. This analysis showed that the over all predictive ability for the omitted inhibitors was Q2 inhibitors = 0.72. Conclusion Our results show that a proteochemometric approach can provide generalized susceptibility predictions for new inhibitors. Our proteochemometric model can directly analyze inhibitor-protease interactions and facilitate treatment selection based on viral genotype. The model is available for public use, and is located at HIV Drug Research Centre.
- Published
- 2008
- Full Text
- View/download PDF
30. Bioclipse: an open source workbench for chemo- and bioinformatics
- Author
-
Wagener Johannes, Eklund Martin, Kuhn Stefan, Willighagen Egon L, Helmus Tobias, Spjuth Ola, Murray-Rust Peter, Steinbeck Christoph, and Wikberg Jarl ES
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background There is a need for software applications that provide users with a complete and extensible toolkit for chemo- and bioinformatics accessible from a single workbench. Commercial packages are expensive and closed source, hence they do not allow end users to modify algorithms and add custom functionality. Existing open source projects are more focused on providing a framework for integrating existing, separately installed bioinformatics packages, rather than providing user-friendly interfaces. No open source chemoinformatics workbench has previously been published, and no sucessful attempts have been made to integrate chemo- and bioinformatics into a single framework. Results Bioclipse is an advanced workbench for resources in chemo- and bioinformatics, such as molecules, proteins, sequences, spectra, and scripts. It provides 2D-editing, 3D-visualization, file format conversion, calculation of chemical properties, and much more; all fully integrated into a user-friendly desktop application. Editing supports standard functions such as cut and paste, drag and drop, and undo/redo. Bioclipse is written in Java and based on the Eclipse Rich Client Platform with a state-of-the-art plugin architecture. This gives Bioclipse an advantage over other systems as it can easily be extended with functionality in any desired direction. Conclusion Bioclipse is a powerful workbench for bio- and chemoinformatics as well as an advanced integration platform. The rich functionality, intuitive user interface, and powerful plugin architecture make Bioclipse the most advanced and user-friendly open source workbench for chemo- and bioinformatics. Bioclipse is released under Eclipse Public License (EPL), an open source license which sets no constraints on external plugin licensing; it is totally open for both open source plugins as well as commercial ones. Bioclipse is freely available at http://www.bioclipse.net.
- Published
- 2007
- Full Text
- View/download PDF
31. Improved Detection of Drug-Induced Liver Injury by Integrating Predicted In Vivo and In Vitro Data.
- Author
-
Seal S, Williams D, Hosseini-Gerami L, Mahale M, Carpenter AE, Spjuth O, and Bender A
- Subjects
- Humans, Animals, Rats, Chemical and Drug Induced Liver Injury
- Abstract
Drug-induced liver injury (DILI) has been a significant challenge in drug discovery, often leading to clinical trial failures and necessitating drug withdrawals. Over the last decade, the existing suite of in vitro proxy-DILI assays has generally improved at identifying compounds with hepatotoxicity. However, there is considerable interest in enhancing the in silico prediction of DILI because it allows for evaluating large sets of compounds more quickly and cost-effectively, particularly in the early stages of projects. In this study, we aim to study ML models for DILI prediction that first predict nine proxy-DILI labels and then use them as features in addition to chemical structural features to predict DILI. The features include in vitro (e.g., mitochondrial toxicity, bile salt export pump inhibition) data, in vivo (e.g., preclinical rat hepatotoxicity studies) data, pharmacokinetic parameters of maximum concentration, structural fingerprints, and physicochemical parameters. We trained DILI-prediction models on 888 compounds from the DILI data set (composed of DILIst and DILIrank) and tested them on a held-out external test set of 223 compounds from the DILI data set. The best model, DILIPredictor, attained an AUC-PR of 0.79. This model enabled the detection of the top 25 toxic compounds (2.68 LR+, positive likelihood ratio) compared to models using only structural features (1.65 LR+ score). Using feature interpretation from DILIPredictor, we identified the chemical substructures causing DILI and differentiated cases of DILI caused by compounds in animals but not in humans. For example, DILIPredictor correctly recognized 2-butoxyethanol as nontoxic in humans despite its hepatotoxicity in mice models. Overall, the DILIPredictor model improves the detection of compounds causing DILI with an improved differentiation between animal and human sensitivity and the potential for mechanism evaluation. DILIPredictor required only chemical structures as input for prediction and is publicly available at https://broad.io/DILIPredictor for use via web interface and with all code available for download.
- Published
- 2024
- Full Text
- View/download PDF
32. Artificial intelligence for high content imaging in drug discovery.
- Author
-
Carreras-Puigvert J and Spjuth O
- Subjects
- Humans, Drug Discovery methods, Artificial Intelligence
- Abstract
Artificial intelligence (AI) and high-content imaging (HCI) are contributing to advancements in drug discovery, propelled by the recent progress in deep neural networks. This review highlights AI's role in analysis of HCI data from fixed and live-cell imaging, enabling novel label-free and multi-channel fluorescent screening methods, and improving compound profiling. HCI experiments are rapid and cost-effective, facilitating large data set accumulation for AI model training. However, the success of AI in drug discovery also depends on high-quality data, reproducible experiments, and robust validation to ensure model performance. Despite challenges like the need for annotated compounds and managing vast image data, AI's potential in phenotypic screening and drug profiling is significant. Future improvements in AI, including increased interpretability and integration of multiple modalities, are expected to solidify AI and HCI's role in drug discovery., Competing Interests: Declaration of competing interest OS and JCP declare ownership in Phenaros Pharmaceuticals AB, a company exploiting AI, automation and HCI for drug discovery., (Copyright © 2024 The Author(s). Published by Elsevier Ltd.. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
33. CPSign: conformal prediction for cheminformatics modeling.
- Author
-
Arvidsson McShane S, Norinder U, Alvarsson J, Ahlberg E, Carlsson L, and Spjuth O
- Abstract
Conformal prediction has seen many applications in pharmaceutical science, being able to calibrate outputs of machine learning models and producing valid prediction intervals. We here present the open source software CPSign that is a complete implementation of conformal prediction for cheminformatics modeling. CPSign implements inductive and transductive conformal prediction for classification and regression, and probabilistic prediction with the Venn-ABERS methodology. The main chemical representation is signatures but other types of descriptors are also supported. The main modeling methodology is support vector machines (SVMs), but additional modeling methods are supported via an extension mechanism, e.g. DeepLearning4J models. We also describe features for visualizing results from conformal models including calibration and efficiency plots, as well as features to publish predictive models as REST services. We compare CPSign against other common cheminformatics modeling approaches including random forest, and a directed message-passing neural network. The results show that CPSign produces robust predictive performance with comparative predictive efficiency, with superior runtime and lower hardware requirements compared to neural network based models. CPSign has been used in several studies and is in production-use in multiple organizations. The ability to work directly with chemical input files, perform descriptor calculation and modeling with SVM in the conformal prediction framework, with a single software package having a low footprint and fast execution time makes CPSign a convenient and yet flexible package for training, deploying, and predicting on chemical data. CPSign can be downloaded from GitHub at https://github.com/arosbio/cpsign .Scientific contribution CPSign provides a single software that allows users to perform data preprocessing, modeling and make predictions directly on chemical structures, using conformal and probabilistic prediction. Building and evaluating new models can be achieved at a high abstraction level, without sacrificing flexibility and predictive performance-showcased with a method evaluation against contemporary modeling approaches, where CPSign performs on par with a state-of-the-art deep learning based model., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
34. A Decade in a Systematic Review: The Evolution and Impact of Cell Painting.
- Author
-
Seal S, Trapotsi MA, Spjuth O, Singh S, Carreras-Puigvert J, Greene N, Bender A, and Carpenter AE
- Abstract
High-content image-based assays have fueled significant discoveries in the life sciences in the past decade (2013-2023), including novel insights into disease etiology, mechanism of action, new therapeutics, and toxicology predictions. Here, we systematically review the substantial methodological advancements and applications of Cell Painting. Advancements include improvements in the Cell Painting protocol, assay adaptations for different types of perturbations and applications, and improved methodologies for feature extraction, quality control, and batch effect correction. Moreover, machine learning methods recently surpassed classical approaches in their ability to extract biologically useful information from Cell Painting images. Cell Painting data have been used alone or in combination with other - omics data to decipher the mechanism of action of a compound, its toxicity profile, and many other biological effects. Overall, key methodological advances have expanded Cell Painting's ability to capture cellular responses to various perturbations. Future advances will likely lie in advancing computational and experimental techniques, developing new publicly available datasets, and integrating them with other high-content data types., Competing Interests: The authors declare the following competing financial interest(s): S. Singh and A.E.C. serve as scientific advisors for companies that use image-based profiling and Cell Painting (A.E.C.: Recursion, SyzOnc, Quiver Bioscience; S. Singh: Waypoint Bio, Dewpoint Therapeutics, DeepCell) and receive honoraria for occasional talks at pharmaceutical and biotechnology companies. J.C.P. and O.S. declare ownership in Phenaros Pharmaceuticals.
- Published
- 2024
- Full Text
- View/download PDF
35. New approach methods to assess developmental and adult neurotoxicity for regulatory use: a PARC work package 5 project.
- Author
-
Tal T, Myhre O, Fritsche E, Rüegg J, Craenen K, Aiello-Holden K, Agrillo C, Babin PJ, Escher BI, Dirven H, Hellsten K, Dolva K, Hessel E, Heusinkveld HJ, Hadzhiev Y, Hurem S, Jagiello K, Judzinska B, Klüver N, Knoll-Gellida A, Kühne BA, Leist M, Lislien M, Lyche JL, Müller F, Colbourne JK, Neuhaus W, Pallocca G, Seeger B, Scharkin I, Scholz S, Spjuth O, Torres-Ruiz M, and Bartmann K
- Abstract
In the European regulatory context, rodent in vivo studies are the predominant source of neurotoxicity information. Although they form a cornerstone of neurotoxicological assessments, they are costly and the topic of ethical debate. While the public expects chemicals and products to be safe for the developing and mature nervous systems, considerable numbers of chemicals in commerce have not, or only to a limited extent, been assessed for their potential to cause neurotoxicity. As such, there is a societal push toward the replacement of animal models with in vitro or alternative methods. New approach methods (NAMs) can contribute to the regulatory knowledge base, increase chemical safety, and modernize chemical hazard and risk assessment. Provided they reach an acceptable level of regulatory relevance and reliability, NAMs may be considered as replacements for specific in vivo studies. The European Partnership for the Assessment of Risks from Chemicals (PARC) addresses challenges to the development and implementation of NAMs in chemical risk assessment. In collaboration with regulatory agencies, Project 5.2.1e (Neurotoxicity) aims to develop and evaluate NAMs for developmental neurotoxicity (DNT) and adult neurotoxicity (ANT) and to understand the applicability domain of specific NAMs for the detection of endocrine disruption and epigenetic perturbation. To speed up assay time and reduce costs, we identify early indicators of later-onset effects. Ultimately, we will assemble second-generation developmental neurotoxicity and first-generation adult neurotoxicity test batteries, both of which aim to provide regulatory hazard and risk assessors and industry stakeholders with robust, speedy, lower-cost, and informative next-generation hazard and risk assessment tools., Competing Interests: EF and KB are shareholders of the DNTOX GmbH offering neurotoxicity testing services. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2024 Tal, Myhre, Fritsche, Rüegg, Craenen, Aiello-Holden, Agrillo, Babin, Escher, Dirven, Hellsten, Dolva, Hessel, Heusinkveld, Hadzhiev, Hurem, Jagiello, Judzinska, Klüver, Knoll-Gellida, Kühne, Leist, Lislien, Lyche, Müller, Colbourne, Neuhaus, Pallocca, Seeger, Scharkin, Scholz, Spjuth, Torres-Ruiz and Bartmann.)
- Published
- 2024
- Full Text
- View/download PDF
36. Corrigendum: Development of new approach methods for the identification and characterization of endocrine metabolic disruptors-a PARC project.
- Author
-
Braeuning A, Balaguer P, Bourguet W, Carreras-Puigvert J, Feiertag K, Kamstra JH, Knapen D, Lichtenstein D, Marx-Stoelting P, Rietdijk J, Schubert K, Spjuth O, Stinckens E, Thedieck K, van den Boom R, Vergauwen L, von Bergen M, Wewer N, and Zalko D
- Abstract
[This corrects the article DOI: 10.3389/ftox.2023.1212509.]., (Copyright © 2024 Braeuning, Balaguer, Bourguet, Carreras-Puigvert, Feiertag, Kamstra, Knapen, Lichtenstein, Marx-Stoelting, Rietdijk, Schubert, Spjuth, Stinckens, Thedieck, van den Boom, Vergauwen, von Bergen, Wewer and Zalko.)
- Published
- 2024
- Full Text
- View/download PDF
37. From pixels to phenotypes: Integrating image-based profiling with cell health data as BioMorph features improves interpretability.
- Author
-
Seal S, Carreras-Puigvert J, Singh S, Carpenter AE, Spjuth O, and Bender A
- Subjects
- Phenotype, DNA Replication, Software
- Abstract
Cell Painting assays generate morphological profiles that are versatile descriptors of biological systems and have been used to predict in vitro and in vivo drug effects. However, Cell Painting features extracted from classical software such as CellProfiler are based on statistical calculations and often not readily biologically interpretable. In this study, we propose a new feature space, which we call BioMorph, that maps these Cell Painting features with readouts from comprehensive Cell Health assays. We validated that the resulting BioMorph space effectively connected compounds not only with the morphological features associated with their bioactivity but with deeper insights into phenotypic characteristics and cellular processes associated with the given bioactivity. The BioMorph space revealed the mechanism of action for individual compounds, including dual-acting compounds such as emetine, an inhibitor of both protein synthesis and DNA replication. Overall, BioMorph space offers a biologically relevant way to interpret the cell morphological features derived using software such as CellProfiler and to generate hypotheses for experimental validation.
- Published
- 2024
- Full Text
- View/download PDF
38. Insights into Drug Cardiotoxicity from Biological and Chemical Data: The First Public Classifiers for FDA Drug-Induced Cardiotoxicity Rank.
- Author
-
Seal S, Spjuth O, Hosseini-Gerami L, García-Ortegón M, Singh S, Bender A, and Carpenter AE
- Subjects
- Humans, Cardiotoxicity etiology, Cardiotoxicity metabolism, Drug Development
- Abstract
Drug-induced cardiotoxicity (DICT) is a major concern in drug development, accounting for 10-14% of postmarket withdrawals. In this study, we explored the capabilities of chemical and biological data to predict cardiotoxicity, using the recently released DICTrank data set from the United States FDA. We found that such data, including protein targets, especially those related to ion channels (e.g., hERG), physicochemical properties (e.g., electrotopological state), and peak concentration in plasma offer strong predictive ability for DICT. Compounds annotated with mechanisms of action such as cyclooxygenase inhibition could distinguish between most-concern and no-concern DICT. Cell Painting features for ER stress discerned most-concern cardiotoxic from nontoxic compounds. Models based on physicochemical properties provided substantial predictive accuracy (AUCPR = 0.93). With the availability of omics data in the future, using biological data promises enhanced predictability and deeper mechanistic insights, paving the way for safer drug development. All models from this study are available at https://broad.io/DICTrank_Predictor.
- Published
- 2024
- Full Text
- View/download PDF
39. Insights into Drug Cardiotoxicity from Biological and Chemical Data: The First Public Classifiers for FDA DICTrank.
- Author
-
Seal S, Spjuth O, Hosseini-Gerami L, García-Ortegón M, Singh S, Bender A, and Carpenter AE
- Abstract
Drug-induced cardiotoxicity (DICT) is a major concern in drug development, accounting for 10-14% of postmarket withdrawals. In this study, we explored the capabilities of various chemical and biological data to predict cardiotoxicity, using the recently released Drug-Induced Cardiotoxicity Rank (DICTrank) dataset from the United States FDA. We analyzed a diverse set of data sources, including physicochemical properties, annotated mechanisms of action (MOA), Cell Painting, Gene Expression, and more, to identify indications of cardiotoxicity. We found that such data, including protein targets, especially those related to ion channels (such as hERG), physicochemical properties (such as electrotopological state) as well as peak concentration in plasma offer strong predictive ability as well as valuable insights into DICT. We also found compounds annotated with particular mechanisms of action, such as cyclooxygenase inhibition, could distinguish between most-concern and no-concern DICT compounds. Cell Painting features related to ER stress discern the most-concern cardiotoxic compounds from non-toxic compounds. While models based on physicochemical properties currently provide substantial predictive accuracy (AUCPR = 0.93), this study also underscores the potential benefits of incorporating more comprehensive biological data in future DICT predictive models. With the availability of - omics data in the future, using biological data promises enhanced predictability and delivers deeper mechanistic insights, paving the way for safer therapeutic drug development. All models and data used in this study are publicly released at https://broad.io/DICTrank_Predictor., Competing Interests: Author Declarations S Singh and AEC serve as scientific advisors for companies that use image-based profiling and Cell Painting (AEC: Recursion, SyzOnc, S Singh: Waypoint Bio, Dewpoint Therapeutics) and receive honoraria for occasional talks at pharmaceutical and biotechnology companies. OS declares shares in Phenaros Pharmaceuticals. LGH is an employee at Ignota Labs where CellScape is a proprietary software. All other authors declare no relevant competing interests.
- Published
- 2023
- Full Text
- View/download PDF
40. ELIXIR and Toxicology: a community in development.
- Author
-
Martens M, Stierum R, Schymanski EL, Evelo CT, Aalizadeh R, Aladjov H, Arturi K, Audouze K, Babica P, Berka K, Bessems J, Blaha L, Bolton EE, Cases M, Damalas DΕ, Dave K, Dilger M, Exner T, Geerke DP, Grafström R, Gray A, Hancock JM, Hollert H, Jeliazkova N, Jennen D, Jourdan F, Kahlem P, Klanova J, Kleinjans J, Kondic T, Kone B, Lynch I, Maran U, Martinez Cuesta S, Ménager H, Neumann S, Nymark P, Oberacher H, Ramirez N, Remy S, Rocca-Serra P, Salek RM, Sallach B, Sansone SA, Sanz F, Sarimveis H, Sarntivijai S, Schulze T, Slobodnik J, Spjuth O, Tedds J, Thomaidis N, Weber RJM, van Westen GJP, Wheelock CE, Williams AJ, Witters H, Zdrazil B, Županič A, and Willighagen EL
- Subjects
- Europe, Risk Assessment, Biological Science Disciplines
- Abstract
Toxicology has been an active research field for many decades, with academic, industrial and government involvement. Modern omics and computational approaches are changing the field, from merely disease-specific observational models into target-specific predictive models. Traditionally, toxicology has strong links with other fields such as biology, chemistry, pharmacology and medicine. With the rise of synthetic and new engineered materials, alongside ongoing prioritisation needs in chemical risk assessment for existing chemicals, early predictive evaluations are becoming of utmost importance to both scientific and regulatory purposes. ELIXIR is an intergovernmental organisation that brings together life science resources from across Europe. To coordinate the linkage of various life science efforts around modern predictive toxicology, the establishment of a new ELIXIR Community is seen as instrumental. In the past few years, joint efforts, building on incidental overlap, have been piloted in the context of ELIXIR. For example, the EU-ToxRisk, diXa, HeCaToS, transQST, and the nanotoxicology community have worked with the ELIXIR TeSS, Bioschemas, and Compute Platforms and activities. In 2018, a core group of interested parties wrote a proposal, outlining a sketch of what this new ELIXIR Toxicology Community would look like. A recent workshop (held September 30th to October 1st, 2020) extended this into an ELIXIR Toxicology roadmap and a shortlist of limited investment-high gain collaborations to give body to this new community. This Whitepaper outlines the results of these efforts and defines our vision of the ELIXIR Toxicology Community and how it complements other ELIXIR activities., Competing Interests: No competing interests were disclosed., (Copyright: © 2023 Martens M et al.)
- Published
- 2023
- Full Text
- View/download PDF
41. Author Correction: A method for Boolean analysis of protein interactions at a molecular level.
- Author
-
Raykova D, Kermpatsou D, Malmqvist T, Harrison PJ, Sander MR, Stiller C, Heldin J, Leino M, Ricardo S, Klemm A, David L, Spjuth O, Vemuri K, Dimberg A, Sundqvist A, Norlin M, Klaesson A, Kampf C, and Söderberg O
- Published
- 2023
- Full Text
- View/download PDF
42. Evaluating the utility of brightfield image data for mechanism of action prediction.
- Author
-
Harrison PJ, Gupta A, Rietdijk J, Wieslander H, Carreras-Puigvert J, Georgiev P, Wählby C, Spjuth O, and Sintorn IM
- Subjects
- Microscopy, Fluorescence methods, Cells, Cultured, Image Processing, Computer-Assisted methods
- Abstract
Fluorescence staining techniques, such as Cell Painting, together with fluorescence microscopy have proven invaluable for visualizing and quantifying the effects that drugs and other perturbations have on cultured cells. However, fluorescence microscopy is expensive, time-consuming, labor-intensive, and the stains applied can be cytotoxic, interfering with the activity under study. The simplest form of microscopy, brightfield microscopy, lacks these downsides, but the images produced have low contrast and the cellular compartments are difficult to discern. Nevertheless, by harnessing deep learning, these brightfield images may still be sufficient for various predictive purposes. In this study, we compared the predictive performance of models trained on fluorescence images to those trained on brightfield images for predicting the mechanism of action (MoA) of different drugs. We also extracted CellProfiler features from the fluorescence images and used them to benchmark the performance. Overall, we found comparable and largely correlated predictive performance for the two imaging modalities. This is promising for future studies of MoAs in time-lapse experiments for which using fluorescence images is problematic. Explorations based on explainable AI techniques also provided valuable insights regarding compounds that were better predicted by one modality over the other., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2023 Harrison et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
43. Development of new approach methods for the identification and characterization of endocrine metabolic disruptors-a PARC project.
- Author
-
Braeuning A, Balaguer P, Bourguet W, Carreras-Puigvert J, Feiertag K, Kamstra JH, Knapen D, Lichtenstein D, Marx-Stoelting P, Rietdijk J, Schubert K, Spjuth O, Stinckens E, Thedieck K, van den Boom R, Vergauwen L, von Bergen M, Wewer N, and Zalko D
- Abstract
In past times, the analysis of endocrine disrupting properties of chemicals has mainly been focused on (anti-)estrogenic or (anti-)androgenic properties, as well as on aspects of steroidogenesis and the modulation of thyroid signaling. More recently, disruption of energy metabolism and related signaling pathways by exogenous substances, so-called metabolism-disrupting chemicals (MDCs) have come into focus. While general effects such as body and organ weight changes are routinely monitored in animal studies, there is a clear lack of mechanistic test systems to determine and characterize the metabolism-disrupting potential of chemicals. In order to contribute to filling this gap, one of the project within EU-funded Partnership for the Assessment of Risks of Chemicals (PARC) aims at developing novel in vitro methods for the detection of endocrine metabolic disruptors. Efforts will comprise projects related to specific signaling pathways, for example, involving mTOR or xenobiotic-sensing nuclear receptors, studies on hepatocytes, adipocytes and pancreatic beta cells covering metabolic and morphological endpoints, as well as metabolism-related zebrafish-based tests as an alternative to classic rodent bioassays. This paper provides an overview of the approaches and methods of these PARC projects and how this will contribute to the improvement of the toxicological toolbox to identify substances with endocrine disrupting properties and to decipher their mechanisms of action., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The handling editor TV declared a shared research group PARC PROJECT: European Partnership on the Assessment of Risks from Chemicals (PARC) with the authors at the time of review., (Copyright © 2023 Braeuning, Balaguer, Bourguet, Carreras-Puigvert, Feiertag, Kamstra, Knapen, Lichtenstein, Marx-Stoelting, Rietdijk, Schubert, Spjuth, Stinckens, Thedieck, van den Boom, Vergauwen, von Bergen, Wewer and Zalko.)
- Published
- 2023
- Full Text
- View/download PDF
44. Merging bioactivity predictions from cell morphology and chemical fingerprint models using similarity to training data.
- Author
-
Seal S, Yang H, Trapotsi MA, Singh S, Carreras-Puigvert J, Spjuth O, and Bender A
- Abstract
The applicability domain of machine learning models trained on structural fingerprints for the prediction of biological endpoints is often limited by the lack of diversity of chemical space of the training data. In this work, we developed similarity-based merger models which combined the outputs of individual models trained on cell morphology (based on Cell Painting) and chemical structure (based on chemical fingerprints) and the structural and morphological similarities of the compounds in the test dataset to compounds in the training dataset. We applied these similarity-based merger models using logistic regression models on the predictions and similarities as features and predicted assay hit calls of 177 assays from ChEMBL, PubChem and the Broad Institute (where the required Cell Painting annotations were available). We found that the similarity-based merger models outperformed other models with an additional 20% assays (79 out of 177 assays) with an AUC > 0.70 compared with 65 out of 177 assays using structural models and 50 out of 177 assays using Cell Painting models. Our results demonstrated that similarity-based merger models combining structure and cell morphology models can more accurately predict a wide range of biological assay outcomes and further expanded the applicability domain by better extrapolating to new structural and morphology spaces., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
45. Disease phenotype prediction in multiple sclerosis.
- Author
-
Herman S, Arvidsson McShane S, Zjukovskaja C, Khoonsari PE, Svenningsson A, Burman J, Spjuth O, and Kultima K
- Abstract
Progressive multiple sclerosis (PMS) is currently diagnosed retrospectively. Here, we work toward a set of biomarkers that could assist in early diagnosis of PMS. A selection of cerebrospinal fluid metabolites (n = 15) was shown to differentiate between PMS and its preceding phenotype in an independent cohort (AUC = 0.93). Complementing the classifier with conformal prediction showed that highly confident predictions could be made, and that three out of eight patients developing PMS within three years of sample collection were predicted as PMS at that time point. Finally, this methodology was applied to PMS patients as part of a clinical trial for intrathecal treatment with rituximab. The methodology showed that 68% of the patients decreased their similarity to the PMS phenotype one year after treatment. In conclusion, the inclusion of confidence predictors contributes with more information compared to traditional machine learning, and this information is relevant for disease monitoring., Competing Interests: The authors declare no competing interests., (© 2023 The Authors.)
- Published
- 2023
- Full Text
- View/download PDF
46. In Silico Prediction of Human Clinical Pharmacokinetics with ANDROMEDA by Prosilico: Predictions for an Established Benchmarking Data Set, a Modern Small Drug Data Set, and a Comparison with Laboratory Methods.
- Author
-
Fagerholm U, Hellberg S, Alvarsson J, and Spjuth O
- Subjects
- Animals, Humans, Permeability, Pharmacokinetics, Pharmaceutical Preparations, Computer Simulation, Benchmarking, Models, Biological
- Abstract
There is an ongoing aim to replace animal and in vitro laboratory models with in silico methods. Such replacement requires the successful validation and comparably good performance of the alternative methods. We have developed an in silico prediction system for human clinical pharmacokinetics, based on machine learning, conformal prediction and a new physiologically-based pharmacokinetic model, i.e. ANDROMEDA. The objectives of this study were: a) to evaluate how well ANDROMEDA predicts the human clinical pharmacokinetics of a previously proposed benchmarking data set comprising 24 physicochemically diverse drugs and 28 small drug molecules new to the market in 2021; b) to compare its predictive performance with that of laboratory methods; and c) to investigate and describe the pharmacokinetic characteristics of the modern drugs. Median and maximum prediction errors for the selected major parameters were ca 1.2 to 2.5-fold and 16-fold for both data sets, respectively. Prediction accuracy was on par with, or better than, the best laboratory-based prediction methods (superior performance for a vast majority of the comparisons), and the prediction range was considerably broader. The modern drugs have higher average molecular weight than those in the benchmarking set from 15 years earlier ( ca 200 g/mol higher), and were predicted to (generally) have relatively complex pharmacokinetics, including permeability and dissolution limitations and significant renal, biliary and/or gut-wall elimination. In conclusion, the results were overall better than those obtained with laboratory methods, and thus serve to further validate the ANDROMEDA in silico system for the prediction of human clinical pharmacokinetics of modern and physicochemically diverse drugs.
- Published
- 2023
- Full Text
- View/download PDF
47. Estimating diagnostic uncertainty in artificial intelligence assisted pathology using conformal prediction.
- Author
-
Olsson H, Kartasalo K, Mulliqi N, Capuccini M, Ruusuvuori P, Samaratunga H, Delahunt B, Lindskog C, Janssen EAM, Blilie A, Egevad L, Spjuth O, and Eklund M
- Subjects
- Male, Humans, Uncertainty, Prostate, Biopsy, Artificial Intelligence, Neoplasms
- Abstract
Unreliable predictions can occur when an artificial intelligence (AI) system is presented with data it has not been exposed to during training. We demonstrate the use of conformal prediction to detect unreliable predictions, using histopathological diagnosis and grading of prostate biopsies as example. We digitized 7788 prostate biopsies from 1192 men in the STHLM3 diagnostic study, used for training, and 3059 biopsies from 676 men used for testing. With conformal prediction, 1 in 794 (0.1%) predictions is incorrect for cancer diagnosis (compared to 14 errors [2%] without conformal prediction) while 175 (22%) of the predictions are flagged as unreliable when the AI-system is presented with new data from the same lab and scanner that it was trained on. Conformal prediction could with small samples (N = 49 for external scanner, N = 10 for external lab and scanner, and N = 12 for external lab, scanner and pathology assessment) detect systematic differences in external data leading to worse predictive performance. The AI-system with conformal prediction commits 3 (2%) errors for cancer detection in cases of atypical prostate tissue compared to 44 (25%) without conformal prediction, while the system flags 143 (80%) unreliable predictions. We conclude that conformal prediction can increase patient safety of AI-systems., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
48. From biomedical cloud platforms to microservices: next steps in FAIR data and analysis.
- Author
-
Sheffield NC, Bonazzi VR, Bourne PE, Burdett T, Clark T, Grossman RL, Spjuth O, and Yates AD
- Published
- 2022
- Full Text
- View/download PDF
49. The Impact of Reference Data Selection for the Prediction Accuracy of Intrinsic Hepatic Metabolic Clearance.
- Author
-
Fagerholm U, Spjuth O, and Hellberg S
- Subjects
- Humans, Kinetics, Metabolic Clearance Rate, Microsomes, Liver metabolism, Hepatocytes metabolism, Liver metabolism
- Abstract
In vitro-in vivo prediction results for hepatic metabolic clearance (CL
H ) and intrinsic CLH (CLint ) vary widely among studies. Reasons are not fully investigated and understood. The possibility to select favorable reference data for in vivo CLH and CLint and unbound fraction in plasma (fu ) is among possible explanations. The main objective was to investigate how reference data selection influences log in vitro and in vivo CLint -correlations (r2 ). Another aim was to make a head-to-head comparison vs an in silico prediction method. Human hepatocyte CLint -data for 15 compounds from two studies were selected. These were correlated to in vivo CLint estimated using different reported CLH - and fu -estimates. Depending on the choice of reference data, r2 from two studies were 0.07 to 0.86 and 0.06 to 0.79. When using average reference estimates a r2 of 0.62 was achieved. Inclusion of two outliers in one of the studies resulted in a r2 of 0.38, which was lower than the predictive accuracy (q2 ) for the in silico method (0.48). In conclusion, the selection of reference data appears to play a major role for demonstrated predictions and the in silico method showed higher accuracy and wider range than hepatocytes for human in vivo CLint -predictions., Competing Interests: Declaration of Competing Interests The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2022. Published by Elsevier Inc.)- Published
- 2022
- Full Text
- View/download PDF
50. In Silico Predictions of the Gastrointestinal Uptake of Macrocycles in Man Using Conformal Prediction Methodology.
- Author
-
Fagerholm U, Hellberg S, Alvarsson J, and Spjuth O
- Subjects
- Administration, Oral, Caco-2 Cells, Computer Simulation, Humans, Permeability, Pharmaceutical Preparations, Solubility, Intestinal Absorption, Models, Biological
- Abstract
The gastrointestinal uptake of macrocyclic compounds is not fully understood. Here we applied our previously validated integrated system based on machine learning and conformal prediction to predict the passive fraction absorbed (f
a ), maximum fraction dissolved (fdiss ), substrate specificities for major efflux transporters and total fraction absorbed (fa,tot ) for a selected set of designed macrocyclic compounds (n = 37; MW 407-889 g/mol) and macrocyclic drugs (n = 16; MW 734-1203 g/mole) in vivo in man. Major aims were to increase the understanding of oral absorption of macrocycles and further validate our methodology. We predicted designed macrocycles to have high fa and low to high fdiss and fa,tot , and average estimates were higher than for the larger macrocyclic drugs. With few exceptions, compounds were predicted to be effluxed and well absorbed. A 2-fold median prediction error for fa,tot was achieved for macrocycles (validation set). Advantages with our methodology include that it enables predictions for macrocycles with low permeability, Caco-2 recovery and solubility (BCS IV), and provides prediction intervals and guides optimization of absorption. The understanding of oral absorption of macrocycles was increased and the methodology was validated for prediction of the uptake of macrocycles in man., Competing Interests: Declaration of Competing Interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Urban Fagerholm, Sven Hellberg and Ola Spjuth declare shares in Prosilico AB, a Swedish company that develops solutions for human clinical ADME/PK predictions. Ola Spjuth declares shares in Aros Bio AB, a company developing the CPSign software., (Copyright © 2022 American Pharmacists Association. Published by Elsevier Inc. All rights reserved.)- Published
- 2022
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.