72 results on '"Christoph Helma"'
Search Results
2. Modeling Chronic Toxicity: A Comparison of Experimental Variability With (Q)SAR/Read-Across Predictions
- Author
-
Christoph Helma, David Vorgrimmler, Denis Gebele, Martin Gütlein, Barbara Engeli, Jürg Zarn, Benoit Schilter, and Elena Lo Piparo
- Subjects
(Q)SAR ,read-across ,LOAEL ,experimental variability ,lazar ,Therapeutics. Pharmacology ,RM1-950 - Abstract
This study compares the accuracy of (Q)SAR/read-across predictions with the experimental variability of chronic lowest-observed-adverse-effect levels (LOAELs) from in vivo experiments. We could demonstrate that predictions of the lazy structure-activity relationships (lazar) algorithm within the applicability domain of the training data have the same variability as the experimental training data. Predictions with a lower similarity threshold (i.e., a larger distance from the applicability domain) are also significantly better than random guessing, but the errors to be expected are higher and a manual inspection of prediction results is highly recommended.
- Published
- 2018
- Full Text
- View/download PDF
3. Nano-Lazar: Read across Predictions for Nanoparticle Toxicities with Calculated and Measured Properties
- Author
-
Christoph Helma, Micha Rautenberg, and Denis Gebele
- Subjects
nanoparticle ,toxicity ,QSAR ,read-across ,predictive toxicology ,machine learning ,Therapeutics. Pharmacology ,RM1-950 - Abstract
The lazar framework for read across predictions was expanded for the prediction of nanoparticle toxicities, and a new methodology for calculating nanoparticle descriptors from core and coating structures was implemented. Nano-lazar provides a flexible and reproducible framework for downloading data and ontologies from the open eNanoMapper infrastructure, developing and validating nanoparticle read across models, open-source code and a free graphical interface for nanoparticle read-across predictions. In this study we compare different nanoparticle descriptor sets and local regression algorithms. Sixty independent crossvalidation experiments were performed for the Net Cell Association endpoint of the Protein Corona dataset. The best RMSE and r2 results originated from models with protein corona descriptors and the weighted random forest algorithm, but their 95% prediction interval is significantly less accurate than for models with simpler descriptor sets (measured and calculated nanoparticle properties). The most accurate prediction intervals were obtained with measured nanoparticle properties (no statistical significant difference (p < 0.05) of RMSE and r2 values compared to protein corona descriptors). Calculated descriptors are interesting for cheap and fast high-throughput screening purposes. RMSE and prediction intervals of random forest models are comparable to protein corona models, but r2 values are significantly lower.
- Published
- 2017
- Full Text
- View/download PDF
4. Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties
- Author
-
Monika Batke, Martin Gütlein, Falko Partosch, Ursula Gundert-Remy, Christoph Helma, Stefan Kramer, Andreas Maunz, Madeleine Seeland, and Annette Bitsch
- Subjects
QSAR ,read across ,Non-animal methods ,Predictive Clustering Tree (PCT) method ,toxicological and structural similarity ,Therapeutics. Pharmacology ,RM1-950 - Abstract
1.AbstractInterest is increasing in the development of non-animal methods for toxicological evaluations. These methods are however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g. the European Union´s Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine structural similarity with shared mechanisms of action. Substances with similar chemical structure and toxicological profile form candidate categories suitable for read-across. We combined two databases on repeated dose toxicity, RepDose database and ELINCS database to form a common database for the identification of categories. The resulting database contained physicochemical, structural and toxicological data, which were refined and curated for cluster analyses. We applied the Predictive Clustering Tree (PCT) approach for clustering chemicals based on structural and on toxicological information to detect groups of chemicals with similar toxic profiles and pathways/mechanisms of toxicity. As many of the experimental toxicity values were not available, this data was imputed by predicting them with a multi-label classification method, prior to clustering. The clustering results were evaluated by assessing chemical and toxicological similarities with the aim of identifying clusters with a concordance between structural information and toxicity profiles/mechanisms. From these chosen clusters, seven were selected for a quantitative read-across, based on a small ratio of NOAEL of the members with the highest and the lowest NOAEL in the cluster (
- Published
- 2016
- Full Text
- View/download PDF
5. Out-of-bag discriminative graph mining.
- Author
-
Andreas Maunz, David Vorgrimmler, and Christoph Helma
- Published
- 2013
- Full Text
- View/download PDF
6. Latent Structure Pattern Mining.
- Author
-
Andreas Maunz, Christoph Helma, Tobias Cramer, and Stefan Kramer 0001
- Published
- 2010
- Full Text
- View/download PDF
7. Large-scale graph mining using backbone refinement classes.
- Author
-
Andreas Maunz, Christoph Helma, and Stefan Kramer 0001
- Published
- 2009
- Full Text
- View/download PDF
8. Answering Scientific Questions with linked European Nanosafety Data.
- Author
-
Egon L. Willighagen, Micha Rautenberg, Denis Gebele, Linda Rieswijk, Friederike Ehrhart, Jiakang Chang, Georgios Drakakis, Penny Nymark, Pekka Kohonen, Gareth I. Owen, Haralambos Sarimveis, Christoph Helma, and Nina Jeliazkova
- Published
- 2016
9. Molecular feature mining in HIV data.
- Author
-
Stefan Kramer 0001, Luc De Raedt, and Christoph Helma
- Published
- 2001
- Full Text
- View/download PDF
10. Efficient mining for structurally diverse subgraph patterns in large molecular databases.
- Author
-
Andreas Maunz, Christoph Helma, and Stefan Kramer 0001
- Published
- 2011
- Full Text
- View/download PDF
11. Stochastic Propositionalization of Non-determinate Background Knowledge.
- Author
-
Stefan Kramer 0001, Bernhard Pfahringer, and Christoph Helma
- Published
- 1998
- Full Text
- View/download PDF
12. Mining for Causes of Cancer: Machine Learning Experiments at Various Levels of Detail.
- Author
-
Stefan Kramer 0001, Bernhard Pfahringer, and Christoph Helma
- Published
- 1997
13. Data Mining and Machine Learning Techniques for the Identification of Mutagenicity Inducing Substructures and Structure Activity Relationships of Noncongeneric Compounds.
- Author
-
Christoph Helma, Tobias Cramer, Stefan Kramer 0001, and Luc De Raedt
- Published
- 2004
- Full Text
- View/download PDF
14. Statistical Evaluation of the Predictive Toxicology Challenge 2000-2001.
- Author
-
Hannu Toivonen, Ashwin Srinivasan 0001, Ross D. King, Stefan Kramer 0001, and Christoph Helma
- Published
- 2003
- Full Text
- View/download PDF
15. A Survey of the Predictive Toxicology Challenge 2000-2001.
- Author
-
Christoph Helma and Stefan Kramer 0001
- Published
- 2003
- Full Text
- View/download PDF
16. The Predictive Toxicology Challenge 2000-2001.
- Author
-
Christoph Helma, Ross D. King, Stefan Kramer 0001, and Ashwin Srinivasan 0001
- Published
- 2001
- Full Text
- View/download PDF
17. Collaborative development of predictive toxicology applications.
- Author
-
Barry J. Hardy, Nicki Douglas, Christoph Helma, Micha Rautenberg, Nina Jeliazkova, Vedrin Jeliazkov, Ivelina Nikolova, Romualdo Benigni, Olga Tcheremenskaia, Stefan Kramer 0001, Tobias Girschick, Fabian Buchwald, Jörg Wicker, Andreas Karwath, Martin Gütlein, Andreas Maunz, Haralambos Sarimveis, Georgia Melagraki, Antreas Afantitis, Pantelis Sopasakis, David Gallagher, Vladimir Poroikov, Dmitry Filimonov, Alexey V. Zakharov, Alexey Lagunin, Tatyana Gloriozova, Sergey Novikov, Natalia Skvortsova, Dmitry S. Druzhilovskiy, Sunil Chawla, Indira Ghosh, Surajit Ray, Hitesh Patel, and Sylvia Escher
- Published
- 2010
- Full Text
- View/download PDF
18. Nano-Lazar: Read across Predictions for Nanoparticle Toxicities with Calculated and Measured Properties
- Author
-
Micha Rautenberg, Christoph Helma, and Denis Gebele
- Subjects
Quantitative structure–activity relationship ,Mean squared error ,Nanoparticle ,02 engineering and technology ,predictive toxicology ,010501 environmental sciences ,k-nearest-neighbors ,computer.software_genre ,01 natural sciences ,k-nearest neighbors algorithm ,Nano ,Methods ,Pharmacology (medical) ,0105 earth and related environmental sciences ,Mathematics ,Pharmacology ,QSAR ,nanoparticle ,lcsh:RM1-950 ,Local regression ,Prediction interval ,toxicity ,021001 nanoscience & nanotechnology ,Random forest ,lcsh:Therapeutics. Pharmacology ,machine learning ,Data mining ,0210 nano-technology ,Biological system ,computer ,read-across - Abstract
The lazar framework for read across predictions was expanded for the prediction of nanoparticle toxicities, and a new methodology for calculating nanoparticle descriptors from core and coating structures was implemented. Nano-lazar provides a flexible and reproducible framework for downloading data and ontologies from the open eNanoMapper infrastructure, developing and validating nanoparticle read across models, open-source code and a free graphical interface for nanoparticle read-across predictions. In this study we compare different nanoparticle descriptor sets and local regression algorithms. Sixty independent crossvalidation experiments were performed for the Net Cell Association endpoint of the Protein Corona dataset. The best RMSE and r2 results originated from models with protein corona descriptors and the weighted random forest algorithm, but their 95% prediction interval is significantly less accurate than for models with simpler descriptor sets (measured and calculated nanoparticle properties). The most accurate prediction intervals were obtained with measured nanoparticle properties (no statistical significant difference (p < 0.05) of RMSE and r2 values compared to protein corona descriptors). Calculated descriptors are interesting for cheap and fast high-throughput screening purposes. RMSE and prediction intervals of random forest models are comparable to protein corona models, but r2 values are significantly lower.
- Published
- 2017
19. The ToxBank Data Warehouse: Supporting the Replacement of In Vivo Repeated Dose Systemic Toxicity Testing
- Author
-
Glyn Stacey, Rebecca Ceder, Vedrin Jeliazkov, Pekka Kohonen, Michael Rautenberg, Scott Miller, Glenn J. Myatt, Christoph Helma, Roland C. Grafström, Nina Jeliazkova, Jeff Wiseman, Lyn Healy, David Bower, Egon Willighagen, Silvia Maggioni, Michael Crump, Emilio Benfenati, Barry Hardy, Kevin P. Cross, Bioinformatica, RS: NUTRIM - R4 - Gene-environment interaction, and RS: CARIM School for Cardiovascular Diseases
- Subjects
Representational state transfer ,MECHANISM ,computer.internet_protocol ,Computer science ,Cellular models ,computer.software_genre ,Repeated-dose toxicity testing ,0302 clinical medicine ,Resource (project management) ,Wikis ,Structural Biology ,Predictive toxicology ,Drug Discovery ,0303 health sciences ,Cheminformatics ,computer.file_format ,Toxicogenomics ,Data warehouse ,3. Good health ,Computer Science Applications ,ToxBank ,CARBON-TETRACHLORIDE ,Open standard ,Data exchange ,030220 oncology & carcinogenesis ,Molecular Medicine ,Data mining ,Web service ,Semantic web ,Bioinformatics ,TOXICOLOGY ,03 medical and health sciences ,Databases ,OpenTox ,VALPROIC ACID ,Oxidation ,HYPOXIA-INDUCIBLE FACTOR-1-ALPHA ,HEPATIC TOXICITY ,RDF ,Semantic Web ,030304 developmental biology ,DRUG-INDUCED PHOSPHOLIPIDOSIS ,Redox chemistry ,SEURAT-1 ,business.industry ,Electron transport ,Organic Chemistry ,Reference compounds ,TISSUE ,CELLS ,Alternative Testing Strategies ,Gene expression ,Software engineering ,business ,INDUCED HEPATOTOXICITY ,computer - Abstract
The aim of the SEURAT-1 (Safety Evaluation Ultimately Replacing Animal Testing-1) research cluster, comprised of seven EU FP7 Health projects co-financed by Cosmetics Europe, is to generate a proof-of-concept to show how the latest technologies, systems toxicology and toxicogenomics can be combined to deliver a test replacement for repeated dose systemic toxicity testing on animals. The SEURAT-1 strategy is to adopt a mode-of-action framework to describe repeated dose toxicity, combining in vitro and in silico methods to derive predictions of in vivo toxicity responses. ToxBank is the cross-cluster infrastructure project whose activities include the development of a data warehouse to provide a web-accessible shared repository of research data and protocols, a physical compounds repository, reference or 'gold compounds' for use across the cluster (available via wiki.toxbank.net), and a reference resource for biomaterials. Core technologies used in the data warehouse include the ISA-Tab universal data exchange format, REpresentational State Transfer (REST) web services, the W3C Resource Description Framework (RDF) and the OpenTox standards. We describe the design of the data warehouse based on cluster requirements, the implementation based on open standards, and finally the underlying concepts and initial results of a data analysis utilizing public data related to the gold compounds.
- Published
- 2013
- Full Text
- View/download PDF
20. Deliverable Report D5.7 Final report on User Applications
- Author
-
Lucian Farcal, Philip Doganis, Georgia Tsiliki, Haralambos Sarimveis, Nina Jeliazkova, Egon Willighagen, Linda Rieswijk, Penny Nymark, Micha Rautenberg, Christoph Helma, Maja Brajnik, and Barry Hardy
- Abstract
Deliverable 5.7 reports on the Tasks 5.3 and 5.9 on the user applications achievements. The application infrastructure developed within eNanoMapper project aims to support the data management in the area of nanosafety research and to enable an integrated approach for the risk assessment of nanomaterials. To achieve these, eNanoMapper developed an ontology, a data infrastructure and modelling tools with applicability in risk assessment of nanomaterials. eNanoMapper developed resources and tools for predicting toxicity of nanomaterials and worked towards improving the standards in risk assessment of nanomaterials.
- Published
- 2016
- Full Text
- View/download PDF
21. Innovative Strategies to Develop Chemical Categories Using a Combination of Structural and Toxicological Properties
- Author
-
Christoph Helma, Monika Batke, Stefan Kramer, Falko Partosch, Madeleine Seeland, Martin Gütlein, Annette Bitsch, Andreas Maunz, Ursula Gundert-Remy, and Publica
- Subjects
0301 basic medicine ,Quantitative structure–activity relationship ,read across ,Predictive Clustering Tree (PCT) method ,Computer science ,010501 environmental sciences ,computer.software_genre ,600 Technik, Medizin, angewandte Wissenschaften::610 Medizin und Gesundheit ,01 natural sciences ,03 medical and health sciences ,Pharmacology (medical) ,Cluster analysis ,0105 earth and related environmental sciences ,Original Research ,Alternative methods ,Pharmacology ,toxicological and structural similarity ,business.industry ,QSAR ,lcsh:RM1-950 ,non-animal methods ,readacross ,Identification (information) ,Tree (data structure) ,030104 developmental biology ,Conceptual approach ,lcsh:Therapeutics. Pharmacology ,Knowledge base ,Data mining ,Web service ,business ,computer - Abstract
Interest is increasing in the development of non-animal methods for toxicological evaluations. These methods are however, particularly challenging for complex toxicological endpoints such as repeated dose toxicity. European Legislation, e.g., the European Union's Cosmetic Directive and REACH, demands the use of alternative methods. Frameworks, such as the Read-across Assessment Framework or the Adverse Outcome Pathway Knowledge Base, support the development of these methods. The aim of the project presented in this publication was to develop substance categories for a read-across with complex endpoints of toxicity based on existing databases. The basic conceptual approach was to combine structural similarity with shared mechanisms of action. Substances with similar chemical structure and toxicological profile form candidate categories suitable for read-across. We combined two databases on repeated dose toxicity, RepDose database, and ELINCS database to form a common database for the identification of categories. The resulting database contained physicochemical, structural, and toxicological data, which were refined and curated for cluster analyses. We applied the Predictive Clustering Tree (PCT) approach for clustering chemicals based on structural and on toxicological information to detect groups of chemicals with similar toxic profiles and pathways/mechanisms of toxicity. As many of the experimental toxicity values were not available, this data was imputed by predicting them with a multi-label classification method, prior to clustering. The clustering results were evaluated by assessing chemical and toxicological similarities with the aim of identifying clusters with a concordance between structural information and toxicity profiles/mechanisms. From these chosen clusters, seven were selected for a quantitative read-across, based on a small ratio of NOAEL of the members with the highest and the lowest NOAEL in the cluster (< 5). We discuss the limitations of the approach. Based on this analysis we propose improvements for a follow-up approach, such as incorporation of metabolic information and more detailed mechanistic information. The software enables the user to allocate a substance in a cluster and to use this information for a possible read- across. The clustering tool is provided as a free web service, accessible at http://mlc-reach.informatik.uni-mainz.de. peerReviewed
- Published
- 2016
22. Combinatorial QSAR Modeling of Human Intestinal Absorption
- Author
-
Claudia Suenderhauf, Felix Hammann, Christoph Helma, Jörg Huwyler, and Andreas Maunz
- Subjects
Quantitative structure–activity relationship ,Chemistry ,business.industry ,Quantitative Structure-Activity Relationship ,Pharmaceutical Science ,Feature selection ,Pattern recognition ,Bioinformatics ,Perceptron ,Matthews correlation coefficient ,Intestinal absorption ,Random forest ,Support vector machine ,Intestinal Absorption ,Binary classification ,Drug Discovery ,Humans ,Molecular Medicine ,Artificial intelligence ,business ,Algorithms - Abstract
Intestinal drug absorption in humans is a central topic in drug discovery. In this study, we use a broad selection of machine learning and statistical methods for the classification and numerical prediction of this key end point. Our data set is based on a selection of 458 small druglike compounds with FDA approval. Using easily available tools, we calculated one- to three-dimensional physicochemical descriptors and used various methods of feature selection (best-first backward selection, correlation analysis, and decision tree analysis). We then used decision tree induction (DTI), fragment-based lazy-learning (LAZAR), support vector machine classification, multilayer perceptrons, random forests, k-nearest neighbor and Naïve Bayes analysis to model absorption ratios and binary classification (well-absorbed and poorly absorbed compounds). Best performance for classification was seen with DTI using the chi-squared analysis interaction detector (CHAID) algorithm, yielding corrected classification rate of 88% (Matthews correlation coefficient of 75%). In numeric predictions, the multilayer perceptron performed best, achieving a root mean squared error of 25.823 and a coefficient of determination of 0.6. In line with current understanding is the importance of descriptors such as lipophilic partition coefficients (log P) and hydrogen bonding. However, we are able to highlight the utility of gravitational indices and moments of inertia, reflecting the role of structural symmetry in oral absorption. Our models are based on a diverse data set of marketed drugs representing a broad chemical space. These models therefore contribute substantially to the molecular understanding of human intestinal drug absorption and qualify for a generalized use in drug discovery and lead optimization.
- Published
- 2010
- Full Text
- View/download PDF
23. Prediction of Adverse Drug Reactions Using Decision Tree Modeling
- Author
-
Heike Gutmann, Christoph Helma, Felix Hammann, Jürgen Drewe, and N Vogt
- Subjects
Drug ,medicine.medical_specialty ,Databases, Factual ,Drug-Related Side Effects and Adverse Reactions ,media_common.quotation_subject ,Drug Evaluation, Preclinical ,Decision tree ,Pharmacology ,Decision Support Techniques ,Drug Hypersensitivity ,Small Molecule Libraries ,Pharmacotherapy ,Artificial Intelligence ,Central Nervous System Diseases ,Pharmacovigilance ,medicine ,Humans ,Computer Simulation ,Pharmacology (medical) ,Drug reaction ,Intensive care medicine ,media_common ,Drug discovery ,business.industry ,Decision Trees ,Reproducibility of Results ,Pharmaceutical Preparations ,Kidney Diseases ,Chemical and Drug Induced Liver Injury ,business ,Algorithms ,Software ,Target organ ,Decision tree model ,Forecasting - Abstract
Drug safety is of great importance to public health. The detrimental effects of drugs not only limit their application but also cause suffering in individual patients and evoke distrust of pharmacotherapy. For the purpose of identifying drugs that could be suspected of causing adverse reactions, we present a structure-activity relationship analysis of adverse drug reactions (ADRs) in the central nervous system (CNS), liver, and kidney, and also of allergic reactions, for a broad variety of drugs (n = 507) from the Swiss drug registry. Using decision tree induction, a machine learning method, we determined the chemical, physical, and structural properties of compounds that predispose them to causing ADRs. The models had high predictive accuracies (78.9-90.2%) for allergic, renal, CNS, and hepatic ADRs. We show the feasibility of predicting complex end-organ effects using simple models that involve no expensive computations and that can be used (i) in the selection of the compound during the drug discovery stage, (ii) to understand how drugs interact with the target organ systems, and (iii) for generating alerts in postmarketing drug surveillance and pharmacovigilance.
- Published
- 2010
- Full Text
- View/download PDF
24. Predictive Models for Carcinogenicity and Mutagenicity: Frameworks, State-of-the-Art, and Perspectives
- Author
-
Todd M. Martin, David M. DeMarini, Romualdo Benigni, D Kirkland, Paolo Mazzatorta, W G E J Schoonen, G Ouédraogo-Arras, Chihae Yang, R D Snyder, Ann M. Richard, Christoph Helma, Benoît Schilter, and Emilio Benfenati
- Subjects
Alternative methods ,Cancer Research ,Quantitative structure–activity relationship ,Priority setting ,Computer science ,Health, Toxicology and Mutagenesis ,In silico ,In vitro toxicology ,Quantitative Structure-Activity Relationship ,Expert Systems ,Rodentia ,Computational biology ,Pharmacology ,Models, Biological ,Risk Assessment ,Rodent carcinogenicity ,Models, Chemical ,Carcinogens ,False positive paradox ,Animals ,Humans ,Carcinogen ,Forecasting ,Mutagens - Abstract
Mutagenicity and carcinogenicity are endpoints of major environmental and regulatory concern. These endpoints are also important targets for development of alternative methods for screening and prediction due to the large number of chemicals of potential concern and the tremendous cost (in time, money, animals) of rodent carcinogenicity bioassays. Both mutagenicity and carcinogenicity involve complex, cellular processes that are only partially understood. Advances in technologies and generation of new data will permit a much deeper understanding. In silico methods for predicting mutagenicity and rodent carcinogenicity based on chemical structural features, along with current mutagenicity and carcinogenicity data sets, have performed well for local prediction (i.e., within specific chemical classes), but are less successful for global prediction (i.e., for a broad range of chemicals). The predictivity of in silico methods can be improved by improving the quality of the data base and endpoints used for modelling. In particular, in vitro assays for clastogenicity need to be improved to reduce false positives (relative to rodent carcinogenicity) and to detect compounds that do not interact directly with DNA or have epigenetic activities. New assays emerging to complement or replace some of the standard assays include Vitotox, GreenScreenGC, and RadarScreen. The needs of industry and regulators to assess thousands of compounds necessitate the development of high-throughput assays combined with innovative data-mining and in silico methods. Various initiatives in this regard have begun, including CAESAR, OSIRIS, CHEMOMENTUM, CHEMPREDICT, OpenTox, EPAA, and ToxCast. In silico methods can be used for priority setting, mechanistic studies, and to estimate potency. Ultimately, such efforts should lead to improvements in application of in silico methods for predicting carcinogenicity to assist industry and regulators and to enhance protection of public health.
- Published
- 2009
- Full Text
- View/download PDF
25. Artificial Intelligence and Data Mining for Toxicity Prediction
- Author
-
Christoph Helma and Jeroen Kazius
- Subjects
Computer science ,Drug Discovery ,Molecular Medicine ,General Medicine ,Data mining ,computer.software_genre ,computer - Published
- 2006
- Full Text
- View/download PDF
26. eNanoMapper – A database and ontology framework for design and safety assessment of nanomaterials
- Author
-
Christoph Helma, Roland Grafström, Pekka Kohonen, Egon Willighagen, Vesa Hongisto, Nikolay Kochev, Micha Rautenberg, Nina Jeliazkova, Linda Rieswijk, Charalampos Chomenidis, Bengt Fadeel, Penny Nymark, Georgios Drakakis, Haralambos Sarimveis, Gareth Owen, Denis Gebele, Barry Hardy, Georgia Tsiliki, G. Kilic, Lucian Farcal, Friederike Ehrhart, J. Chang, and Philip Doganis
- Subjects
Engineering ,business.industry ,General Medicine ,Ontology (information science) ,Toxicology ,business ,Software engineering ,nanomaterials - Abstract
eNanoMapper developed a modular infrastructure for data storage, sharing and searching, an ontologyfor the categorisation and characterisation of nanomaterials andcomputational models fornanomaterials safety assessment.
- Published
- 2016
- Full Text
- View/download PDF
27. Automated and reproducible read-across like models for predicting carcinogenic potency
- Author
-
Andreas Maunz, Elena Lo Piparo, Christoph Helma, David Vorgrimmler, and Benoît Schilter
- Subjects
Computer science ,Quantitative Structure-Activity Relationship ,Context (language use) ,Pharmacology ,Toxicology ,Machine learning ,computer.software_genre ,Models, Biological ,Risk Assessment ,Automation ,Mice ,Animals ,Carcinogenic potency ,Web site ,Alternative methods ,User Friendly ,business.industry ,Reproducibility of Results ,General Medicine ,Margin of exposure ,Rats ,Models, Animal ,Carcinogens ,Artificial intelligence ,business ,computer ,Software - Abstract
Several qualitative (hazard-based) models for chronic toxicity prediction are available through commercial and freely available software, but in the context of risk assessment a quantitative value is mandatory in order to be able to apply a Margin of Exposure (predicted toxicity/exposure estimate) approach to interpret the data. Recently quantitative models for the prediction of the carcinogenic potency have been developed, opening some hopes in this area, but this promising approach is currently limited by the fact that the proposed programs are neither publically nor commercially available. In this article we describe how two models (one for mouse and one for rat) for the carcinogenic potency (TD50) prediction have been developed, using lazar (Lazy Structure Activity Relationships), a procedure similar to read-across, but automated and reproducible. The models obtained have been compared with the recently published ones, resulting in a similar performance. Our aim is also to make the models freely available in the near future thought a user friendly internet web site.
- Published
- 2014
28. Data quality in predictive toxicology: reproducibility of rodent carcinogenicity experiments
- Author
-
Christoph Helma, Bernhard Pfahringer, Stefan Kramer, and Eva Gottmann
- Subjects
Male ,Databases, Factual ,Carcinogenicity Tests ,Health, Toxicology and Mutagenesis ,Concordance ,Predictive toxicology ,Biology ,Pharmacology ,Bioinformatics ,Risk Assessment ,Rodent carcinogenicity ,Mice ,Structure-Activity Relationship ,Species Specificity ,Predictive Value of Tests ,Neoplasms ,Animals ,Carcinogen ,Reproducibility ,Public Health, Environmental and Occupational Health ,Reproducibility of Results ,Rats ,Data quality ,Carcinogens ,Female ,Risk assessment ,Target organ ,Research Article - Abstract
We compared 121 replicate rodent carcinogenicity assays from the two parts (National Cancer Institute/National Toxicology Program and literature) of the Carcinogenic Potency Database (CPDB) to estimate the reliability of these experiments. We estimated a concordance of 57% between the overall rodent carcinogenicity classifications from both sources. This value did not improve substantially when additional biologic information (species, sex, strain, target organs) was considered. These results indicate that rodent carcinogenicity assays are much less reproducible than previously expected, an effect that should be considered in the development of structure-activity relationship models and the risk assessment process.
- Published
- 2001
- Full Text
- View/download PDF
29. Data quality in predictive toxicology: identification of chemical structures and calculation of chemical properties
- Author
-
Christoph Helma, Stefan Kramer, Bernhard Pfahringer, and Eva Gottmann
- Subjects
Databases, Factual ,Drug-Related Side Effects and Adverse Reactions ,Computer science ,Health, Toxicology and Mutagenesis ,media_common.quotation_subject ,Predictive toxicology ,Toxicology ,computer.software_genre ,Structure-Activity Relationship ,Knowledge extraction ,Humans ,Quality (business) ,Representation (mathematics) ,media_common ,Toxicity data ,Molecular Structure ,business.industry ,Public Health, Environmental and Occupational Health ,Identification (information) ,Data Interpretation, Statistical ,Data quality ,Data mining ,business ,Quality assurance ,computer ,Research Article - Abstract
Every technique for toxicity prediction and for the detection of structure-activity relationships relies on the accurate estimation and representation of chemical and toxicologic properties. In this paper we discuss the potential sources of errors associated with the identification of compounds, the representation of their structures, and the calculation of chemical descriptors. It is based on a case study where machine learning techniques were applied to data from noncongeneric compounds and a complex toxicologic end point (carcinogenicity). We propose methods applicable to the routine quality control of large chemical datasets, but our main intention is to raise awareness about this topic and to open a discussion about quality assurance in predictive toxicology. The accuracy and reproducibility of toxicity data will be reported in another paper.
- Published
- 2000
- Full Text
- View/download PDF
30. A public domain image-analysis program for the single-cell gel-electrophoresis (comet) assay
- Author
-
Maria Uhl and Christoph Helma
- Subjects
Gel electrophoresis ,Computer science ,business.industry ,Health, Toxicology and Mutagenesis ,Image processing ,DNA ,computer.software_genre ,Public domain ,Image (mathematics) ,Comet assay ,Software ,Image Processing, Computer-Assisted ,Genetics ,Operating system ,Comet (programming) ,Comet Assay ,Macro ,business ,computer ,DNA Damage - Abstract
The single-cell gel electrophoresis (or comet) assay has gained widespread acceptance as a cheap and simple genotoxicity test, but it requires a computer-assisted image-analysis system. As commercial programs are expensive and inflexible, we decided to develop an image-analysis system based on public domain programs and make it publicly available for the scientific community. Our system is based on the scientific image-processing program NIH Image, and was written in its Pascal-like macro language. User interaction was kept as simple as possible, to enable the measurement of a large number of cells with a few keystrokes. Therefore, the time for image analysis is very low, even on slow computers. The comet macro can be obtained from http://mailbox.univie.ac.at/christoph.helma++ +/comet/, NIH Image is available at http://rsb.info.nih.gov/nih-image/. Both programs are free of charge.
- Published
- 2000
- Full Text
- View/download PDF
31. Genotoxic and Ecotoxic Effects of Groundwaters and Their Relation to Routinely Measured Chemical Parameters
- Author
-
Eva Gottmann, Christian Windpassinger, Hans Steinkellner, Rolf Schulte-Hermann, Peter Eckl, Wolfgang Rodinger, Fekadu Kassie, Christoph Helma, and Siegfried Knasmüller
- Subjects
biology ,Daphnia magna ,General Chemistry ,Selenastrum ,Tradescantia ,biology.organism_classification ,medicine.disease_cause ,Environmental chemistry ,Botany ,Micronucleus test ,medicine ,Environmental Chemistry ,Bioassay ,Ecotoxicology ,Ecotoxicity ,Genotoxicity - Abstract
The primary aim of the present investigation was to study possible adverse effects of groundwater from an aquifer south of Austria's capital, Vienna, and to relate these toxicological effects to routinely measured physical/chemical parameters. Fourty-three water samples were tested for genotoxic and ecotoxic effects. For genotoxicity testing the Salmonella/microsome assay, the micronucleus test with primary rat hepatocytes and micronucleus tests with plants( Tradescantia, Vicia faba) were used. In ecotoxicity tests, algae (Selenastrum capricornutum), water cress (Lepidium sativum), and water flea (Dapnia magna) were studied as target organisms. In genotoxicity assays, 10 samples (23%) gave a weak positive response with a single end point, but only one sample (2%) was genotoxic in three different test systems. Thirty-six samples (86%) caused adverse effects in ecotoxicity assays. Plants(algae and water cress) were more sensitive than daphnie. No correlations between toxic effects and physical/chemical parameters were detected. The genotoxicity experiments indicate presently a low risk from genotoxic compounds. The ecotoxic (especially phytotoxic) properties of many water samples raise concern about their suitability for irrigation purposes. The lacking correlation between results from toxicity tests and physical/chemical data indicates that it is presently impossible to predict toxic properties from routine physical/chemical measurements with a sufficient level of safety. It is therefore important to include biological toxicity assays in groundwater monitoring programs.
- Published
- 1998
- Full Text
- View/download PDF
32. Gentoxische Substanzen in Wässern
- Author
-
Christoph Helma and Siegfried Knasmüller
- Subjects
Pollution - Abstract
Neben Abwassern aus der Industrie entstehen auch bei anderen technischen Prozessen Abwasser, die gentoxische Substanzen enthalten konnen. Dabei handelt es sich sowohl um punktformige Quellen wie z.B. dem Bergbau, Kraftwerken oder Anlagen zur Abfall- bzw. Abwasserentsorgung als auch um diffuse Emissionen aus Landwirtschaft und Verkehr. Obwohl eine Vielzahl von mutagenen Verbindungen bekannt ist, war es in konkreten Fallen noch nicht moglich, die mutagene Aktivitat von Wassern bestimmten Substanzen zuzuordnen. Bei der Chlorierung und Ozonierung im Rahmen der Trinkwasseraufbereitung konnen ebenfalls Stoffe mit mutagenem Potential entstehen. Dieses Gesundheitsrisiko fur den menschen kann derzeit noch nicht abschliesend beurteilt werden.
- Published
- 1997
- Full Text
- View/download PDF
33. Comparative Evaluation of Four Bacterial Assays for the Detection of Genotoxic Effects in the Dissolved Water Phases of Aqueous Matrices
- Author
-
Lu Wenquing, Siegfried Knasmüller, Volker Mersch-Sundermann, Christoph Helma, Virginia S. Houk, Ursula Glasbrenner, Rolf Schulte-Hermann, Carmen Klein, and Fekadu Kassie
- Subjects
Salmonella ,Chromatography ,biology ,Chemistry ,General Chemistry ,biology.organism_classification ,medicine.disease_cause ,Industrial wastewater treatment ,SOS chromotest ,Environmental chemistry ,medicine ,Microsome ,Environmental Chemistry ,Leachate ,Effluent ,Bacteria ,Genotoxicity - Abstract
The aim of this study was to evaluate the perfomance of four bacterial short-term genotoxicity assays (Salmonella/microsome assay, SOS Chromotest, Microscreen phage-induction assay, differential DNA repair test) that are widely used and/or have a promising potential for the genotoxicity testing of water samples. Twenty-three samples of different origins (drinking and bathing water, surface water, municipal and industrial wastewater, pulp mill effluents, groundwater, and landfill leachates) were tested in these assays. In total, 20 samples were genotoxic: 13 in the Salmonella/microsome assay, 13 in the SOS Chromotest, 8 in the Microscreen phage-induction assay, and 19 in the differential DNA repair test. Although the differential DNA repair test was the most sensitive system, positive results were obtained also with some of the negative control samples, and it had the least power to detect different genotoxic potencies. The Microscreen assay was the least sensitive system due to nonlinear results and samp...
- Published
- 1996
- Full Text
- View/download PDF
34. Genotoxic effects of the chlorinated hydroxyfuranones 3-chloro-4-(dichloromethyl)-5-hydroxy-2[5H]-furanone and 3,4-dichloro-5-hydroxy-2[5H]-furanone in Tradescantia micronucleus assays
- Author
-
Siegfried Knasmüller, Christoph Helma, Leif Kronberg, and Te-Hsiu Ma
- Subjects
chemistry.chemical_classification ,Micronucleus Tests ,Dose-Response Relationship, Drug ,biology ,Mutagenicity Tests ,Chemistry ,General Medicine ,Tradescantia ,Plants ,Salmonella typhi ,biology.organism_classification ,Medicinal chemistry ,Ames test ,Toxicology ,Water chlorination ,chemistry.chemical_compound ,Clastogen ,Micronucleus test ,Pollen ,Furans ,Micronucleus ,5-Hydroxy-2(5H)-furanone ,Lactone ,Mutagens - Abstract
This is the first report of clastogenic effects of chlorinated hydroxyfuranones (CHFs) in plants. Two byproducts of water chlorination, 3-chloro-4-(dichloromethyl)-5-hydroxy-2[5H]-furanone (MX) and 3,4-dichloro-5-hydroxy-2[5H]-furanone (MA) induced a dose dependent increase of micronuclei (MN) in pollen mother cells of Tradescantia when doses up to 100 micrograms MX and 500 micrograms MA were applied directly to the inflorescences. In contrast, exposure of the stems in aqueous solutions containing up to 1 mg/l MX and 10 mg/l MA did not cause a positive response.
- Published
- 1995
- Full Text
- View/download PDF
35. A Large-Scale Empirical Evaluation of Cross-Validation and External Test Set Validation in (Q)SAR
- Author
-
Andreas Karwath, Martin Gütlein, Stefan Kramer, and Christoph Helma
- Subjects
Computer science ,media_common.quotation_subject ,Organic Chemistry ,Scale (descriptive set theory) ,Variance (accounting) ,computer.software_genre ,Cross-validation ,Computer Science Applications ,Model validation ,Workflow ,Structural Biology ,Cheminformatics ,Test set ,Drug Discovery ,Molecular Medicine ,Quality (business) ,Data mining ,computer ,media_common - Abstract
(Q)SAR model validation is essential to ensure the quality of inferred models and to indicate future model predictivity on unseen compounds. Proper validation is also one of the requirements of regulatory authorities in order to accept the (Q)SAR model, and to approve its use in real world scenarios as alternative testing method. However, at the same time, the question of how to validate a (Q)SAR model, in particular whether to employ variants of cross-validation or external test set validation, is still under discussion. In this paper, we empirically compare a k-fold cross-validation with external test set validation. To this end we introduce a workflow allowing to realistically simulate the common problem setting of building predictive models for relatively small datasets. The workflow allows to apply the built and validated models on large amounts of unseen data, and to compare the performance of the different validation approaches. The experimental results indicate that cross-validation produces higher performant (Q)SAR models than external test set validation, reduces the variance of the results, while at the same time underestimates the performance on unseen compounds. The experimental results reported in this paper suggest that, contrary to current conception in the community, cross-validation may play a significant role in evaluating the predictivity of (Q)SAR models.
- Published
- 2012
36. Die Belastung von Wässern mit gentoxischen Substanzen
- Author
-
Rolf Schulte-Hermann, Christoph Helma, and Siegfried Knasmüller
- Subjects
Chemistry ,Pollution ,Molecular biology - Abstract
Die Kontamination von Gewassern mit gentoxischen Substanzen ist sowohl aus human- als auch aus okotoxikologischen Gesichtspunkten bedenklich. In epidemiologischen Studien wurden Korrelationen zwischen gentoxischen Gewasserbelastungen und kanzerogenen Effekten beim Menschen aber auch bei Fischen und Muscheln hergestellt. In dem vorliegenden Ubersichtsartikel werden biologische Methoden vorgestellt, die die Detektion gentoxischer Substanzen in Gewassern ermoglichen. Gegenuber chemisch-analytischen Methoden weisen sie den Vorteil auf, das nicht Einzelsubstanzen, sondern die Kombinationswirkung aller Schadstoffe in einem biologischen Testsystem erfast werden. Zur Prufung der Gentoxizitat kommen sowohlin vitro als auchin vivo Systeme zum Einsatz. Als Indikatororganismen dienen Bakterien, Pflanzen und Tiere, bei denen primare DNA-Schaden, Punkt-, Chromosomen- und Genommutationen als gentoxische Endpunkte nachgewiesen werden. Die Vor- und Nachteile der verschiedenen Testsysteme, ihre Einsatzmoglichkeit und die Kombination mehrerer Verfahren zu Testbatterien werden diskutiert und Methoden zur Konzentrierung von Wasserinhaltsstoffen vorgestellt.
- Published
- 1994
- Full Text
- View/download PDF
37. Enhanced clastogenicity of contaminated groundwater following UV irradiation detected by the Tradescantia micronucleus assay
- Author
-
R. Schulte-Hermann, Christoph Helma, Regina Sommer, and Siegfried Knasmüller
- Subjects
Micronucleus Tests ,biology ,Hydrocarbons, Halogenated ,Ultraviolet Rays ,Chemistry ,Radiochemistry ,Water Pollution, Radioactive ,Industrial Waste ,General Medicine ,Tradescantia ,Plants ,Contamination ,biology.organism_classification ,medicine.disease_cause ,Activated charcoal ,Tap water ,Micronucleus test ,Water Pollution, Chemical ,medicine ,Soil Pollutants ,Soil Pollutants, Radioactive ,Irradiation ,Micronucleus ,Genotoxicity ,Mutagens - Abstract
The Tradescantia micronucleus (Trad-MCN) assay was used to determine clastogenic effects of contaminated groundwater collected near a hazardous waste landfill. Water samples were taken from a purification plant (activated charcoal filtration, UV irradiation) which was built to avoid groundwater contamination by this landfill. Five series of experiments were conducted during approximately 4 months. In addition, water samples were irradiated under laboratory conditions with increasing doses of UV light. Several field water samples gave positive, dose-dependent effects before filtration and irradiation. Maximal values ( 6.1 ± 4.7 micronuclei (MCN)/100 tetrads) were six-fold above controls. UV irradiation of activated charcoal-filtered water resulted in an enhancement of MCN frequencies. Exposure of groundwater to UV irradiation in the laboratory led to a dose-dependent increase of micronuclei. At the highest dose (1500 J/2) the MCN frequency was more than six times higher than in the unirradiated sample ( 5.4 ± 1.0 vs. 0.8 ± 0.4 MCN/100 tetrads). The clastogenicity of UV-irradiated samples decreased with a half-life of approximately 1 day. Irradiation of tap water did not increase the MCN frequency. Our results indicate that irradiation of water with UV light for disinfection purposes might lead to a transiently increased genotoxicity of chemically polluted water samples.
- Published
- 1994
- Full Text
- View/download PDF
38. Collaborative development of predictive toxicology applications
- Author
-
Ivelina Nikolova, Jörg Wicker, Pantelis Sopasakis, Christoph Helma, Dmitry Filimonov, Hitesh Patel, Natalia Skvortsova, Vladimir Poroikov, D. S. Druzhilovsky, Alexey Lagunin, Vedrin Jeliazkov, Haralambos Sarimveis, Barry Hardy, Nicki Douglas, Tatyana A. Gloriozova, Sylvia Escher, Olga Tcheremenskaia, Andreas Karwath, Stefan Kramer, Fabian Buchwald, Indira Ghosh, Georgia Melagraki, Romualdo Benigni, Surajit Ray, Tobias Girschick, Andreas Maunz, Antreas Afantitis, Sergey V. Novikov, Alexey V. Zakharov, Sunil Chawla, Martin Gütlein, Micha Rautenberg, Nina Jeliazkova, David Gallagher, and Publica
- Subjects
Computer science ,Data management ,Interoperability ,Library and Information Sciences ,Ontology (information science) ,External Data Representation ,computer.software_genre ,01 natural sciences ,lcsh:Chemistry ,03 medical and health sciences ,Web application ,Physical and Theoretical Chemistry ,RDF ,030304 developmental biology ,0303 health sciences ,lcsh:T58.5-58.64 ,Application programming interface ,lcsh:Information technology ,business.industry ,computer.file_format ,Computer Graphics and Computer-Aided Design ,Data science ,0104 chemical sciences ,Computer Science Applications ,010404 medicinal & biomolecular chemistry ,lcsh:QD1-999 ,Web service ,business ,computer ,Research Article - Abstract
OpenTox provides an interoperable, standards-based Framework for the support of predictive toxicology data management, algorithms, modelling, validation and reporting. It is relevant to satisfying the chemical safety assessment requirements of the REACH legislation as it supports access to experimental data, (Quantitative) Structure-Activity Relationship models, and toxicological information through an integrating platform that adheres to regulatory requirements and OECD validation principles. Initial research defined the essential components of the Framework including the approach to data access, schema and management, use of controlled vocabularies and ontologies, architecture, web service and communications protocols, and selection and integration of algorithms for predictive modelling. OpenTox provides end-user oriented tools to non-computational specialists, risk assessors, and toxicological experts in addition to Application Programming Interfaces (APIs) for developers of new applications. OpenTox actively supports public standards for data representation, interfaces, vocabularies and ontologies, Open Source approaches to core platform components, and community-based collaboration approaches, so as to progress system interoperability goals. The OpenTox Framework includes APIs and services for compounds, datasets, features, algorithms, models, ontologies, tasks, validation, and reporting which may be combined into multiple applications satisfying a variety of different user needs. OpenTox applications are based on a set of distributed, interoperable OpenTox API-compliant REST web services. The OpenTox approach to ontology allows for efficient mapping of complementary data coming from different datasets into a unifying structure having a shared terminology and representation. Two initial OpenTox applications are presented as an illustration of the potential impact of OpenTox for high-quality and consistent structure-activity relationship modelling of REACH-relevant endpoints: ToxPredict which predicts and reports on toxicities for endpoints for an input chemical structure, and ToxCreate which builds and validates a predictive toxicity model based on an input toxicology dataset. Because of the extensible nature of the standardised Framework design, barriers of interoperability between applications and content are removed, as the user may combine data, models and validation from multiple sources in a dependable and time-effective way.
- Published
- 2010
39. Prediction of Toxic Effects of Pharmaceutical Agents
- Author
-
Christoph Helma and Andreas Maunz
- Subjects
Engineering ,business.industry ,Biochemical engineering ,Pharmacology ,business - Published
- 2009
- Full Text
- View/download PDF
40. Classification of cytochrome p(450) activities using machine learning methods
- Author
-
Felix Hammann, Ulli Baumann, Heike Gutmann, Christoph Helma, and Juergen Drewe
- Subjects
Models, Molecular ,Quantitative structure–activity relationship ,Decision tree ,Pharmaceutical Science ,Quantitative Structure-Activity Relationship ,Machine learning ,computer.software_genre ,Cytochrome P-450 Enzyme System ,Artificial Intelligence ,Cytochrome P-450 CYP1A2 ,Drug Discovery ,biology ,Artificial neural network ,business.industry ,Drug discovery ,Cytochrome P450 ,CHAID ,Random forest ,Support vector machine ,Cytochrome P-450 CYP2D6 ,biology.protein ,Molecular Medicine ,Artificial intelligence ,business ,computer ,Algorithms - Abstract
The cytochrome P(450) (CYP) system plays an integral part in the metabolism of drugs and other xenobiotics. Knowledge of the structural features required for interaction with any of the different isoforms of the CYP system is therefore immensely valuable in early drug discovery. In this paper, we focus on three major isoforms (CYP 1A2, CYP 2D6, and CYP 3A4) and present a data set of 335 structurally diverse drug compounds classified for their interaction (as substrate, inhibitor, or any interaction) with these isoforms. We also present machine learning models using a variety of commonly used methods (k-nearest neighbors, decision tree induction using the CHAID and CRT algorithms, random forests, artificial neural networks, and support vector machines using the radial basis function (RBF) and homogeneous polynomials as kernel functions). We discuss the physicochemical features relevant for each end point and compare it to similar studies. Many of these models perform exceptionally well, even with 10-fold cross-validation, yielding corrected classification rates of 81.7 to 91.9% for CYP 1A2, 89.2 to 92.9% for CYP 2D6, and 87.4 to 89.9% for CYP3A4. Our models help in understanding the structural requirements for CYP interactions and can serve as sensitive tools in virtual screenings and lead optimization for toxicological profiles in drug discovery.
- Published
- 2009
41. Large-scale graph mining using backbone refinement classes
- Author
-
Christoph Helma, Andreas Maunz, and Stefan Kramer
- Subjects
Binary tree ,Entropy (information theory) ,Tree mining ,Data mining ,Feature set ,computer.software_genre ,Algorithm ,Upper and lower bounds ,computer ,Mathematics - Abstract
We present a new approach to large-scale graph mining based on so-called backbone refinement classes. The method efficiently mines tree-shaped subgraph descriptors under minimum frequency and significance constraints, using classes of fragments to reduce feature set size and running times. The classes are defined in terms of fragments sharing a common backbone. The method is able to optimize structural inter-feature entropy as opposed to occurrences, which is characteristic for open or closed fragment mining. In the experiments, the proposed method reduces feature set sizes by >90 % and >30 % compared to complete tree mining and open tree mining, respectively. Evaluation using crossvalidation runs shows that their classification accuracy is similar to the complete set of trees but significantly better than that of open trees. Compared to open or closed fragment mining, a large part of the search space can be pruned due to an improved statistical constraint (dynamic upper bound adjustment), which is also confirmed in the experiments in lower running times compared to ordinary (static) upper bound pruning. Further analysis using large-scale datasets yields insight into important properties of the proposed descriptors, such as the dataset coverage and the class size represented by each descriptor. A final cross-validation run confirms that the novel descriptors render large training sets feasible which previously might have been intractable.
- Published
- 2009
42. Prediction of chemical toxicity with local support vector regression and activity-specific kernels
- Author
-
Andreas Maunz and Christoph Helma
- Subjects
Computer science ,Cyprinidae ,Quantitative Structure-Activity Relationship ,Bioengineering ,Feature selection ,computer.software_genre ,Machine learning ,Models, Biological ,Predictive Value of Tests ,Drug Discovery ,Feature (machine learning) ,Toxicity Tests, Acute ,Animals ,Dose-Response Relationship, Drug ,business.industry ,General Medicine ,Chemical similarity ,Support vector machine ,Lazy learning ,Kernel (statistics) ,Data Interpretation, Statistical ,Benchmark (computing) ,Molecular Medicine ,Regression Analysis ,Artificial intelligence ,Data mining ,business ,computer ,Water Pollutants, Chemical ,Applicability domain - Abstract
We propose a new kernel, based on 2-D structural chemical similarity, that integrates activity-specific information from the training data, and a new approach to applicability domain estimation that takes feature significances and activity distributions into consideration. The new kernel provides superior results than the well-established Tanimoto kernel, and activity-sensitive feature selection enhances prediction quality. Validation of local support vector regression models based on this kernel has been preformed with three publicly available datasets from the DSSTox project. One of them (Fathead Minnow Acute Toxicity) has been already modelled by other groups, and serves as a benchmark dataset, the other two (Maximum Recommended Therapeutic Dose, IRIS Lifetime Cancer Risk) have been modelled for the first time according to the knowledge of the authors. For all three models predictive accuracies increase with the prediction confidences that indicate the applicability domain. Depending on the confidence cutoff for acceptable predictions we were able to achieve90% predictions within 1 log unit of the experimental data for all datasets.
- Published
- 2008
43. The expanding role of predictive toxicology: an update on the (Q)SAR models for mutagens and carcinogens
- Author
-
Romualdo Benigni, Ann M. Richard, Tatiana I. Netzeva, Chihae Yang, Yin Tak Woo, Carol A. Marchant, Christoph Helma, Cecilia Bossa, Emilio Benfenati, Etje Hulzebos, and Rainer Franke
- Subjects
Cancer Research ,Computer science ,Mutagenicity Tests ,Health, Toxicology and Mutagenesis ,Authorization ,Quantitative Structure-Activity Relationship ,Legislation ,Predictive toxicology ,Models, Theoretical ,Toxicology ,Chemical hazard ,Risk analysis (engineering) ,Predictive Value of Tests ,Carcinogens ,Animals ,Humans ,Estimation methods ,Mutagenicity Test ,Mutagens - Abstract
Different regulatory schemes worldwide, and in particular, the preparation for the new REACH (Registration, Evaluation and Authorization of CHemicals) legislation in Europe, increase the reliance on estimation methods for predicting potential chemical hazard. To meet the increased expectations, the availability of valid (Q)SARs becomes a critical issue, especially for endpoints that have complex mechanisms of action, are time-and cost-consuming, and require a large number of animals to test. Here, findings from the survey on (Q)SARs for mutagenicity and carcinogenicity, initiated by the European Chemicals Bureau (ECB) and carried out by the Istituto Superiore di Sanita' are summarized, key aspects are discussed, and a broader view towards future needs and perspectives is given.
- Published
- 2007
44. Validation of counter propagation neural network models for predictive toxicology according to the OECD principles: a case study
- Author
-
Andrew Worth, M. Randić, Christoph Helma, Grace Patlewicz, Daniel Neagu, Paolo Mazzatorta, A. Gallegos, Manuela Pavan, Paola Gramatica, Ivanka Tsakovska, Qasim Chaudhry, Giuseppina Gini, Emilio Benfenati, Vinko Bandelj, Marjan Vračko, Mark T. D. Cronin, J. Devillers, Pierluigi Barbieri, T. I. Netzeva, Vracko, M, Bandelj, V, Barbieri, Pierluigi, Benfenati, E, Chaudhry, Q, Cronin, M, Devillers, J, Gallegos, A, Gini, G, Gramatica, P, Helma, C, Mazzatorta, P, Neagu, D, Netzeva, T, Pavan, M, Patlewicz, G, Randic, M, Tsakovska, I, and Worth, A.
- Subjects
Animal Use Alternatives ,Self-organizing map ,Counter propagation neural network ,Quantitative structure–activity relationship ,Databases, Factual ,Computer science ,Validation of QSAR models ,Cyprinidae ,Predictive Toxicology ,Quantitative Structure-Activity Relationship ,Bioengineering ,Predictive toxicology ,computer.software_genre ,Models, Biological ,Lethal Dose 50 ,Drug Discovery ,Animals ,Validation of QSAR model ,Duluth database ,Network model ,Toxicity data ,Artificial neural network ,Counter propagation ,Reproducibility of Results ,General Medicine ,Molecular Medicine ,Neural Networks, Computer ,Data mining ,computer ,Water Pollutants, Chemical - Abstract
The OECD has proposed five principles for validation of QSAR models used for regulatory purposes. Here we present a case study investigating how these principles can be applied to models based on Kohonen and counter propagation neural networks. The study is based on a counter propagation network model that has been built using toxicity data in fish fathead minnow for 541 compounds. The study demonstrates that most, if not all, of the OECD criteria may be met when modeling using this neural network approach.
- Published
- 2006
45. lazar
- Author
-
Christoph Helma
- Subjects
Computer science ,Toxicity ,Structure (category theory) ,Computational biology - Published
- 2005
- Full Text
- View/download PDF
46. A Brief Introduction to Predictive Toxicology
- Author
-
Christoph Helma
- Published
- 2005
- Full Text
- View/download PDF
47. Machine Learning and Data Mining
- Author
-
Christoph Helma and Stefan Kramer
- Subjects
Text mining ,Computer science ,business.industry ,Data stream mining ,Data mining ,Artificial intelligence ,Machine learning ,computer.software_genre ,business ,computer - Published
- 2005
- Full Text
- View/download PDF
48. Neural Networks and Kernel Machines for Vector and Structured Data . . . . . . . . . . . . . . . . . . . . . Paolo Frasconi
- Author
-
Christoph Helma
- Subjects
Artificial neural network ,business.industry ,Computer science ,Kernel (statistics) ,Pattern recognition ,Artificial intelligence ,business - Published
- 2005
- Full Text
- View/download PDF
49. Data mining and knowledge discovery in predictive toxicology
- Author
-
Christoph Helma
- Subjects
Databases, Factual ,Process (engineering) ,Computer science ,Information Storage and Retrieval ,Bioengineering ,Feature selection ,Predictive toxicology ,Prediction system ,computer.software_genre ,Toxicology ,Model validation ,Knowledge extraction ,Drug Discovery ,Feature (machine learning) ,Animals ,Humans ,Societies, Medical ,Data interpretation ,General Medicine ,Knowledge ,Data Interpretation, Statistical ,Molecular Medicine ,Data mining ,computer ,Algorithms ,Forecasting - Abstract
This article describes the knowledge discovery process in predictive toxicology. This process consists of five major steps (i) feature calculation, (ii) feature selection, (iii) model induction, (iv) model validation and (v) interpretation of predictions and models. Data mining is a part of the knowledge discovery process and consists of the application of data analysis and discovery algorithms, which can be useful in all of the above steps. A brief review of suitable algorithms and their advantages and disadvantages is given for each knowledge discovery step, followed by a more detailed description of a problem-specific implementation of the lazar prediction system.
- Published
- 2005
50. Data mining and machine learning techniques for the identification of mutagenicity inducing substructures and structure activity relationships of noncongeneric compounds
- Author
-
Luc De Raedt, Tobias Cramer, Christoph Helma, Stefan Kramer, Helma, Christoph, Cramer, Tobia, Kramer, Stefan, and De Raedt, Luc
- Subjects
Databases, Factual ,Computer science ,Information System ,system ,computer.software_genre ,Machine learning ,automated structure evaluation ,Structure-Activity Relationship ,Artificial Intelligence ,Computational Theory and Mathematic ,Mutagen ,Structure (mathematical logic) ,Interpretation (logic) ,business.industry ,Mutagenicity Tests ,Chemistry (all) ,Computer Science Applications1707 Computer Vision and Pattern Recognition ,General Chemistry ,Computer Science Applications ,Algorithm ,Support vector machine ,Identification (information) ,artificial-intelligence ,Mutagenicity Test ,Computational Theory and Mathematics ,Ab-initio Calculations ,Artificial intelligence ,Data mining ,program ,business ,computer ,carcinogenesis ,Algorithms ,Information Systems ,Mutagens - Abstract
This paper explores the utility of data mining and machine learning algorithms for the induction of mutagenicity structure-activity relationships (SARs) from noncongeneric data sets. We compare (i) a newly developed algorithm (MOLFEA) for the generation of descriptors (molecular fragments) for noncongeneric compounds with traditional SAR approaches (molecular properties) and (ii) different machine learning algorithms for the induction of SARs from these descriptors. In addition we investigate the optimal parameter settings for these programs and give an exemplary interpretation of the derived models. The predictive accuracies of models using MOLFEA derived descriptors is similar to10- 15 %age points higher than those using molecular properties alone. Using both types of descriptors together does not improve the derived models. From the applied machine learning techniques the rule learner PART and support vector machines gave the best results, although the differences between the learning algorithms are only marginal. We were able to achieve predictive accuracies up to 78% for 10-fold cross-validation. The resulting models are relatively easy to interpret and usable for predictive as well as for explanatory purposes. ispartof: Journal of chemical information and computer sciences vol:44 issue:4 pages:1402-1411 ispartof: location:United States status: published
- Published
- 2004
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.