116 results on '"Vanneschi L."'
Search Results
2. Complex Detection in Protein-Protein Interaction Networks: A Compact Overview for Researchers and Practitioners
- Author
-
Pizzuti, C., Rombo, S., Marchiori, E., Giacobini, M., Vanneschi, L., Bush, W., Giacobini, M., Vanneschi, L., Bush, W., Pizzuti, C, Rombo, SE, and Marchiori, E
- Subjects
Structure (mathematical logic) ,Computer science ,Systems biology ,Cell ,Data Science ,Nanotechnology ,Computational biology ,Protein protein interaction network ,Bioinformatics, network analysis ,medicine.anatomical_structure ,medicine ,Graph (abstract data type) ,Lecture Notes in Computer Science ,Cluster analysis ,Protein modules ,Biological network - Abstract
The availability of large volumes of protein-protein interaction data has allowed the study of biological networks to unveil the complex structure and organization in the cell. It has been recognized by biologists that proteins interacting with each other often participate in the same biological processes, and that protein modules may be often associated with specific biological functions. Thus the detection of protein complexes is an important research problem in systems biology. In this review, recent graph-based approaches to clustering protein interaction networks are described and classified with respect to common peculiarities. The goal is that of providing a useful guide and reference for both computer scientists and biologists.
- Published
- 2012
3. A Study on Gene Regulatory Network Reconstruction and Simulation
- Author
-
Farinaccio, A., Vanneschi, L., Provero, Paolo, Mauri, G., Giacobini, Mario Dante Lucio, Apolloni, B, Bassis, S, Esposito, A, Morabito, FC, Farinaccio, A, Vanneschi, L, Provero, P, Mauri, G, and Giacobini, M
- Subjects
computational biology ,evolutionary computation ,gene reconstruction ,bioinformatics ,INF/01 - INFORMATICA ,gene regulatory network ,genetic programming - Abstract
The inference of gene regulatory networks from expression data is an important area of research that provides insight to the inner workings of a biological system. In this paper we present a new reverse-engineering framework for gene regulatory network reconstruction. It exploits some interesting characteristics of genetic programming, like the ability of solving complex regression problems with little or no information about the underlying data and the ability of performing an automatic selection of features at learning time. Our framework not only reconstructs the topology of the network, but it is also able to simulate its dynamic behaviour, forecasting the gene expression values in the timepoints subsequent to the initial one. In this paper, we test the proposed framework on the well-known IRMA gene regulatory network, and we show that its performances are promising.
- Published
- 2011
4. Limiting the Number Fitness Cases in Genetic Programming Using Statistics
- Author
-
Giacobini, Mario Dante Lucio, Tomassini, M., and Vanneschi, L.
- Subjects
evolutionary algorithm ,statistics ,genetic programming - Published
- 2002
5. How Statistics can Help in Limiting the Number of Fitness Cases in Genetic Programming
- Author
-
Giacobini, Mario Dante Lucio, Tomassini, M., and Vanneschi, L.
- Subjects
evolutionary algorithm ,statistics ,genetic programming - Published
- 2002
6. A multi-dimensional genetic programming approach for multi-class classification problems
- Author
-
Ingalalli, V., Silva, S., Mauro Castelli, and Vanneschi, L.
7. Full-Reference Image Quality Expression via Genetic Programming
- Author
-
Illya Bakurov, Marco Buzzelli, Raimondo Schettini, Mauro Castelli, Leonardo Vanneschi, NOVA Information Management School (NOVA IMS), Information Management Research Center (MagIC) - NOVA Information Management School, Bakurov, I, Buzzelli, M, Schettini, R, Castelli, M, and Vanneschi, L
- Subjects
SSIM ,ssim ,Image quality ,INF/01 - INFORMATICA ,image quality ,image similarity ,genetic programming ,Computer Graphics and Computer-Aided Design ,full-reference image quality assessment ,Software - Abstract
Bakurov, I., Buzzelli, M., Schettini, R., Castelli, M., & Vanneschi, L. (2023). Full-Reference Image Quality Expression via Genetic Programming. IEEE Transactions on Image Processing, 32, 1458-1473. https://doi.org/10.1109/TIP.2023.3244662--- This work was supported by national funds through the FCT (Fundação para a Ciência e a Tecnologia) under the projects Algoritmos de Inteligência artificial no Consumo de crédito e conciliação de Endividamento (AICE) (DSAIPA/DS/0113/2019) and UIDB/04152/2020 - Centro de Investigação em Gestão de Informação (MagIC)/NOVA IMS. Mauro Castelli acknowledges the financial support from the Slovenian Research Agency (research core funding no. P5-0410). Full-reference image quality measures are a fundamental tool to approximate the human visual system in various applications for digital data management: from retrieval to compression to detection of unauthorized uses. Inspired by both the effectiveness and the simplicity of hand-crafted Structural Similarity Index Measure (SSIM), in this work, we present a framework for the formulation of SSIM-like image quality measures through genetic programming. We explore different terminal sets, defined from the building blocks of structural similarity at different levels of abstraction, and we propose a two-stage genetic optimization that exploits hoist mutation to constrain the complexity of the solutions. Our optimized measures are selected through a cross-dataset validation procedure, which results in superior performance against different versions of structural similarity, measured as correlation with human mean opinion scores. We also demonstrate how, by tuning on specific datasets, it is possible to obtain solutions that are competitive with (or even outperform) more complex image quality measures. authorsversion authorsversion published
- Published
- 2023
8. An Efficient Implementation of Flux Variability Analysis for Metabolic Networks
- Author
-
Bruno G. Galuzzi, Chiara Damiani, Claudio De Stefano, Francesco Fontanella, Leonardo Vanneschi, De Stefano, Fontanella, F, Vanneschi, L, Galuzzi, B, and Damiani, C
- Subjects
Metabolic networks, Flux Balance Analysis, Flux variability Analysis, Constraint-based modelling - Abstract
Flux Variability Analysis (FVA) is an important method to analyze the range of fluxes of a metabolic network. FVA consists in performing a large number of independent optimization problems, to obtain the maximum and minimum flux through each reaction in the network. Although several strategies to make the computation more efficient have been proposed, the computation time of an FVA can still be limiting. We present a two-step procedure to accelerate the FVA computational time that exploits the large presence within metabolic networks of sets of reactions that necessarily have an identical optimal flux value or only differ by a multiplication constant. The first step identifies such sets of reactions. The second step computes the maximum and minimum flux value for just one element of each of set, reducing the total number of optimization problems compared to the classical FVA. We show that, when applied to any metabolic network model included in the BiGG database, our FVA algorithm reduces the total number of optimization problems of about 35%, and the computation time of FVA of about 30%.
- Published
- 2023
9. Exploring the Solution Space of Cancer Evolution Inference Frameworks for Single-Cell Sequencing Data
- Author
-
Davide Maspero, Fabrizio Angaroni, Lucrezia Patruno, Daniele Ramazzotti, David Posada, Alex Graudenzi, De Stefano, C, Fontanella, F, Vanneschi, L, Maspero, D, Angaroni, F, Patruno, L, Ramazzotti, D, Posada, D, and Graudenzi, A
- Subjects
Single-cell sequencing ,Markov Chain Monte Carlo ,Cancer evolution - Abstract
In recent years, many algorithmic strategies have been developed to exploit single-cell mutational profiles generated via sequencing experiments of cancer samples and return reliable models of cancer evolution. Here, we introduce the COB-tree algorithm, which summarizes the solutions explored by state-of-the-art methods for clonal tree inference, to return a unique consensus optimum branching tree. The method proves to be highly effective in detecting pairwise temporal relations between genomic events, as demonstrated by extensive tests on simulated datasets. We also provide a new method to visualize and quantitatively inspect the solution space of the inference methods, via Principal Coordinate Analysis. Finally, the application of our method to a single-cell dataset of patient-derived melanoma xenografts shows significant differences between the COB-tree solution and the maximum likelihood ones.
- Published
- 2023
10. Genetic programming for structural similarity design at multiple spatial scales
- Author
-
Illya Bakurov, Marco Buzzelli, Mauro Castelli, Raimondo Schettini, Leonardo Vanneschi, Information Management Research Center (MagIC) - NOVA Information Management School, NOVA Information Management School (NOVA IMS), Bakurov, I, Buzzelli, M, Castelli, M, Schettini, R, and Vanneschi, L
- Subjects
Image Processing ,Genetic Programming ,Spatially-Varying Kernels ,Multi-Scale Processing ,Dilated Convolution ,Image Quality Assessment ,Spatially-Varying Kernel ,Theoretical Computer Science ,Multi-Scale Context ,Artificial Intelligence ,Structural Similarity ,Multi-Scale Structural Similarity Index ,Dilated Convolutions ,Evolutionary Computation ,Software - Abstract
Bakurov, I., Buzzelli, M., Castelli, M., Schettini, R., & Vanneschi, L. (2022). Genetic programming for structural similarity design at multiple spatial scales. In GECCO ’22. Proceedings of the 2022 Genetic and Evolutionary Computation Conference (pp. 911-919). (GECCO 2022 - The Genetic and Evolutionary Computation Conference, July 9-13, Boston, US). Association for Computing Machinery (ACM). ISBN 978-1-4503-9237-2/22/07 ---- Funding Information: FCT Portugal partially supported this work, under the grand SFRH/BD/137277/2018, and through projects BINDER (PTDC/CCI-INF/29168/2017) and AICE (DSAIPA/DS/ 0113/2019). The growing production of digital content and its dissemination across the worldwide web require eficient and precise management. In this context, image quality assessment measures (IQAMs) play a pivotal role in guiding the development of numerous image processing systems for compression, enhancement, and restoration. The structural similarity index (SSIM) is one of the most common IQAMs for estimating the similarity between a pristine reference image and its corrupted variant. The multi-scale SSIM is one of its most popular variants that allows assessing image quality at multiple spatial scales. This paper proposes a two-stage genetic programming (GP) approach to evolve novel multi-scale IQAMs, that are simultaneously more effective and efficient. We use GP to perform feature selection in the first stage, while the second stage generates the final solutions. The experimental results show that the proposed approach outperforms the existing MS-SSIM. A comprehensive analysis of the feature selection indicates that, for extracting multi-scale similarities, spatially-varying convolutions are more effective than dilated convolutions. Moreover, we provide evidence that the IQAMs learned for one database can be successfully transferred to previously unseen databases. We conclude the paper by presenting a set of evolved multi-scale IQAMs and providing their interpretation. authorsversion published
- Published
- 2022
11. Parameters optimization of the Structural Similarity Index
- Author
-
Raimondo Schettini, Leonardo Vanneschi, Illya Bakurov, Mauro Castelli, Marco Buzzelli, Bakurov, I, Buzzelli, M, Castelli, M, Schettini, R, Vanneschi, L, NOVA Information Management School (NOVA IMS), and Information Management Research Center (MagIC) - NOVA Information Management School
- Subjects
Image Quality Assessment Measure ,Index (economics) ,Structural similarity ,business.industry ,Structural Similarity ,Image Processing ,Image processing ,Pattern recognition ,Artificial intelligence ,Evolutionary Computation ,business ,Evolutionary computation ,Mathematics - Abstract
Bakurov, I., Buzzelli, M., Castelli, M., Schettini, R., & Vanneschi, L. (2020). Parameters optimization of the Structural Similarity Index. In London Imaging Meeting 2020: Future Colour Imaging (1 ed., Vol. 2020, pp. 19-23). (London Imaging Meeting). https://doi.org/10.2352/issn.2694-118X.2020.LIM-13 We exploit evolutionary computation to optimize the handcrafted Structural Similarity method (SSIM) through a datadriven approach. We estimate the best combination of luminance, contrast and structure components, as well as the sliding window size used for processing, with the objective of optimizing the similarity correlation with human-expressed mean opinion score on a standard dataset. We experimentally observe that better results can be obtained by penalizing the overall similarity only for very low levels of luminance similarity. Finally, we report a comparison of SSIM with the optimized parameters against other metrics for full reference quality assessment, showing superior performance on a different dataset. publishersversion published
- Published
- 2020
12. A data-driven approach
- Author
-
Raimondo Schettini, Marco Buzzelli, Mauro Castelli, Leonardo Vanneschi, Illya Bakurov, NOVA Information Management School (NOVA IMS), NOVA IMS Research and Development Center (MagIC), Information Management Research Center (MagIC) - NOVA Information Management School, Bakurov, I, Buzzelli, M, Schettini, R, Castelli, M, and Vanneschi, L
- Subjects
Index (economics) ,Computer science ,Structural similarity ,Image processing ,Evolutionary computation ,computer.software_genre ,Data-driven ,Scale selection ,Artificial Intelligence ,Engineering(all) ,Image quality assessment measure ,business.industry ,General Engineering ,INF/01 - INFORMATICA ,Pattern recognition ,Expert system ,Computer Science Applications ,Image quality assessment measures ,Artificial intelligence ,business ,computer - Abstract
Bakurov, I., Buzzelli, M., Schettini, R., Castelli, M., & Vanneschi, L. (2022). Structural similarity index (SSIM) revisited: A data-driven approach. Expert Systems with Applications, 189, 1-19. [116087]. [Advanced online publication on 27 October 2021]. https://doi.org/10.1016/j.eswa.2021.116087--------------Funding Information: This work was supported by national funds through the FCT (Funda??o para a Ci?ncia e a Tecnologia), Portugal by the projects GADgET (DSAIPA/DS/0022/2018), BINDER (PTDC/CCIINF/29168/2017), and AICE (DSAIPA/DS/0113/2019). Mauro Castelli acknowledges the financial support from the Slovenian Research Agency, Slovenia (research core funding no. P5-0410). Several contemporaneous image processing and computer vision systems rely upon the full-reference image quality assessment (IQA) measures. The single-scale structural similarity index (SS-SSIM) is one of the most popular measures, and it owes its success to the mathematical simplicity, low computational complexity, and implicit incorporation of Human Visual System’s (HVS) characteristics. In this paper, we revise the original parameters of SSIM and its multi-scale counterpart (MS-SSIM) to increase their correlation with subjective evaluation. More specifically, we exploit the evolutionary computation and the swarm intelligence methods on five popular IQA databases, two of which are dedicated distance-changed databases, to determine the best combination of parameters efficiently. Simultaneously, we explore the effect of different scale selection approaches in the context of SS-SSIM. The experimental results show that with a proper fine-tuning (1) the performance of SS-SSIM and MS-SSIM can be improved, in average terms, by 8% and by 3%, respectively, (2) the SS-SSIM after the so-called standard scale selection achieves similar performance as if applying computationally more expensive state-of-the-art scale selection methods or MS-SSIM; moreover, (3) there is evidence that the parameters learned on a given database can be successfully transferred to other (previously unseen) databases; finally, (4) we propose a new set of reference parameters for SSIM’s variants and provide their interpretation. authorsversion published
- Published
- 2021
13. Computational Intelligence for Life Sciences
- Author
-
Stefano Ruberto, Simone Spolaor, Leonardo Rundo, Paolo Cazzaniga, Andrea Tangherloni, Leonardo Vanneschi, Daniela Besozzi, Mauro Castelli, Luca Manzoni, Marco S. Nobile, Rundo, Leonardo [0000-0003-3341-5483], Apollo - University of Cambridge Repository, Besozzi, D, Manzoni, L, Nobile, M, Spolaor, S, Castelli, M, Vanneschi, L, Cazzaniga, P, Ruberto, S, Rundo, L, Tangherloni, A, Besozzi, Daniela, Manzoni, Luca, Nobile, Marco S., Spolaor, Simone, Castelli, Mauro, Vanneschi, Leonardo, Cazzaniga, Paolo, Ruberto, Stefano, Rundo, Leonardo, Tangherloni, Andrea, Information Systems IE&IS, NOVA Information Management School (NOVA IMS), NOVA IMS Research and Development Center (MagIC), and Information Management Research Center (MagIC) - NOVA Information Management School
- Subjects
Computational Intelligence ,Evolutionary Computation ,Genetic Algorithm ,Genetic Programming ,Haplotype Assembly ,Parameter Estimation ,Particle Swarm Optimization ,Protein Folding ,Swarm Intelligence ,Optimization problem ,Computer science ,Computational intelligence ,Genetic programming ,0102 computer and information sciences ,01 natural sciences ,Swarm intelligence ,Evolutionary computation ,Theoretical Computer Science ,SDG 3 - Good Health and Well-being ,Genetic algorithm ,Algebra and Number Theory ,Settore INF/01 - Informatica ,business.industry ,Particle swarm optimization ,Computational Theory and Mathematics ,010201 computation theory & mathematics ,Artificial intelligence ,Computational problem ,business ,Information Systems - Abstract
Besozzi, D., Manzoni, L., Nobile, M. S., Spolaor, S., Castelli, M., Vanneschi, L., ... Tangherloni, A. (2020). Computational Intelligence for Life Sciences. Fundamenta Informaticae, 171(1-4), 57-80. https://doi.org/10.3233/FI-2020-1872 Computational Intelligence (CI) is a computer science discipline encompassing the theory, design, development and application of biologically and linguistically derived computational paradigms. Traditionally, the main elements of CI are Evolutionary Computation, Swarm Intelligence, Fuzzy Logic, and Neural Networks. CI aims at proposing new algorithms able to solve complex computational problems by taking inspiration from natural phenomena. In an intriguing turn of events, these nature-inspired methods have been widely adopted to investigate a plethora of problems related to nature itself. In this paper we present a variety of CI methods applied to three problems in life sciences, highlighting their effectiveness: we describe how protein folding can be faced by exploiting Genetic Programming, the inference of haplotypes can be tackled using Genetic Algorithms, and the estimation of biochemical kinetic parameters can be performed by means of Swarm Intelligence. We show that CI methods can generate very high quality solutions, providing a sound methodology to solve complex optimization problems in life sciences. authorsversion published
- Published
- 2020
14. A distance between populations for n-points crossover in genetic algorithms
- Author
-
Gianpiero Cattaneo, Mauro Castelli, Leonardo Vanneschi, Luca Manzoni, Castelli, M, Cattaneo, G, Manzoni, L, Vanneschi, L, Castelli, Mauro, Cattaneo, Gianpiero, Manzoni, Luca, and Vanneschi, Leonardo
- Subjects
Condensed Matter::Quantum Gases ,FOS: Computer and information sciences ,education.field_of_study ,General Computer Science ,Computer science ,General Mathematics ,Genetic algorithms ,population distance ,Crossover ,Population ,Computer Science (all) ,Computer Science - Neural and Evolutionary Computing ,Field (mathematics) ,Distribution (mathematics) ,Genetic algorithm ,Applied mathematics ,Mathematics (all) ,Neural and Evolutionary Computing (cs.NE) ,education ,Time complexity - Abstract
The theoretical study of Genetic Algorithms and the dynamics induced by their genetic operators is a research field with a long history and many different approaches. In this paper we complete a recently presented approach to model one-point crossover using pretopologies (or Cechtopologies) in two ways. First, we extend it to the case of n-points crossover. We extend the definition of crossover distance between populations to work for n-points crossover, proving that computing it can be performed in polynomial time for any fixed number of crossover points. Secondly, we experimentally study how the distance distribution changes when the number of crossover points increases. In particular, the average distance between a population and the optimum decreases with the increase in the number of crossover points, showing that increasing the latter can reduce the number of crossover operations needed to evolve an optimal solution.
- Published
- 2019
15. The influence of population size in geometric semantic GP
- Author
-
Mauro Castelli, Leonardo Vanneschi, Sara Silva, Aleš Popovič, Luca Manzoni, Castelli, Mauro, Manzoni, Luca, Silva, Sara, Vanneschi, Leonardo, Popovic, Ales, Castelli, M, Manzoni, L, Silva, S, Vanneschi, L, and Popovič, A
- Subjects
General Computer Science ,Generalization ,General Mathematics ,media_common.quotation_subject ,Genetic programming ,0102 computer and information sciences ,02 engineering and technology ,Machine learning ,computer.software_genre ,Semantics ,01 natural sciences ,Statistics ,0202 electrical engineering, electronic engineering, information engineering ,Mathematics (all) ,Quality (business) ,Mathematics ,media_common ,business.industry ,Population size ,Small number ,Computer Science (all) ,Small population size ,010201 computation theory & mathematics ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Symbolic regression ,Semantic ,computer - Abstract
In this work, we study the influence of the population size on the learning ability of Geometric Semantic Genetic Programming for the task of symbolic regression. A large set of experiments, considering different population size values on different regression problems, has been performed. Results show that, on real-life problems, having small populations results in a better training fitness with respect to the use of large populations after the same number of fitness evaluations. However, performance on the test instances varies among the different problems: in datasets with a high number of features, models obtained with large populations present a better performance on unseen data, while in datasets characterized by a relative small number of variables a better generalization ability is achieved by using small population size values. When synthetic problems are taken into account, large population size values represent the best option for achieving good quality solutions on both training and test instances.
- Published
- 2017
16. A survey of semantic methods in genetic programming
- Author
-
Leonardo Vanneschi, Mauro Castelli, Sara Silva, Vanneschi, L, Castelli, M, and Silva, S
- Subjects
education.field_of_study ,Information retrieval ,Theoretical computer science ,Computer science ,Fitness landscape ,Locality ,Population ,Genetic programming ,Semantics ,Formal methods ,Computer Science Applications ,Theoretical Computer Science ,Genotype/phenotype ,Hardware and Architecture ,Semantic computing ,Genetic representation ,Survey ,education ,Semantic ,Software - Abstract
Several methods to incorporate semantic awareness in genetic programming have been proposed in the last few years. These methods cover fundamental parts of the evolutionary process: from the population initialization, through different ways of modifying or extending the existing genetic operators, to formal methods, until the definition of completely new genetic operators. The objectives are also distinct: from the maintenance of semantic diversity to the study of semantic locality; from the use of semantics for constructing solutions which obey certain constraints to the exploitation of the geometry of the semantic topological space aimed at defining easy-to-search fitness landscapes. All these approaches have shown, in different ways and amounts, that incorporating semantic awareness may help improving the power of genetic programming. This survey analyzes and discusses the state of the art in the field, organizing the existing methods into different categories. It restricts itself to studies where semantics is intended as the set of output values of a program on the training data, a definition that is common to a rather large set of recent contributions. It does not discuss methods for incorporating semantic information into grammar-based genetic programming or approaches based on formal methods. The objective is keeping the community updated on this interesting research track, hoping to motivate new and stimulating contributions. © 2014 Springer Science+Business Media New York.
- Published
- 2014
17. Semantic Search-Based Genetic Programming and the Effect of Intron Deletion
- Author
-
Mauro Castelli, Leonardo Vanneschi, Sara Silva, Castelli, M, Vanneschi, L, and Silva, S
- Subjects
Theoretical computer science ,intron ,Generalization ,Computer science ,Semantics (computer science) ,Genetic programming ,Machine learning ,computer.software_genre ,Set (abstract data type) ,genetic programming (GP) ,Genetic algorithm ,Code (cryptography) ,Electrical and Electronic Engineering ,semantics ,Training set ,business.industry ,Intron ,Semantic search ,Computer Science Applications ,Human-Computer Interaction ,Control and Systems Engineering ,Artificial intelligence ,business ,computer ,Software ,Information Systems - Abstract
SFX(opens in a new window)|View at Publisher| Export | Download | More... IEEE Transactions on Cybernetics Volume 44, Issue 1, January 2014, Article number 6476653, Pages 103-113 Semantic search-based genetic programming and the effect of intron deletion (Article) Castelli, M.a , Vanneschi, L.a , Silva, S.a , Agapitos, A.b , Oneill, M.b a Instituto de Engenharia de Sistemas e Computadores-Investigacao e Desenvolvimento em Lisboa, IST/UTL, Lisboa, Portugal b Natural Computing Research and Applications Group, University College Dublin, IRL, Ireland View references (30) Abstract The concept of semantics (in the sense of input-output behavior of solutions on training data) has been the subject of a noteworthy interest in the genetic programming (GP) research community over the past few years. In this paper, we present a new GP system that uses the concept of semantics to improve search effectiveness. It maintains a distribution of different semantic behaviors and biases the search toward solutions that have similar semantics to the best solutions that have been found so far. We present experimental evidence of the fact that the new semantics-based GP system outperforms the standard GP and the well-known bacterial GP on a set of test functions, showing particularly interesting results for noncontinuous (i.e., generally harder to optimize) test functions. We also observe that the solutions generated by the proposed GP system often have a larger size than the ones returned by standard GP and bacterial GP and contain an elevated number of introns, i.e., parts of code that do not have any effect on the semantics. Nevertheless, we show that the deletion of introns during the evolution does not affect the performance of the proposed method. © 2013 IEEE.
- Published
- 2014
18. Prediction of high performance concrete strength using Genetic Programming with geometric semantic genetic operators
- Author
-
Leonardo Vanneschi, Mauro Castelli, Sara Silva, Castelli, M, Vanneschi, L, and Silva, S
- Subjects
Cement ,Aggregate (composite) ,Computer science ,Semantics (computer science) ,business.industry ,General Engineering ,Superplasticizer ,Genetic programming ,Machine learning ,computer.software_genre ,Computer Science Applications ,Composite construction ,Artificial Intelligence ,Ground granulated blast-furnace slag ,Fly ash ,genetic programming ,Cementitious ,Artificial intelligence ,business ,computer - Abstract
Concrete is a composite construction material made primarily with aggregate, cement, and water. In addition to the basic ingredients used in conventional concrete, high-performance concrete incorporates supplementary cementitious materials, such as fly ash and blast furnace slag, and chemical admixture, such as superplasticizer. Hence, high-performance concrete is a highly complex material and modeling its behavior represents a difficult task. In this paper, we propose an intelligent system based on Genetic Programming for the prediction of high-performance concrete strength. The system we propose is called Geometric Semantic Genetic Programming, and it is based on recently defined geometric semantic genetic operators for Genetic Programming. Experimental results show the suitability of the proposed system for the prediction of concrete strength. In particular, the new method provides significantly better results than the ones produced by standard Genetic Programming and other machine learning methods, both on training and on out-of-sample data. © 2013 Elsevier Ltd. All rights reserved.
- Published
- 2013
19. Self-tuning geometric semantic Genetic Programming
- Author
-
Sara Silva, Aleš Popovič, Leonardo Vanneschi, Mauro Castelli, Luca Manzoni, Castelli, M, Manzoni, L, Vanneschi, L, Silva, S, Popovič, A, Castelli, Mauro, Manzoni, Luca, Vanneschi, Leonardo, Silva, Sara, and Popovic, Ales
- Subjects
Genetic Programming ,Parameters Tuning ,Semantics ,Theoretical computer science ,Computer science ,Crossover ,Evolutionary algorithm ,Genetic programming ,0102 computer and information sciences ,02 engineering and technology ,Genetic operator ,01 natural sciences ,Theoretical Computer Science ,Chromosome (genetic algorithm) ,Genetic algorithm ,0202 electrical engineering, electronic engineering, information engineering ,Computer Science Applications1707 Computer Vision and Pattern Recognition ,Computer Science Applications ,010201 computation theory & mathematics ,Hardware and Architecture ,Mutation (genetic algorithm) ,020201 artificial intelligence & image processing ,Genetic representation ,Semantic ,Software - Abstract
The process of tuning the parameters that characterize evolutionary algorithms is difficult and can be time consuming. This paper presents a self-tuning algorithm for dynamically updating the crossover and mutation probabilities during a run of genetic programming. The genetic operators that are considered in this work are the geometric semantic genetic operators introduced by Moraglio et al. Differently from other existing self-tuning algorithms, the proposed one works by assigning a (different) crossover and mutation probability to each individual of the population. The experimental results we present show the appropriateness of the proposed self-tuning algorithm: on seven different test problems, the proposed algorithm finds solutions of a quality that is better than, or comparable to, the one achieved using the best known values for the geometric semantic crossover and mutation rates for the same problems. Also, we study how the mutation and crossover probabilities change during the execution of the proposed self-tuning algorithm, pointing out an interesting insight: mutation is basically the only operator used in the exploration phase, while crossover is used for exploitation, further improving good quality solutions.
- Published
- 2016
20. Semantic genetic programming for fast and accurate data knowledge discovery
- Author
-
Luca Manzoni, Leonardo Vanneschi, Mauro Castelli, Aleš Popovič, Castelli, M, Vanneschi, L, Manzoni, L, Popovič, A, Castelli, Mauro, Vanneschi, Leonardo, Manzoni, Luca, and Popovic, Ales
- Subjects
General Computer Science ,Computer science ,Semantics (computer science) ,General Mathematics ,Big data ,Genetic programming ,0102 computer and information sciences ,02 engineering and technology ,Machine learning ,computer.software_genre ,01 natural sciences ,Knowledge discovery ,Knowledge extraction ,0202 electrical engineering, electronic engineering, information engineering ,Mathematics (all) ,business.industry ,Supervised learning ,Computer Science (all) ,Semantics ,Inductive programming ,010201 computation theory & mathematics ,Programming paradigm ,020201 artificial intelligence & image processing ,Artificial intelligence ,Genetic representation ,business ,computer ,Semantic - Abstract
Big data knowledge discovery emerged as an important factor contributing to advancements in society at large. Still, researchers continuously seek to advance existing methods and provide novel ones for analysing vast data sets to make sense of the data, extract useful information, and build knowledge to inform decision making. In the last few years, a very promising variant of genetic programming was proposed: geometric semantic genetic programming. Its difference with the standard version of genetic programming consists in the fact that it uses new genetic operators, called geometric semantic operators, that, acting directly on the semantics of the candidate solutions, induce by definition a unimodal error surface on any supervised learning problem, independently from the complexity and size of the underlying data set. This property should improve the evolvability of genetic programming in presence of big data and thus makes geometric semantic genetic programming an extremely promising method for mining vast amounts of data. Nevertheless, to the best of our knowledge, no contribution has appeared so far to employ this new technology to big data problems. This paper intends to fill this gap. For the first time, in fact, we show the effectiveness of geometric semantic genetic programming on several complex real-life problems, characterized by vast amounts of data, coming from several different application domains.
- Published
- 2016
21. A distance between populations for one-point crossover in genetic algorithms
- Author
-
Luca Manzoni, Giancarlo Mauri, Leonardo Vanneschi, NOVA Information Management School (NOVA IMS), Manzoni, Luca, Vanneschi, Leonardo, Mauri, Giancarlo, Manzoni, L, Vanneschi, L, and Mauri, G
- Subjects
education.field_of_study ,General Computer Science ,Genetic algorithms, Crossover ,Carry (arithmetic) ,Crossover ,Population ,Process (computing) ,INF/01 - INFORMATICA ,Function (mathematics) ,genetic algorithms ,Theoretical Computer Science ,Transformation (function) ,Operator (computer programming) ,Point (geometry) ,education ,Algorithm ,Computer Science(all) ,Mathematics - Abstract
Manzoni, L., Vanneschi, L., & Mauri, G. (2012). A distance between populations for one-point crossover in genetic algorithms. Theoretical Computer Science, 429, 213-221. https://doi.org/10.1016/j.tcs.2011.12.041 Genetic algorithms use transformation operators on the genotypic structures of the individuals to carry out a search. These operators define a neighborhood. To analyze various dynamics of the search process, it is often useful to define a distance in this space. In fact, using an operator-based distance can make the analysis more accurate and reliable than using distances which have no relationship with the genetic operators. In this paper we define a distance which is based on the standard one-point crossover. Given that the population strongly affects the neighborhood induced by the crossover, we first define a crossover-based distance between populations. Successively, we show that it is naturally possible to derive from this function a family of distances between individuals. Finally, we also introduce an algorithm to compute this distance efficiently. published
- Published
- 2012
22. Operator equalisation for bloat free genetic programming and a survey of bloat control methods
- Author
-
Sara Silva, Leonardo Vanneschi, Stephen Dignum, Silva, S, Dignum, S, and Vanneschi, L
- Subjects
Theoretical computer science ,business.industry ,Computer science ,Crossover ,Genetic programming ,Overfitting ,Computer Science Applications ,Theoretical Computer Science ,Operator (computer programming) ,Hardware and Architecture ,Taxonomy (general) ,genetic programming ,Artificial intelligence ,business ,Control (linguistics) ,Evolutionary dynamics ,Implementation ,Software - Abstract
Bloat can be defined as an excess of code growth without a corresponding improvement in fitness. This problem has been one of the most intensively studied subjects since the beginnings of Genetic Programming. This paper begins by briefly reviewing the theories explaining bloat, and presenting a comprehensive survey and taxonomy of many of the bloat control methods published in the literature through the years. Particular attention is then given to the new Crossover Bias theory and the bloat control method it inspired, Operator Equalisation (OpEq). Two implementations of OpEq are described in detail. The results presented clearly show that Genetic Programming using OpEq is essentially bloat free. We discuss the advantages and shortcomings of each different implementation, and the unexpected effect of OpEq on overfitting. We observe the evolutionary dynamics of OpEq and address its potential to be extended and integrated into different elements of the evolutionary process. © Springer Science+Business Media, LLC 2011.
- Published
- 2011
23. A Comparative Study of Four Parallel and Distributed PSO Methods
- Author
-
Daniele Codecasa, Leonardo Vanneschi, Giancarlo Mauri, Vanneschi, L, Codecasa, D, and Mauri, G
- Subjects
Computer Networks and Communications ,Computer science ,Generalization ,ComputingMethodologies_MISCELLANEOUS ,Computer Science::Neural and Evolutionary Computation ,MathematicsofComputing_NUMERICALANALYSIS ,INF/01 - INFORMATICA ,Particle swarm optimization ,Swarm behaviour ,Swarm intelligence ,Theoretical Computer Science ,Set (abstract data type) ,Hardware and Architecture ,Component (UML) ,Genetic algorithm ,Multi-swarm optimization ,Optimization, Swarm Intelligence, Parallel and Distributed Algorithms ,Algorithm ,Software - Abstract
We present four new parallel and distributed particle swarm optimization methods consisting in a genetic algorithm whose individuals are co-evolving swarms, an “island model”-based multi-swarm system, where swarms are independent and interact by means of particle migrations at regular time steps, and their respective variants enriched by adding a repulsive component to the particles. We study the proposed methods on a wide set of problems including theoretically hand-tailored benchmarks and complex real-life applications from the field of drug discovery, with a particular focus on the generalization ability of the obtained solutions. We show that the proposed repulsive multi-swarm system has a better optimization ability than all the other presented methods on all the studied problems. Interestingly, the proposed repulsive multi-swarm system is also the one that returns the most general solutions.
- Published
- 2011
24. Hot topics in Evolutionary Computation
- Author
-
Leonardo Vanneschi, Stefano Cagnoni, Luca Mussi, Vanneschi, L, Mussi, L, and Cagnoni, S
- Subjects
Theoretical computer science ,business.industry ,Computer science ,Evolutionary algorithm ,Complex system ,Swarm intelligence ,Evolutionary computation ,Java Evolutionary Computation Toolkit ,Human-based evolutionary computation ,evolutionary computation ,Artificial Intelligence ,Artificial intelligence ,Graphics ,business ,Evolutionary programming - Abstract
We introduce the special issue on Evolutionary Computation (EC) reporting a non-exhaustive list of topics which have recently attracted much interest from the EC community, with particular regard to the ones dealt with by the papers included in this issue: EC research, hybrid neuro-evolutionary systems and synergies between EC and complex systems. In addition, we introduce a more technological emerging topic: the parallel implementation of evolutionary and Swarm Intelligence algorithms on graphics processor units (GPUs), by which new applications of evolutionary algorithms have been made possible, even in real-time environments.
- Published
- 2011
25. Genetic programming for QSAR investigation of docking energy
- Author
-
Ilaria Giordani, Francesco Archetti, Leonardo Vanneschi, Archetti, F, Giordani, I, and Vanneschi, L
- Subjects
Drug ,Quantitative structure–activity relationship ,Computer science ,Pharmacological research ,media_common.quotation_subject ,Genetic Programming ,Genistein ,Genetic programming ,Machine learning ,computer.software_genre ,Drug design ,Computational biology ,chemistry.chemical_compound ,Pharmacokinetics ,Molecular descriptor ,Molecule ,Binding affinities ,ADME ,media_common ,QSAR ,business.industry ,Drug discovery ,INF/01 - INFORMATICA ,Regression ,Acute toxicity ,Bioavailability ,Docking energy ,chemistry ,Drug development ,Toxicity ,Artificial intelligence ,business ,computer ,Software - Abstract
Statistical methods, and in particular Machine Learning, have been increasingly used in the drug development workflow to accelerate the discovery phase and to eliminate possible failures early during clinical developments. In the past, the authors of this paper have been working specifically on two problems: (i) prediction of drug induced toxicity and (ii) evaluation of the target-drug chemical interaction based on chemical descriptors. Among the numerous existing Machine Learning methods and their application to drug development ( see for instance [ F. Yoshida, J.G. Topliss, QSAR model for drug human oral bioavailability, Journal of Medicinal Chemistry 43 (2000) 2575-2585; Frohlich, J. Wegner, F. Sieker, A. Zell, Kernel functions for attributed molecular graphs-a new similarity based approach to ADME prediction in classification and regression, QSAR and Combinatorial Science, 38( 4) ( 2003) 427431; C. W. Andrews, L. Bennett, L. X. Yu, Predicting human oral bioavailability of a compound: development of a novel quantitative structure-bioavailability relationship, Pharmacological Research 17 ( 2000) 639-644; J Feng, L. Lurati, H. Ouyang, T. Robinson, Y. Wang, S. Yuan, S. S. Young, Predictive toxicology: benchmarking molecular descriptors and statistical methods, Journal of Chemical Information Computer Science 43 ( 2003) 1463-1470; T. M. Martin, D. M. Young, Prediction of the acute toxicity (96-h LC50) of organic compounds to the fat head minnow (Pimephales promelas) using a group contribution method, Chemical Research in Toxicology 14( 10) ( 2001) 1378-1385; G. Colmenarejo, A. Alvarez-Pedraglio, J. L. Lavandera, Chemoinformatic models to predict binding affinities to human serum albumin, Journal of Medicinal Chemistry 44 ( 2001) 4370-4378; J. Zupan, P. Gasteiger, Neural Networks in Chemistry and Drug Design: An Introduction, 2nd edition, Wiley, 1999]), we have been specifically concerned with Genetic Programming. A first paper [F. Archetti, E. Messina, S. Lanzeni, L. Vanneschi, Genetic programming for computational pharmacokinetics in drug discovery and development, Genetic Programming and Evolvable Machines 8( 4) ( 2007) 17-26] has been devoted to problem ( i). The present contribution aims at developing a Genetic Programming based framework on which to build specific strategies which are then shown to be a valuable tool for problem ( ii). In this paper, we use target estrogen receptor molecules and genistein based drug compounds. Being able to precisely and efficiently predict their mutual interaction energy is a very important task: for example, it may have an immediate relationship with the efficacy of genistein based drugs in menopause therapy and also as a natural prevention of some tumors. We compare the experimental results obtained by Genetic Programming with the ones of a set of "non-evolutionary'' Machine Learning methods, including Support Vector Machines, Artificial Neural Networks, Linear and Least Square Regression. Experimental results confirm that Genetic Programming is a promising technique from the viewpoint of the accuracy of the proposed solutions, of the generalization ability and of the correlation between predicted data and correct ones. (C) 2009 Elsevier B. V. All rights reserved.
- Published
- 2010
26. Fitness landscape of the cellular automata majority problem: View from the 'Olympus'
- Author
-
Leonardo Vanneschi, Philippe Collard, Marco Tomassini, Sébastien Verel, Laboratoire d'Informatique, Signaux, et Systèmes de Sophia-Antipolis (I3S) / Groupe SCOBI, Modèles Discrets pour les Systèmes Complexes (Laboratoire I3S - MDSC), Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S), Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA)-Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA)-Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA), Institut des systèmes d'information (ISI), Université de Lausanne (UNIL), Dipartimento di Informatica Sistemistica e Comunicazione (DISCo), Università degli Studi di Milano-Bicocca [Milano] (UNIMIB), Verel, S, Collard, P, Tomassini, M, and Vanneschi, L
- Subjects
FOS: Computer and information sciences ,Theoretical computer science ,General Computer Science ,Computer science ,Fitness landscape ,Computer Science - Artificial Intelligence ,correlation analysis ,fitness landscapes ,neutrality ,02 engineering and technology ,fitness, landscape, cellular, automata, majority, problem, view, olympus ,01 natural sciences ,Evolutionary computation ,Task (project management) ,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] ,Theoretical Computer Science ,Local optimum ,AR models ,0103 physical sciences ,Genetic algorithm ,0202 electrical engineering, electronic engineering, information engineering ,genetic algorithm ,010306 general physics ,business.industry ,cellular automata ,Majority problem ,Cellular automaton ,Artificial Intelligence (cs.AI) ,evolutionary computation ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Heuristics ,Computer Science(all) - Abstract
In this paper we study cellular automata (CAs) that perform the computational Majority task. This task is a good example of what the phenomenon of emergence in complex systems is. We take an interest in the reasons that make this particular fitness landscape a difficult one. The first goal is to study the landscape as such, and thus it is ideally independent from the actual heuristics used to search the space. However, a second goal is to understand the features a good search technique for this particular problem space should possess. We statistically quantify in various ways the degree of difficulty of searching this landscape. Due to neutrality, investigations based on sampling techniques on the whole landscape are difficult to conduct. So, we go exploring the landscape from the top. Although it has been proved that no CA can perform the task perfectly, several efficient CAs for this task have been found. Exploiting similarities between these CAs and symmetries in the landscape, we define the Olympus landscape which is regarded as the "heavenly home" of the best local optima known (blok). Then we measure several properties of this subspace. Although it is easier to find relevant CAs in this subspace than in the overall landscape, there are structural reasons that prevent a searcher from finding overfitted CAs in the Olympus. Finally, we study dynamics and performance of genetic algorithms on the Olympus in order to confirm our analysis and to find efficient CAs for the Majority problem with low computational cost. © 2007 Elsevier Ltd. All rights reserved.
- Published
- 2007
- Full Text
- View/download PDF
27. A C++ framework for geometric semantic genetic programming
- Author
-
Mauro Castelli, Sara Silva, Leonardo Vanneschi, Castelli, M, Silva, S, and Vanneschi, L
- Subjects
Theoretical computer science ,Source code ,Property (programming) ,Computer science ,Semantics (computer science) ,media_common.quotation_subject ,Supervised learning ,Genetic programming ,Geometric operator ,computer.software_genre ,Regression ,Computer Science Applications ,Theoretical Computer Science ,Set (abstract data type) ,Documentation ,Hardware and Architecture ,genetic programming ,Data mining ,computer ,Semantic ,Software ,C++ ,media_common - Abstract
Geometric semantic operators are new and promising genetic operators for genetic programming. They have the property of inducing a unimodal error surface for any supervised learning problem, i.e., any problem consisting in finding the match between a set of input data and known target values (like regression and classification). Thanks to an efficient implementation of these operators, it was possible to apply them to a set of real-life problems, obtaining very encouraging results. We have now made this implementation publicly available as open source software, and here we describe how to use it. We also reveal details of the implementation and perform an investigation of its efficiency in terms of running time and memory occupation, both theoretically and experimentally. The source code and documentation are available for download at http://gsgp.sourceforge.net .
- Published
- 2015
28. Electricity Demand Modelling with Genetic Programming
- Author
-
Matteo De Felice, Leonardo Vanneschi, Luca Manzoni, Mauro Castelli, Castelli, M, De Felice, M, Manzoni, L, Vanneschi, L, Francisco Pereira, Penousal Machado, Ernesto Costa, Amílcar Cardoso, Castelli, Mauro, De Felice, Matteo, Manzoni, Luca, and Vanneschi, Leonardo
- Subjects
electricity demand modelling ,Operations research ,Artificial neural network ,business.industry ,Computer science ,Computer Science (all) ,Genetic programming ,evolutionary computation ,Grid ,Industrial engineering ,Evolutionary computation ,Theoretical Computer Science ,Task (project management) ,Electric power system ,Power system simulation ,Air conditioning ,business - Abstract
Load forecasting is a critical task for all the operations of power systems. Especially during hot seasons, the influence of weather on energy demand may be strong, principally due to the use of air conditioning and refrigeration. This paper investigates the application of Genetic Programming on day-ahead load forecasting, comparing it with Neural Networks, Neural Networks Ensembles and Model Trees. All the experimentations have been performed on real data collected from the Italian electric grid during the summer period. Results show the suitability of Genetic Programming in providing good solutions to this problem. The advantage of using Genetic Programming, with respect to the other methods, is its ability to produce solutions that explain data in an intuitively meaningful way and that could be easily interpreted by a human being. This fact allows the practitioner to gain a better understanding of the problem under exam and to analyze the interactions between the features that characterize it.
- Published
- 2015
29. A Study of Fitness Distance Correlation as a Difficulty Measure in Genetic Programming
- Author
-
Manuel Clergue, Marco Tomassini, Leonardo Vanneschi, Philippe Collard, Information Systems Department, Université de Lausanne (UNIL), Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S), Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA), Tomassini, M, Vanneschi, L, Collard, P, and Clergue, M
- Subjects
Fitness landscape ,Genetic programming ,0102 computer and information sciences ,02 engineering and technology ,Machine learning ,computer.software_genre ,Models, Biological ,01 natural sciences ,Measure (mathematics) ,[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] ,Correlation ,0202 electrical engineering, electronic engineering, information engineering ,Animals ,Humans ,Computer Simulation ,Selection, Genetic ,Mathematical Computing ,Statistic ,Mathematics ,Models, Statistical ,Fitness function ,Models, Genetic ,Fitness approximation ,business.industry ,Computational Biology ,Biological Evolution ,study, fitness, distance, correlation, difficulty, measure, genetic, programming ,Distance correlation ,Computational Mathematics ,Genetics, Population ,010201 computation theory & mathematics ,Mutation ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer ,Algorithms ,Software - Abstract
International audience; We present an approach to genetic programming difficulty based on a statistical study of program fitness landscapes. The fitness distance correlation is used as an indicator of problem hardness and we empirically show that such a statistic is adequate in nearly all cases studied here. However, fitness distance correlation has some known problems and these are investigated by constructing an artificial landscape for which the correlation gives contradictory indications. Although our results confirm the usefulness of fitness distance correlation, we point out its shortcomings and give some hints for improvement in assessing problem hardness in genetic programming.
- Published
- 2005
30. Prediction of the Unified Parkinson’s Disease Rating Scale Assessment using a Genetic Programming System with Geometric Semantic Genetic Operators
- Author
-
Leonardo Vanneschi, Sara Silva, Mauro Castelli, Castelli, M, Vanneschi, L, and Silva, S
- Subjects
Genetic programming, Geometric operators, Semantics, Unified Parkinson's Disease Rating Scale ,Computer science ,business.industry ,Process (engineering) ,Scale (chemistry) ,General Engineering ,Genetic programming ,Unified Parkinson's disease rating scale ,Disease ,Machine learning ,computer.software_genre ,Computer Science Applications ,Artificial Intelligence ,Rating scale ,State (computer science) ,Artificial intelligence ,business ,computer ,Test data - Abstract
Unified Parkinson’s Disease Rating Scale (UPDRS) assessment is the most used scale for tracking Parkinson’s disease symptom progression. Nowadays, the tracking process requires a patient to undergo invasive and time-consuming specialized examinations in hospital clinics, under the supervision of trained medical staff. Thus, the process is costly and logistically inconvenient for both patients and clinicians. For this reason, new powerful computational tools, aimed at making the process more automatic, cheaper and less invasive, are becoming more and more a necessity. The purpose of this paper is to investigate the use of an innovative intelligent system based on genetic programming for the prediction of UPDRS assessment, using only data derived from simple, self-administered and non-invasive speech tests. The system we propose is called geometric semantic genetic programming and it is based on recently defined geometric semantic genetic operators. Experimental results, achieved using the largest database of Parkinson’s disease speech in existence (approximately 6000 recordings from 42 Parkinson’s disease patients, recruited in a six-month, multi-centre trial), show the appropriateness of the proposed system for the prediction of UPDRS assessment. In particular, the results obtained with geometric semantic genetic programming are significantly better than the ones produced by standard genetic programming and other state of the art machine learning methods both on training and unseen test data.
- Published
- 2014
31. Geometric Semantic Genetic Programming for Real Life Applications
- Author
-
Mauro Castelli, Luca Manzoni, Leonardo Vanneschi, Sara Silva, Vanneschi, Leonardo, Silva, Sara, Castelli, Mauro, Manzoni, Luca, Riolo, R, Moore, JH, Kotanchek, M, Vanneschi, L, Silva, S, Castelli, M, and Manzoni, L
- Subjects
Training set ,Fitness landscape ,business.industry ,Overfitting ,Genetic programming ,Geometric semantic operators ,Fitness landscapes ,Parameter tuning ,Machine learning ,computer.software_genre ,Geometric Semantic Operators, Fitness Landscapes, Overfitting, Parameter Tuning ,Geometric semantic operator ,A priori and a posteriori ,Limit (mathematics) ,Artificial intelligence ,business ,computer ,Test data ,Mathematics - Abstract
In a recent contribution we have introduced a new implementation of geometric semantic operators for Genetic Programming. Thanks to this implementation, we are now able to deeply investigate their usefulness and study their properties on complex real-life applications. Our experiments confirm that these operators are more effective than traditional ones in optimizing training data, due to the fact that they induce a unimodal fitness landscape. Furthermore, they automatically limit overfitting, something we had already noticed in our recent contribution, and that is further discussed here. Finally, we investigate the influence of some parameters on the effectiveness of these operators, and we show that tuning their values and setting them “a priori” may be wasted effort. Instead, if we randomly modify the values of those parameters several times during the evolution, we obtain a performance that is comparable with the one obtained with the best setting, both on training and test data for all the studied problems.
- Published
- 2014
32. Geometric Selective Harmony Search
- Author
-
Luca Manzoni, Leonardo Vanneschi, Sara Silva, Mauro Castelli, Castelli, M, Silva, S, Manzoni, L, Vanneschi, L, Instituto de Engenharia de Sistemas e Computadores (INESC), Modèles Discrets pour les Systèmes Complexes (Laboratoire I3S - MDSC), Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis (I3S), Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA)-Université Nice Sophia Antipolis (... - 2019) (UNS), COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-COMUE Université Côte d'Azur (2015-2019) (COMUE UCA)-Centre National de la Recherche Scientifique (CNRS)-Université Côte d'Azur (UCA), ANR-09-BLAN-0164,EMC,Emergence dans les modèles de calcul(2009), Castelli, Mauro, Silva, Sara, Manzoni, Luca, and Vanneschi, Leonardo
- Subjects
Optimization ,Information Systems and Management ,Recombination operators ,02 engineering and technology ,Theoretical Computer Science ,Operator (computer programming) ,Artificial Intelligence ,Multimodal Functions ,0202 electrical engineering, electronic engineering, information engineering ,[INFO]Computer Science [cs] ,ComputingMilieux_MISCELLANEOUS ,Selection (genetic algorithm) ,Mathematics ,Harmony Search ,business.industry ,Suite ,ComputingMilieux_PERSONALCOMPUTING ,Process (computing) ,Computer Science Application ,020206 networking & telecommunications ,Computer Science Applications ,Multimodal Function ,Control and Systems Engineering ,Benchmark (computing) ,Harmony search ,Beam search ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Software - Abstract
This paper presents a new variant of the Harmony Search algorithm, called Geometric Selective Harmony Search. The main differences between the proposed variant and the original formulation of Harmony Search are the integration of a selection procedure in the improvisation phase, a new memory consideration process that makes use of a recombination operator, and the integration of a new mutation operator. We compare Geometric Selective Harmony Search with the original Harmony Search, with another existing variant called Improved Harmony Search, and with two existing selection-based Harmony Search algorithms. The experimental study was conducted on 20 benchmark problems belonging to the CEC 2010 suite, one of the most well-known state-of-the-art benchmark sets. The results show that Geometric Selective Harmony Search outperforms the other studied methods with statistical significance in almost all the considered benchmark problems. © 2014 Elsevier Inc. All rights reserved.
- Published
- 2014
33. Towards the Use of Genetic Programming for the Prediction of Survival in Cancer
- Author
-
Marco Giacobini, Paolo Provero, Leonardo Vanneschi, Giancarlo Mauri, Cagnoni, S, Mirolli, M, Villani, M, Giacobini, M, Provero, P, Vanneschi, L, and Mauri, G
- Subjects
machine learning ,evolutionary computation ,Bioinformatics ,Genetic Programming ,artificial life - Abstract
Risk stratification of cancer patients, that is the prediction of the outcome of the pathology on an individual basis, is a key ingredient in making therapeutic decisions. In recent years the use of gene expression profiling in combination with the clinical and histological criteria traditionally used in such a prediction has been successfully introduced. Sets of genes whose expression values in a tumor can be used to predict the outcome of the pathology (gene expression signatures) were introduced and tested by many research groups. A well-known such signature is the 70-gene signature, on which we recently tested several machine learning techniques in order to maximize its predictive power. Genetic Programming (GP) was shown to perform significantly better than other techniques including Support Vector Machines, Multilayered Perceptrons and Random Forests in classifying patients. Genetic Programming has the further advantage, with respect to other methods, of performing an automatic feature selection. Importantly, by using a weighted average between false positives and false negatives in the definition of the fitness, we showed that GP can outperform all the other methods in minimizing false negatives (one of the main goals in clinical applications) without compromising the overall minimization of incorrectly classified instances. The solutions returned by GP are appealing also from a clinical point of view, being simple, easy to understand, and built out of a rather limited subset of the available features.
- Published
- 2014
34. Genetic algorithm with variable neighborhood search for the optimal allocation of goods in shop shelves
- Author
-
Mauro Castelli, Leonardo Vanneschi, Castelli, M, and Vanneschi, L
- Subjects
Mathematical optimization ,Allocation, Heuristics, Shelf space ,Computer science ,Shelf space ,Applied Mathematics ,INF/01 - INFORMATICA ,Management Science and Operations Research ,Hybrid algorithm ,Industrial and Manufacturing Engineering ,Genetic algorithm ,Optimal allocation ,Heuristics ,Software ,Variable neighborhood search - Abstract
Shelves on which products are being displayed are one of the most important resources in retail environment. The decision of shelf-space allocation and management is therefore a critical issue in retail operation management. In this paper a hybrid algorithm that combines a genetic algorithm with a variable neighborhood search is proposed to address the shop shelf allocation problem. Results obtained from an extensive experimental phase show the suitability of the proposed algorithm in addressing the problem at hand.
- Published
- 2014
35. An efficient implementation of geometric semantic genetic programming for anticoagulation level prediction in pharmacogenetics
- Author
-
Mauro Castelli, Daniele Maccagnola, Leonardo Vanneschi, Davide Castaldi, Francesco Archetti, Ilaria Giordani, Castelli, M, Castaldi, D, Vanneschi, L, Giordani, I, Archetti, F, and Maccagnola, D
- Subjects
Set (abstract data type) ,Geometric semantic operator ,Computational Mathematics ,Training set ,Theoretical computer science ,Semantics (computer science) ,Computer science ,Random tree ,Computational mathematics ,Genetic programming ,Oral anticoagulation therapy - Abstract
In the last few years researchers have dedicated several efforts to the definition of Genetic Programming (GP) [?] systems based on the semantics of the solutions, where by semantics we generally intend the behavior of a program once it is executed on a set of inputs, or more particularly the set of its output values on input training data (this definition has been used, among many others, for instance in [?, ?, ?, ?]). In particular, new genetic operators, called geometric semantic operators, have been proposed by Moraglio et al. [?]. They are defined s follows
- Published
- 2013
36. A new implementation of geometric semantic GP and its application to problems in pharmacokinetics
- Author
-
Sara Silva, Luca Manzoni, Leonardo Vanneschi, Mauro Castelli, Krzysztof Krawiec, Alberto Moraglio, Ting Hu, A. Şima Etaner-Uyar, Bin Hu, Vanneschi, Leonardo, Castelli, Mauro, Manzoni, Luca, Silva, Sara, Vanneschi, L, Castelli, M, Manzoni, L, and Silva, S
- Subjects
Theoretical computer science ,Training set ,Matching (graph theory) ,business.industry ,Offspring ,Fitness landscape ,Generalization ,Computer science ,Computer Science (all) ,Geometric Semantic GP ,Genetic Programming ,Genetic programming ,Semantic operators ,Semantic operator ,Theoretical Computer Science ,Evolvability ,Feature (linguistics) ,Random tree ,Artificial intelligence ,business - Abstract
Moraglio et al. have recently introduced new genetic operators for genetic programming, called geometric semantic operators. These operators induce a unimodal fitness landscape for all the problems consisting in matching input data with known target outputs (like regression and classification). This feature facilitates genetic programming evolvability, which makes these operators extremely promising. Nevertheless, Moraglio et al. leave open problems, the most important one being the fact that these operators, by construction, always produce offspring that are larger than their parents, causing an exponential growth in the size of the individuals, which actually renders them useless in practice. In this paper we overcome this limitation by presenting a new efficient implementation of the geometric semantic operators. This allows us, for the first time, to use them on complex real-life applications, like the two problems in pharmacokinetics that we address here. Our results confirm the excellent evolvability of geometric semantic operators, demonstrated by the good results obtained on training data. Furthermore, we have also achieved a surprisingly good generalization ability, a fact that can be explained considering some properties of geometric semantic operators, which makes them even more appealing than before. © 2013 Springer-Verlag.
- Published
- 2013
37. A hybrid genetic algorithm for the repetition free longest common subsequence problem
- Author
-
Leonardo Vanneschi, Mauro Castelli, Stefano Beretta, Castelli, M, Beretta, S, and Vanneschi, L
- Subjects
Theoretical computer science ,Repetition (rhetorical device) ,Applied Mathematics ,Hunt–McIlroy algorithm ,Approximation algorithm ,Heuristic ,Longest increasing subsequence ,Management Science and Operations Research ,Industrial and Manufacturing Engineering ,Longest common subsequence problem ,Applied Mathematic ,Estimation of distribution algorithm ,Genetic algorithm ,Repetition free longest common subsequence ,Focus (optics) ,Algorithm ,Software ,Mathematics - Abstract
Computing the longest common subsequence of two sequences is one of the most studied algorithmic problems. In this work we focus on a particular variant of the problem, called repetition free longest common subsequence (RF-LCS), which has been proved to be NP-hard. We propose a hybrid genetic algorithm, which combines standard genetic algorithms and estimation of distribution algorithms, to solve this problem. An experimental comparison with some well-known approximation algorithms shows the suitability of the proposed technique. © 2013 Elsevier B.V. All rights reserved.
- Published
- 2013
38. A new genetic programming framework based on reaction systems
- Author
-
Luca Manzoni, Mauro Castelli, Leonardo Vanneschi, Manzoni, L, Castelli, M, Vanneschi, L, Manzoni, Luca, Castelli, Mauro, and Vanneschi, Leonardo
- Subjects
Theoretical computer science ,business.industry ,Computer science ,Genetic programming ,Interactive evolutionary computation ,Evolutionary computation ,Reaction systems ,Computer Science Applications1707 Computer Vision and Pattern Recognition ,Inductive programming ,Computer Science Applications ,Theoretical Computer Science ,Java Evolutionary Computation Toolkit ,Evolutionary music ,Hardware and Architecture ,Reactive programming ,Artificial intelligence ,Genetic representation ,business ,Evolutionary programming ,Software ,Reaction system - Abstract
This paper presents a new genetic programming framework called Evolutionary Reaction Systems. It is based on a recently defined computational formalism, inspired by chemical reactions, called Reaction Systems, and it has several properties that distinguish it from other existing genetic programming frameworks, making it interesting and worthy of investigation. For instance, it allows us to express complex constructs in a simple and intuitive way, and it lightens the final user from the task of defining the set of primitive functions used to build up the evolved programs. Given that Evolutionary Reaction Systems is new and it has small similarities with other existing genetic programming frameworks, a first phase of this work is dedicated to a study of some important parameters and their influence on the algorithm's performance. Successively, we use the best parameter setting found to compare Evolutionary Reaction Systems with other well established machine learning methods, including standard tree-based genetic programming. The presented results show that Evolutionary Reaction Systems are competitive with, and in some cases even better than, the other studied methods on a wide set of benchmarks. © 2013 Springer Science+Business Media New York.
- Published
- 2013
39. Gene regulatory networks reconstruction from time series datasets using genetic programming: a comparison between tree-based and graph-based approaches
- Author
-
Mattia Stefano, Matteo Mondini, Alberto Ronchi, Leonardo Vanneschi, Martino Bertoni, Vanneschi, L, Mondini, M, Bertoni, M, Ronchi, A, and Stefano, M
- Subjects
Reverse engineering ,Series (mathematics) ,Computer science ,business.industry ,Graph based ,Gene regulatory network ,Inference ,Genetic programming ,computer.software_genre ,Machine learning ,Field (computer science) ,Computer Science Applications ,Theoretical Computer Science ,Hardware and Architecture ,genetic programming ,ComputingMethodologies_GENERAL ,Tree based ,Artificial intelligence ,business ,computer ,Software - Abstract
Genetic programming researchers have shown a growing interest in the study of gene regulatory networks in the last few years. Our team has also contributed to the field, by defining two systems for the automatic reverse engineering of gene regulatory networks called GRNGen and GeNet. In this paper, we revise this work by describing in detail the two approaches and empirically comparing them. The results we report, and in particular the fact that GeNet can be used on large networks while GRNGen cannot, encourage us to pursue the study of GeNet in the future. We conclude the paper by discussing the main research directions that we are planning to investigate to improve GeNet. © 2013 Springer Science+Business Media New York.
- Published
- 2013
40. Genetic programming needs better benchmarks
- Author
-
Wojciech Jaskowski, Sean Luke, James McDermott, Luca Manzoni, David White, Mauro Castelli, Leonardo Vanneschi, Krzysztof Krawiec, Kenneth de Jong, Robin Harper, Una-May O'Reilly, Mcdermott, J, White, D, Luke, S, Manzoni, L, Castelli, M, Vanneschi, L, Jaśkowski, W, Krawiec, K, Harper, R, De Jong, K, O'Reilly, U, Terence Soule, Jason H. Moore, Mcdermott, Jame, White David, R., Luke, Sean, Manzoni, Luca, Castelli, Mauro, Vanneschi, Leonardo, Jaskowski, Wojciech, Krawiec, Krzysztof, Harper, Robin, De Jong, Kenneth, and O'Reilly, Una-May
- Subjects
Standardization ,business.industry ,Computer science ,Applied Mathematics ,benchmarks ,genetic programming ,Genetic programming ,Benchmarking ,Data science ,Field (computer science) ,benchmark ,Computational Theory and Mathematic ,Benchmark (computing) ,Artificial intelligence ,business ,Set (psychology) ,Implementation - Abstract
Genetic programming (GP) is not a field noted for the rigor of its benchmarking. Some of its benchmark problems are popular purely through historical contingency, and they can be criticized as too easy or as providing misleading information concerning real-world performance, but they persist largely because of inertia and the lack of good alternatives. Even where the problems themselves are impeccable, comparisons between studies are made more difficult by the lack of standardization. We argue that the definition of standard benchmarks is an essential step in the maturation of the field. We make several contributions towards this goal. We motivate the development of a benchmark suite and define its goals; we survey existing practice; we enumerate many candidate benchmarks; we report progress on reference implementations; and we set out a concrete plan for gathering feedback from the GP community that would, if adopted, lead to a standard set of benchmarks. © 2012 ACM.
- Published
- 2012
41. Parameter tuning of evolutionary reactions systems
- Author
-
Mauro Castelli, Leonardo Vanneschi, Luca Manzoni, Terence Soule, Jason H. Moore, Castelli, Mauro, Manzoni, Luca, Vanneschi, Leonardo, Castelli, M, Manzoni, L, and Vanneschi, L
- Subjects
evolutionary algorithm ,reaction systems ,Artificial neural network ,business.industry ,Computer science ,Applied Mathematics ,Evolutionary robotics ,Evolutionary algorithm ,evolutionary algorithms ,parameter tuning ,Genetic programming ,Interactive evolutionary computation ,reaction system ,Human-based evolutionary computation ,Evolutionary acquisition of neural topologies ,Evolutionary music ,Computational Theory and Mathematic ,Memetic algorithm ,Artificial intelligence ,business ,Evolutionary programming - Abstract
Reaction systems is a formalism inspired by chemical reactions introduced by Rozenberg and Ehrenfeucht. Recently, an evolutionary algorithm based on this formalism, called Evolutionary Reaction Systems, has been presented. This new algorithm proved to have comparable performances to other well-established machine learning methods, like genetic programming, neural networks and support vector machines on both artificial and real-life problems. Even if the results are encouraging, to make Evolutionary Reaction Systems an established evolutionary algorithm, an in depth analysis of the effect of its parameters on the search process is needed, with particular focus on those parameters that are typical of Evolutionary Reaction Systems and do not have a counterpart in traditional evolutionary algorithms. Here we address this problem for the first time. The results we present show that one particular parameter, between the ones tested, has a great influence on the performances of Evolutionary Reaction Systems, and thus its setting deserves practitioners' particular attention: the number of symbols used to represent the reactions that compose the system. Furthermore, this work represents a first step towards the definition of a set of default parameter values for Evolutionary Reaction Systems, that should facilitate their use for beginners or inexpert practitioners. © 2012 ACM.
- Published
- 2012
42. Evolutionary reaction systems
- Author
-
Luca Manzoni, Leonardo Vanneschi, Mauro Castelli, Mario Giacobini, Leonardo Vanneschi, William S. Bush, Manzoni, Luca, Castelli, Mauro, Vanneschi, Leonardo, Manzoni, L, Castelli, M, and Vanneschi, L
- Subjects
Computer science ,business.industry ,Computer Science::Neural and Evolutionary Computation ,Computer Science (all) ,Evolutionary algorithm ,Reaction systems ,evolutionary computation ,Particle swarm optimization ,Genetic programming ,Ant colony ,Evolutionary computation ,Theoretical Computer Science ,Evolutionary music ,Artificial intelligence ,business ,Evolutionary programming ,Reaction system - Abstract
In the recent years many bio-inspired computational methods were defined and successfully applied to real life problems. Examples of those methods are particle swarm optimization, ant colony, evolutionary algorithms, and many others. At the same time, computational formalisms inspired by natural systems were defined and their suitability to represent different functions efficiently was studied. One of those is a formalism known as reaction systems. The aim of this work is to establish, for the first time, a relationship between evolutionary algorithms and reaction systems, by proposing an evolutionary version of reaction systems. In this paper we show that the resulting new genetic programming system has better, or at least comparable performances to a set of well known machine learning methods on a set of problems, also including real-life applications. Furthermore, we discuss the expressiveness of the solutions evolved by the presented evolutionary reaction systems. © 2012 Springer-Verlag.
- Published
- 2012
43. A GPU-Based Multi-swarm PSO Method for Parameter Estimation in Stochastic Biological Systems Exploiting Discrete-Time Target Series
- Author
-
Giancarlo Mauri, Daniela Besozzi, Dario Pescini, Paolo Cazzaniga, Marco S. Nobile, Giacobini, M, Vanneschi, L, Bush, WS, Nobile, M, Besozzi, D, Cazzaniga, P, Mauri, G, and Pescini, D
- Subjects
Mathematical optimization ,Settore INF/01 - Informatica ,particle swarm optimization ,Estimation theory ,Computer science ,INF/01 - INFORMATICA ,Swarm behaviour ,Particle swarm optimization ,Sampling (statistics) ,Set (abstract data type) ,Reduction (complexity) ,Discrete time and continuous time ,Stochastic simulation ,Parameter estimation ,Systems biology ,Algorithm - Abstract
Parameter estimation (PE) of biological systems is one of the most challenging problems in Systems Biology. Here we present a PE method that integrates particle swarm optimization (PSO) to estimate the value of kinetic constants, and a stochastic simulation algorithm to reconstruct the dynamics of the system. The fitness of candidate solutions, corresponding to vectors of reaction constants, is defined as the point-to-point distance between a simulated dynamics and a set of experimental measures, carried out using discrete-time sampling and various initial conditions. A multi-swarm PSO topology with different modalities of particles migration is used to account for the different laboratory conditions in which the experimental data are usually sampled. The whole method has been specifically designed and entirely executed on the GPU to provide a reduction of computational costs. We show the effectiveness of our method and discuss its performances on an enzymatic kinetics and a prokaryotic gene expression network.
- Published
- 2012
44. An Empirical Study of Parallel and Distributed Particle Swarm Optimization
- Author
-
Daniele Codecasa, Giancarlo Mauri, Leonardo Vanneschi, deVega, FF, Perez, JIH, Lanchares, J, Vanneschi, L, Codecasa, D, and Mauri, G
- Subjects
Genetic Algorithm ,education.field_of_study ,Computer science ,Heuristic (computer science) ,Embarrassingly parallel ,Population ,PSO ,INF/01 - INFORMATICA ,Particle swarm optimization ,Parallel computing ,Set (abstract data type) ,Swarm Optimization ,Component (UML) ,Genetic algorithm ,Evolutionary programming ,Heuristics ,education - Abstract
Given the implicitly parallel nature of population-based heuristics, many contributions reporting on parallel and distributed models and implementations of these heuristics have appeared so far. They range from the most natural and simple ones, i.e. fitness-level embarrassingly parallel implementations (where, for instance, each candidate solution is treated as an independent agent and evaluated on a dedicated processor), to many more sophisticated variously interacting multi-population systems. In the last few years, researchers have dedicated a growing attention to Particle Swarm Optimization (PSO), a bio-inspired population based heuristic inspired by the behavior of flocks of birds and shoals of fish, given its extremely simple implementation and its high intrinsical parallelism. Several parallel and distributed models of PSO have been recently defined, showing interesting performances both on benchmarks and real-life applications. In this chapter we report on four parallel and distributed PSO methods that have recently been proposed. They consist in a genetic algorithm whose individuals are co-evolving swarms, an “island model”- based multi-swarm system, where swarms are independent and interact by means of particle migrations at regular time steps, and their respective variants enriched by adding a repulsive component to the particles. We show that the proposed repulsive multi-swarm system has a better optimization ability than all the other presented methods on a set of hand-tailored benchmarks and complex real-life applications.
- Published
- 2012
45. A Study on Learning Robustness using Asynchronous 1D Cellular Automata Rules
- Author
-
Leonardo Vanneschi, Giancarlo Mauri, Vanneschi, L, and Mauri, G
- Subjects
Cellular automata ,Generality ,Computer science ,business.industry ,Complex system ,INF/01 - INFORMATICA ,Genetic algorithms ,Cellular automaton ,Computer Science Applications ,Automaton ,Asynchronous communication ,Robustness (computer science) ,Theory of computation ,Machine learning ,Artificial intelligence ,business - Abstract
Numerous studies can be found in literature concerning the idea of learning cellular automata (CA) rules that perform a given task by means of machine learning methods. Among these methods, genetic algorithms (GAs) have often been used with excellent results. Nevertheless, few attention has been dedicated so far to the generality and robustness of the learned rules. In this paper, we show that when GAs are used to evolve asynchronous one-dimensional CA rules, they are able to find more general and robust solutions compared to the more usual case of evolving synchronous CA rules.
- Published
- 2012
46. Bloat free genetic programming: application to human oral bioavailability prediction
- Author
-
Leonardo Vanneschi, Sara Silva, Silva, S, and Vanneschi, L
- Subjects
Computer science ,media_common.quotation_subject ,Biological Availability ,Genetic programming ,Feature selection ,Library and Information Sciences ,Overfitting ,Machine learning ,computer.software_genre ,General Biochemistry, Genetics and Molecular Biology ,Operator (computer programming) ,Humans ,Quality (business) ,media_common ,Models, Genetic ,business.industry ,Code growth ,Genetics, Population ,Pharmaceutical Preparations ,Linear Models ,genetic programming ,Artificial intelligence ,Symbolic regression ,business ,computer ,Control methods ,Algorithms ,Information Systems - Abstract
Being able to predict the human oral bioavailability for a potential new drug is extremely important for the drug discovery process. This problem has been addressed by several prediction tools, with Genetic Programming providing some of the best results ever achieved. In this paper we use the newest developments of Genetic Programming, in particular the latest bloat control method, Operator Equalisation, to find out how much improvement we can achieve on this problem. We show examples of some actual solutions and discuss their quality, comparing them with previously published results. We identify some unexpected behaviours related to overfitting, and discuss the way for further improving the practical usage of the Genetic Programming approach. Copyright © 2012 Inderscience Enterprises Ltd.
- Published
- 2012
47. Evolutionary Algorithms in Problem Solving and Machine Learning
- Author
-
Marco Tomassini, Leonardo Vanneschi, Orsucci, F, Sala, N, Tomassini, M, and Vanneschi, L
- Subjects
Learning classifier system ,Computer science ,Active learning (machine learning) ,business.industry ,evolutionary, algorithms, problem, solving, machine, learning ,Online machine learning ,Machine learning ,computer.software_genre ,Evolutionary computation ,Computational learning theory ,Memetic algorithm ,Instance-based learning ,Artificial intelligence ,business ,computer ,Evolutionary programming - Abstract
In the first part of the chapter, evolutionary algorithms are briefly described, especially genetic algorithms and genetic programming, with sufficient detail so as to prepare the ground for the second part. The latter presents in more detail two specific applications. The first is about an important financial problem: the portfolio allocation problem. The second one deals with a biochemical problem related to drug design and efficacy. © 2008, IGI Global.
- Published
- 2011
48. Multi objective genetic programming for feature construction in classification problems
- Author
-
Luca Manzoni, Mauro Castelli, Leonardo Vanneschi, Castelli, M, Manzoni, L, Vanneschi, L, Carlos A. Coello Coello, Castelli, Mauro, Manzoni, Luca, and Vanneschi, Leonardo
- Subjects
business.industry ,Computer science ,Computer Science (all) ,Genetic programming ,Machine learning ,computer.software_genre ,Multi objective genetic programming ,Theoretical Computer Science ,Support vector machine ,Entropy (information theory) ,Artificial intelligence ,business ,computer - Abstract
This work introduces a new technique for features construction in classification problems by means of multi objective genetic programming (MOGP). The final goal is to improve the generalization ability of the final classifier. MOGP can help in finding solutions with a better generalization ability with respect to standard genetic programming as stated in [1]. The main issue is the choice of the criteria that must be optimized by MOGP. In this work the construction of new features is guided by two criteria: the first one is the entropy of the target classes as in [7] while the second is inspired by the concept of margin used in support vector machines. © Springer-Verlag Berlin Heidelberg 2011
- Published
- 2011
49. A New Evolutionary Gene Regulatory Network Reverse Engineering Tool
- Author
-
Paolo Provero, Giancarlo Mauri, Mario Giacobini, Antonella Farinaccio, Leonardo Vanneschi, Pizzuti, C, Ritchie, MD, Giacobini, M, Farinaccio, A, Vanneschi, L, Provero, P, and Mauri, G
- Subjects
Reverse engineering ,Computer science ,Activation function ,Gene regulatory network ,Genetic programming ,Feature selection ,gene regulatory network ,Computational biology ,Machine learning ,computer.software_genre ,genetic networks ,reverse engineering ,evolutionary algorithms ,automatic feature selection ,Gene ,Regulation of gene expression ,business.industry ,INF/01 - INFORMATICA ,Yeast ,random boolean networks ,ComputingMethodologies_PATTERNRECOGNITION ,Benchmark (computing) ,ComputingMethodologies_GENERAL ,Artificial intelligence ,business ,computer - Abstract
We present a new reverse-engineering framework for gene regulatory network reconstruction. It works on temporal series of gene activation data and, using genetic programming, it extracts the activation functions of the different genes from those data. Successively, the gene regulatory network is reconstructed exploiting the automatic feature selection performed by genetic programming and its dynamics can be simulated using the previously extracted activation functions. The framework was tested on the well-known IRMA gene regulatory network, a simple network composed by five genes in the yeast Saccharomyces cerevisiae, defined in 2009 as a simplified biological model to benchmark reverse-engineering approaches. We show that the performances of the proposed framework on this benchmark network are encouraging.
- Published
- 2011
50. A comparison of machine learning techniques for survival prediction in breast cancer
- Author
-
Giancarlo Mauri, Paolo Provero, Antonella Farinaccio, Mario Giacobini, Marco Antoniotti, Leonardo Vanneschi, Vanneschi, L, Farinaccio, A, Mauri, G, Antoniotti, M, Provero, P, and Giacobini, M
- Subjects
Computer science ,Genetic programming ,Feature selection ,lcsh:Analysis ,lcsh:Computer applications to medicine. Medical informatics ,computer.software_genre ,Machine learning ,Biochemistry ,breast cancer ,Breast cancer ,Genetics ,medicine ,survival prediction ,Molecular Biology ,Machine Learning, Breast Cancer, Genetic Programming ,business.industry ,Research ,machine learning ,genetic programming ,evolutionary computation ,lcsh:QA299.6-433 ,Cancer ,Expression (computer science) ,Perceptron ,medicine.disease ,Computer Science Applications ,Random forest ,Support vector machine ,Computational Mathematics ,Computational Theory and Mathematics ,lcsh:R858-859.7 ,Artificial intelligence ,Data mining ,business ,computer - Abstract
Background The ability to accurately classify cancer patients into risk classes, i.e. to predict the outcome of the pathology on an individual basis, is a key ingredient in making therapeutic decisions. In recent years gene expression data have been successfully used to complement the clinical and histological criteria traditionally used in such prediction. Many "gene expression signatures" have been developed, i.e. sets of genes whose expression values in a tumor can be used to predict the outcome of the pathology. Here we investigate the use of several machine learning techniques to classify breast cancer patients using one of such signatures, the well established 70-gene signature. Results We show that Genetic Programming performs significantly better than Support Vector Machines, Multilayered Perceptrons and Random Forests in classifying patients from the NKI breast cancer dataset, and comparably to the scoring-based method originally proposed by the authors of the 70-gene signature. Furthermore, Genetic Programming is able to perform an automatic feature selection. Conclusions Since the performance of Genetic Programming is likely to be improvable compared to the out-of-the-box approach used here, and given the biological insight potentially provided by the Genetic Programming solutions, we conclude that Genetic Programming methods are worth further investigation as a tool for cancer patient classification based on gene expression data.
- Published
- 2011
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.