A case study on grammatical-based representation for regular expression evolution

Authors :
María D. R-Moreno
David Camacho
David F. Barrero
Antonio Gonzalez-Pardo
UAM. Departamento de Ingeniería Informática
Herramientas Interactivas Avanzadas (ING EPS-003)
Source :
Biblos-e Archivo. Repositorio Institucional de la UAM, instname, ResearcherID, Advances in Intelligent and Soft Computing ISBN: 9783642124327, PAAMS (Special Sessions and Workshops)
Publication Year :
2010
Publisher :
Springer Berlin Heidelberg, 2010.

Abstract

The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-642-12433-4_45
Proceedings of the 8th International Conference on Practical Applications of Agents and Multiagent Systems
Regular expressions, or simply regex, have been widely used for decades as a powerful pattern matching and text extraction tool. Although they provide a powerful and flexible notation to define and retrieve patterns from text, the syntax and grammatical rules of these regex notations are not easy to use, or even to understand. Any regex can be represented as a Deterministic or Non-Deterministic Finite Automaton, so it is possible to design a representation to automatically build a regex, and an optimization algorithm able to find the best regex in terms of complexity. This paper introduces both a graph-based representation for regex and a particular heuristic-based evolutionary computing algorithm, based on grammatical features of this language, applied to a particular data extraction problem.
This work has been partially supported by the Spanish Ministry of Science and Innovation under the projects Castilla-La Mancha project PEII09-0266-6640, COMPUBIODIVE (TIN2007-65989), and by HADA (TIN2007-64718).

Details

Language :
English
ISBN :
978-3-642-12432-7
ISBNs :
9783642124327
Database :
OpenAIRE
Journal :
Biblos-e Archivo. Repositorio Institucional de la UAM, instname, ResearcherID, Advances in Intelligent and Soft Computing ISBN: 9783642124327, PAAMS (Special Sessions and Workshops)
Accession number :
edsair.doi.dedup.....0a560a4c7ccea290595b50569277bdf8