33 results on '"Bel-Enguix, Gemma"'
Search Results
2. Strategies for Automatic Detection of Fallacious Arguments in Political Speeches during Electoral Campaigns in Mexico
- Author
-
Nieto-Benitez, Kenia, primary, Castro-Sanchez, Noe Alejandro Castro-Sanchez, additional, Salazar, Hector Jimenez, additional, Bel-Enguix, Gemma, additional, Mújica Vargas, Dante, additional, González Serna, Juan Gabriel, additional, and Franco, Nimrod González, additional
- Published
- 2024
- Full Text
- View/download PDF
3. LexAN: Lexical Association Networks
- Author
-
Reyes-Magaña, Jorge, primary, Sierra, Gerardo, additional, Bel-Enguix, Gemma, additional, and Gomez-Adorno, Helena, additional
- Published
- 2023
- Full Text
- View/download PDF
4. The challenges of statistical patterns of language: the case of Menzerath's law in genomes
- Author
-
Ferrer-i-Cancho, Ramon, Forns, Núria, Hernández-Fernández, Antoni, Bel-Enguix, Gemma, and Baixeries, Jaume
- Subjects
Quantitative Biology - Genomics ,Computer Science - Computational Engineering, Finance, and Science ,Physics - Data Analysis, Statistics and Probability - Abstract
The importance of statistical patterns of language has been debated over decades. Although Zipf's law is perhaps the most popular case, recently, Menzerath's law has begun to be involved. Menzerath's law manifests in language, music and genomes as a tendency of the mean size of the parts to decrease as the number of parts increases in many situations. This statistical regularity emerges also in the context of genomes, for instance, as a tendency of species with more chromosomes to have a smaller mean chromosome size. It has been argued that the instantiation of this law in genomes is not indicative of any parallel between language and genomes because (a) the law is inevitable and (b) non-coding DNA dominates genomes. Here mathematical, statistical and conceptual challenges of these criticisms are discussed. Two major conclusions are drawn: the law is not inevitable and languages also have a correlate of non-coding DNA. However, the wide range of manifestations of the law in and outside genomes suggests that the striking similarities between non-coding DNA and certain linguistics units could be anecdotal for understanding the recurrence of that statistical law., Comment: Title changed, abstract and introduction improved and little corrections on the statistical arguments
- Published
- 2012
- Full Text
- View/download PDF
5. Overview of HOMO-MEX at Iberlef 2023: Hate speech detection in Online Messages directed Towards the MEXican Spanish speaking LGBTQ+ population
- Author
-
Bel Enguix, Gemma, Gómez-Adorno, Helena, Sierra Martínez, Gerardo, Vásquez, Juan, Andersen, Scott Thomas, Ojeda-Trueba, Sergio, Bel Enguix, Gemma, Gómez-Adorno, Helena, Sierra Martínez, Gerardo, Vásquez, Juan, Andersen, Scott Thomas, and Ojeda-Trueba, Sergio
- Abstract
The detection of hate speech and stereotypes in online platforms has gained significant attention in the field of Natural Language Processing (NLP). Among various forms of discrimination, LGBTQ+ phobia is prevalent on social media, particularly on platforms like Twitter. The objective of the HOMO-MEX task is to encourage the development of NLP systems that can detect and classify LGBTQ+ phobic content in Spanish tweets, regardless of whether it is expressed aggressively or subtly. The task aims to address the lack of dedicated resources for LGBTQ+ phobia detection in Spanish Twitter and encourages participants to employ multi-label classification approaches., La detección de discursos de odio y estereotipos en plataformas en línea ha suscitado gran atención en el campo del Procesamiento del Lenguaje Natural (PLN). Entre las diversas formas de discriminación, la LGBTQ+fobia es frecuente en las redes sociales, especialmente en plataformas como Twitter. El objetivo de la tarea HOMO-MEX es fomentar el desarrollo de sistemas de PLN que puedan detectar y clasificar contenido LGBTQ+fóbico en tuits en español, independientemente de si se expresa de forma agresiva o sutil. La tarea pretende abordar la falta de recursos dedicados a la detección de la fobia LGBTQ+ en Twitter en español y anima a los participantes a emplear enfoques de clasificación multietiqueta.
- Published
- 2023
6. Corpus de falacias por apelación a las emociones: una aproximación a la identificación automática de falacias
- Author
-
Nieto Benitez, Kenia, primary, Castro-Sánchez, Noé Alejandro, additional, Jiménez Salazar, Héctor, additional, and Bel-Enguix, Gemma, additional
- Published
- 2023
- Full Text
- View/download PDF
7. Resumen de PAR-MEX en IberLEF 2022: Tarea Compartida para la Detección de Paráfrasis en Español
- Author
-
Bel Enguix, Gemma, Sierra Martínez, Gerardo, Gómez-Adorno, Helena, Torres Moreno, Juan Manuel, Ortiz-Barajas, Jesus-German, and Vásquez, Juan
- Subjects
PAR-MEX ,Iberlef ,Detección paráfrasis ,Paraphrase detection - Abstract
Paraphrase detection is an important unresolved task in natural language processing; especially in the Spanish language. In order to address this issue, and contribute to the creation of high-performance paraphrase detection automated systems, we propose a shared task called PAR-MEX. For this task, we created a corpus, in Spanish, with topics in the domain of Mexican gastronomy. Afterwards, the participants in this task submitted their classification results on our corpus. In this paper we explain the steps followed for the creation of the corpus, we summarize the results obtained by the various participants, and propose some conclusions regarding the paraphrase-detection task in Spanish. La detección de paráfrasis es una tarea importante no resuelta en procesamiento del lenguaje natural; especialmente en la lengua española. Para atacar este problema, y para contribuir a la creación de sistemas de detección automática que obtengan resultados competitivos, proponemos la tarea compartida llamada PAR-MEX. Para esto, creamos un corpus en español con temas dentro del campo semántico de gastronomía mexicana. Después los participantes en esta tarea enviaron los resultados de sus sistemas de clasificación sobre nuestro corpus. En este paper explicamos los pasos seguidos para la creación del corpus, resumimos los resultados obtenidos por los participantes, y proponemos algunas conclusiones al respecto de la detección de paráfrasis en español. We acknowledge the support of the projects CONACyT CB A1-S-27780, and DGAPA-UNAM PAPIIT references TA400121 and TA101722.
- Published
- 2022
8. On Coevaluation Behavior and Equivalence
- Author
-
Zúñiga, Angel, primary and Bel-Enguix, Gemma, additional
- Published
- 2022
- Full Text
- View/download PDF
9. Sentence-CROBI: A Simple Cross-Bi-Encoder-Based Neural Network Architecture for Paraphrase Identification
- Author
-
Ortiz-Barajas, Jesus-German, primary, Bel-Enguix, Gemma, additional, and Gómez-Adorno, Helena, additional
- Published
- 2022
- Full Text
- View/download PDF
10. La #felicidad en Twitter: ¿qué representa realmente?
- Author
-
Bel Enguix, Gemma, primary, Gómez-Adorno, Helena, additional, Mendoza, Karla, additional, Sidorov, Grigori, additional, and Vásquez, Juan, additional
- Published
- 2022
- Full Text
- View/download PDF
11. Overview of PAR-MEX at Iberlef 2022: Paraphrase Detection in Spanish Shared Task
- Author
-
Bel Enguix, Gemma, Sierra Martínez, Gerardo, Gómez-Adorno, Helena, Torres Moreno, Juan Manuel, Ortiz-Barajas, Jesus-German, Vásquez, Juan, Bel Enguix, Gemma, Sierra Martínez, Gerardo, Gómez-Adorno, Helena, Torres Moreno, Juan Manuel, Ortiz-Barajas, Jesus-German, and Vásquez, Juan
- Abstract
Paraphrase detection is an important unresolved task in natural language processing; especially in the Spanish language. In order to address this issue, and contribute to the creation of high-performance paraphrase detection automated systems, we propose a shared task called PAR-MEX. For this task, we created a corpus, in Spanish, with topics in the domain of Mexican gastronomy. Afterwards, the participants in this task submitted their classification results on our corpus. In this paper we explain the steps followed for the creation of the corpus, we summarize the results obtained by the various participants, and propose some conclusions regarding the paraphrase-detection task in Spanish., La detección de paráfrasis es una tarea importante no resuelta en procesamiento del lenguaje natural; especialmente en la lengua española. Para atacar este problema, y para contribuir a la creación de sistemas de detección automática que obtengan resultados competitivos, proponemos la tarea compartida llamada PAR-MEX. Para esto, creamos un corpus en español con temas dentro del campo semántico de gastronomía mexicana. Después los participantes en esta tarea enviaron los resultados de sus sistemas de clasificación sobre nuestro corpus. En este paper explicamos los pasos seguidos para la creación del corpus, resumimos los resultados obtenidos por los participantes, y proponemos algunas conclusiones al respecto de la detección de paráfrasis en español.
- Published
- 2022
12. Gender Identification in Social Media Using Transfer Learning
- Author
-
Sotelo, Aquilino Francisco, Gómez-Adorno, Helena, Esquivel-Flores, Oscar, and Bel-Enguix, Gemma
- Subjects
Author profiling ,Classification ,Article ,Natural Language Processing ,Transfer learning - Abstract
Social networks have modified the way we communicate. It is now possible to talk to a large number of people we have never met. Knowing the traits of a person from what he/she writes has become a new area of computational linguistics called Author Profiling. In this paper, we introduce a method for applying transfer learning to address the gender identification problem, which is a subtask of Author Profiling. Systems that use transfer learning are trained in a large number of tasks and then tested in their ability to learn new tasks. An example is to classify a new image into different possible classes, giving an example of each class. This differs from the traditional approach of standard machine learning techniques, which are trained in a single task and are evaluated in new examples of that task. The aim is to train a gender identification model on Twitter users using only their text samples in Spanish. The difference with other related works consists in the evaluation of different preprocessing techniques so that the transfer learning-based fine-tuning is more efficient.
- Published
- 2020
13. Overview of FakeDeS at IberLEF 2021: Fake News Detection in Spanish Shared Task
- Author
-
Gómez-Adorno, Helena, Posadas-Durán, Juan Pablo, Bel Enguix, Gemma, and Porto Capetillo, Claudia
- Subjects
Fake news detection ,Iberlef ,Lenguajes y Sistemas Informáticos ,FakeDeS - Abstract
This paper presents the overview of FakeDeS 2021, the second edition of this lab under the IberLEF conference. The FakeDeS shared task aims to explore different methodologies and strategies related to fake news detection in Spanish. This year edition brings two main challenges: thematic and language variation. For this purpose, we introduce a new testing corpus containing news related to COVID-19 and news from other Ibero-American countries. Este artículo hace una presentación general de la tarea compartida FakeDeS 2021, cuya segunda edición ha tenido lugar en 2021 bajo el congreso IberLEF, aunque se trata de la primera vez con esta denominación. La tarea FakeDeS tiene por objetivo explorar diferentes métodos y estategias relacinados con la detección de noticias falsas en español, principalmente en su variante de México. La edición de este año propone dos desafíos principales: variación temática y variación lingüística. Para ello, se introduce un nuevo corpus de prueba que contiene noticias relacionadas con COVID 19 y noticias de otros países de Iber-América. This research was funded by CONACyT project CB A1-S-27780, DGAPA-UNAM PAPIIT grants number TA400121 and TA100520. The authors also thank CONACYT for the computer resources provided through the INAOE Supercomputing Laboratory's Deep Learning Platform for Language Technologies.
- Published
- 2021
14. Modelling dialogue as inter-action
- Author
-
Bel-Enguix, Gemma and Jiménez-López, M. Dolores
- Published
- 2008
- Full Text
- View/download PDF
15. Negation Detection on Mexican Spanish Tweets: The T-MexNeg Corpus
- Author
-
Bel-Enguix, Gemma, primary, Gómez-Adorno, Helena, additional, Pimentel, Alejandro, additional, Ojeda-Trueba, Sergio-Luis, additional, and Aguilar-Vizuet, Brian, additional
- Published
- 2021
- Full Text
- View/download PDF
16. Computing Dialogues with Membranes
- Author
-
Bel-Enguix, Gemma and Jiménez-López, M. Dolores
- Published
- 2006
- Full Text
- View/download PDF
17. Coinductive Natural Semantics for Compiler Verification in Coq
- Author
-
Zúñiga, Angel, primary and Bel-Enguix, Gemma, additional
- Published
- 2020
- Full Text
- View/download PDF
18. Wan2vec: Embeddings learned on word association norms
- Author
-
Bel-Enguix, Gemma, primary, Gómez-Adorno, Helena, additional, Reyes-Magaña, Jorge, additional, and Sierra, Gerardo, additional
- Published
- 2019
- Full Text
- View/download PDF
19. Extracción automática de definiciones analíticas y relaciones semánticas de hiponimia-hiperonimia con un sistema basado en patrones lingüísticos
- Author
-
Dorantes, Miguel Alejandro, Pimentel, Alejandro, Sierra, Gerardo, Bel-Enguix, Gemma, and Molina, Claudio
- Published
- 2017
20. Wan2vec: Embeddings learned on word association norms.
- Author
-
Kejriwal, Mayank, Lopez, Vanessa, Sequeda, Juan F., Bel-Enguix, Gemma, Gómez-Adorno, Helena, Reyes-Magaña, Jorge, and Sierra, Gerardo
- Subjects
TRANSFER functions ,NATURAL language processing ,NATURAL languages ,EMBEDDINGS (Mathematics) ,SENTIMENT analysis ,WEIGHTED graphs ,TASK analysis - Abstract
Word embeddings are powerful for many tasks in natural language processing. In this work, we learn word embeddings using weighted graphs from word association norms (WAN) with the node2vec algorithm. Although building WAN is a difficult and time-consuming task, training the vectors from these resources is a fast and efficient process. This allows us to obtain good quality word embeddings from small corpora. We evaluate our word vectors in two ways: intrinsic and extrinsic. The intrinsic evaluation was performed with several word similarity benchmarks, WordSim-353, MC30, MTurk-287, MEN-TR-3k, SimLex-999, MTurk-771 and RG-65, and different similarity measures achieving better results than those obtained with word2vec, GloVe, and fastText, trained on a huge corpus. The extrinsic evaluation was done by measuring the quality of sentence embeddings using transfer tasks: sentiment analysis, paraphrase detection, natural language inference, and semantic textual similarity. The word vectors learned from the WAN are available on our Github page. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
21. Rhetorical Relations in the Speech of Alzheimer’s Patients and Healthy Elderly Subjects: An Approach from the RST
- Author
-
Paulino, Anayeli, primary, Sierra, Gerardo, additional, Hernández-Domínguez, Laura, additional, Da Cunha, Iria, additional, and Bel-Enguix, Gemma, additional
- Published
- 2018
- Full Text
- View/download PDF
22. Extracción automática de definiciones analíticas y relaciones semánticas de hiponimia-hiperonimia con un sistema basado en patrones lingüísticos
- Author
-
Dorantes, Miguel Alejandro, primary, Pimentel, Alejandro, additional, Sierra, Gerardo, additional, Bel-Enguix, Gemma, additional, and Molina, Claudio, additional
- Published
- 2018
- Full Text
- View/download PDF
23. Language Production, Cognition, and the Lexicon / edited by N ria Gala, Reinhard Rapp, Gemma Bel-Enguix.
- Author
-
Gala, N ria. editor., Rapp, Reinhard. editor., Bel-Enguix, Gemma. editor., SpringerLink (Online service), Gala, N ria. editor., Rapp, Reinhard. editor., Bel-Enguix, Gemma. editor., and SpringerLink (Online service)
- Abstract
The book collects contributions from well-established researchers at the interface between language and cognition. It provides an overview of the latest insights into this interdisciplinary field from the perspectives of natural language processing, computer science, psycholinguistics and cognitive science. One of the pioneers in cognitive natural language processing is Michael Zock, to whom this volume is dedicated. The structure of the book reflects his main research interests: lexicon and lexical analysis, semantics, language and speech generation, reading and writing technologies, language resources and language engineering. The book is a valuable reference work and authoritative information source, giving an overview on the field and describing the state of the art as well as future developments. It is intended for researchers and advanced students interested in the subject.
- Published
- 2015
24. Natural language and the genetic code: from the semiotic analogy to biolinguistics
- Author
-
Bel Enguix, Gemma and Jiménez López, M. Dolores
- Abstract
[Abstract] With the discovery of the DNA structure (Watson and Crick, 1953), the idea of DNA as a linguistic code arose (Monod, 1970). Many researchers have considered DNA as a language, pointing out the semiotic parallelism between genetic code and natural language. This idea had been discussed, almost dismissed and somehow accepted. This paper does not claim that the genetic code is a linguistic structure, but it highlights several important semiotic analogies between DNA and verbal language. Genetic code and natural language share a number of units, structures and operations. The syntactic and semantic parallelisms between those codes should lead to a methodological exchange between biology, linguistics and semiotics. During the 20th century, biology has become a pilot science, so that many disciplines have formulated their theories under models taken from biology. Computer science has become almost a bioinspired field thanks to the great development of natural computing and DNA computing. Biology and semiotics are two different sciences challenged by the same common goal of deciphering the codes of the nature. Linguistics could become another «bio-inspired» science by taking advantage of the structural and «semantic» similarities between the genetic code and natural language. Biological methods coming from computer science can be very useful in the field of linguistics, since they provide flexible and intuitive tools for describing natural languages. In this way, we obtain a theoretical framework where biology, linguistics and computer science exchange methods and interact, thanks to the semiotic parallelism between the genetic code a natural language. The influence of the semiotics of the genetic code in linguistics is parallel to the need of achieving an implementable formal description of natural language. In this paper we present an overview of different bio-inspired methods — from theoretical computer science — that during the last years have been successfully applied to several linguistics issues, from syntax to pragmatics.
- Published
- 2012
25. The challenges of statistical patterns of language: the case of Menzerath's law in genomes
- Author
-
Universitat Politècnica de Catalunya. Departament de Ciències de la Computació, Universitat Politècnica de Catalunya. Institut de Ciències de l'Educació, Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural, Universitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge, Ferrer Cancho, Ramon, Forns Fradera, Núria, Hernández Fernández, Antonio, Bel Enguix, Gemma, Baixeries i Juvillà, Jaume, Universitat Politècnica de Catalunya. Departament de Ciències de la Computació, Universitat Politècnica de Catalunya. Institut de Ciències de l'Educació, Universitat Politècnica de Catalunya. GPLN - Grup de Processament del Llenguatge Natural, Universitat Politècnica de Catalunya. LARCA - Laboratori d'Algorísmia Relacional, Complexitat i Aprenentatge, Ferrer Cancho, Ramon, Forns Fradera, Núria, Hernández Fernández, Antonio, Bel Enguix, Gemma, and Baixeries i Juvillà, Jaume
- Abstract
The importance of statistical patterns of language has been debated over decades. Although Zipf's law is perhaps the most popular case, recently, Menzerath's law has begun to be involved. Menzerath's law manifests in language, music and genomes as a tendency of the mean size of the parts to decrease as the number of parts increases in many situations. This statistical regularity emerges also in the context of genomes, for instance, as a tendency of species with more chromosomes to have a smaller mean chromosome size. It has been argued that the instantiation of this law in genomes is not indicative of any parallel between language and genomes because (a) the law is inevitable and (b) noncoding DNA dominates genomes. Here mathematical, statistical, and conceptual challenges of these criticisms are discussed. Two major conclusions are drawn: the law is not inevitable and languages also have a correlate of noncoding DNA. However, the wide range of manifestations of the law in and outside genomes suggests that the striking similarities between noncoding DNA and certain linguistics units could be anecdotal for understanding the recurrence of that statistical law., Peer Reviewed, Postprint (author's final draft)
- Published
- 2013
26. Molecular computing methods for natural languge syntax
- Author
-
Bel Enguix, Gemma, Martín Vide, Carlos, Universitat Rovira i Virgili. Departament de Filologies Romàniques, Departament de Filologies Romàniques, and Universitat Rovira i Virgili.
- Published
- 2001
27. Código genético y lenguaje verbal
- Author
-
Bel Enguix, Gemma, Jiménez-López, María Dolores, Bel Enguix, Gemma, and Jiménez-López, María Dolores
- Published
- 2006
28. Semántica natural como marco de verificación de compiladores en Coq
- Author
-
Zúñiga Chávez, Ángel Francisco, Sierra Martínez, Gerardo Eugenio, and Bel Enguix, Gemma
- Subjects
Ciencias Físico-Matemáticas e Ingenierías - Published
- 2021
29. Detección automática de pseudociencia
- Author
-
Martinell García, Víctor and Bel Enguix, Gemma
- Subjects
Ciencias Físico-Matemáticas e Ingenierías - Published
- 2019
30. Complexity and modeling power of insertion-deletion systems
- Author
-
Krassovitskiy, Alexander, Departament de Filologies Romàniques, Universitat Rovira i Virgili., Yurii Rogozhim, Verlan, Sergey, Bel Enguix, Gemma, and Universitat Rovira i Virgili. Departament de Filologies Romàniques
- Subjects
004 - Informàtica ,Complexity and modeling power ,51 - Matemàtiques - Abstract
SISTEMAS DE INSERCIÓN Y BORRADO: COMPLEJIDAD Y CAPACIDAD DE MODELADO El objetivo central de la tesis es el estudio de los sistemas de inserción y borrado y su capacidad computacional. Más concretamente, estudiamos algunos modelos de generación de lenguaje que usan operaciones de reescritura de dos cadenas. También consideramos una variante distribuida de los sistemas de inserción y borrado en el sentido de que las reglas se separan entre un número finito de nodos de un grafo. Estos sistemas se denominan sistemas controlados mediante grafo, y aparecen en muchas áreas de la Informática, jugando un papel muy importante en los lenguajes formales, la lingüística y la bio-informática. Estudiamos la decidibilidad/ universalidad de nuestros modelos mediante la variación de los parámetros de tamaño del vector. Concretamente, damos respuesta a la cuestión más importante concerniente a la expresividad de la capacidad computacional: si nuestro modelo es equivalente a una máquina de Turing o no. Abordamos sistemáticamente las cuestiones sobre los tamaños mínimos de los sistemas con y sin control de grafo., COMPLEXITY AND MODELING POWER OF INSERTION-DELETION SYSTEMS The central object of the thesis are insertion-deletion systems and their computational power. More specifically, we study language generating models that use two string rewriting operations: contextual insertion and contextual deletion, and their extensions. We also consider a distributed variant of insertion-deletion systems in the sense that rules are separated among a finite number of nodes of a graph. Such systems are refereed as graph-controlled systems. These systems appear in many areas of Computer Science and they play an important role in formal languages, linguistics, and bio-informatics. We vary the parameters of the vector of size of insertion-deletion systems and we study decidability/universality of obtained models. More precisely, we answer the most important questions regarding the expressiveness of the computational model: whether our model is Turing equivalent or not. We systematically approach the questions about the minimal sizes of the insertiondeletion systems with and without the graph-control.
- Published
- 2011
31. Petri net controlled grammars
- Author
-
Turaev, Sherzod, Dassow, Jürgen, Bel Enguix, Gemma, Universitat Rovira i Virgili. Departament de Filologies Romàniques, Departament de Filologies Romàniques, and Universitat Rovira i Virgili.
- Subjects
004 - Informàtica ,petri nets ,51 - Matemàtiques - Abstract
Different types of regulated grammars have been introduced in order to supplement shortcomings of context-free grammars in applications preserving their elegant mathematical properties. However, the rapid developments in present day industry, biology, and other areas challenge to deal with various tasks which need suitable tools for their modelling and investigation. We propose Petri net controlled grammars as models for representing and analyzing of metabolic pathways in living cells where Petri nets are responsible for the structure and communication of the pathways, and grammars represent biochemical processes. On the other hand, the control by Petri nets has also theoretical interest: it extends possibilities to introduce and investigate concurrent control mechanisms in formal language theory. The thesis introduces various variants of Petri net controlled grammars using different types of Petri nets and investigates their mathematical properties such as computational power and closure properties., Los diferentes tipos de gramáticas con reescritura regulada han sido introducidas para complementar las deficiencias de las gramáticas libres del contexto en las aplicaciones, preservando sus propiedades matemáticas. Por otro lado, la rápida evolución la biología, y otras áreas actuales supone un reto para tratar de las tareas varias que necesitan las herramientas adecuadas para la elaboración de modelos e investigación. Proponemos gramáticas controladas por redes de Petri como modelos para representar y analizar los procesos bioquímicos en las células vivas donde redes de Petri son responsables de la estructura, y gramáticas representan los procesos generativos. Además, el control de redes de Petri también tiene interés teórico: amplía las posibilidades de investigar los mecanismos de control concurrente en la teoría de lenguajes formales. La tesis presenta distintas variantes de gramáticas controladas por redes de Petri e investiga sus propiedades matemáticas.
- Published
- 2010
32. Language learning with correction queries
- Author
-
Tirnauca, Cristina, Higuera, Colin de, Mitrana, Victor, Bel Enguix, Gemma, Universitat Rovira i Virgili. Departament de Filologies Romàniques, Departament de Filologies Romàniques, and Universitat Rovira i Virgili.
- Subjects
aprendizaje a partir de preguntas ,inferencia gramatical - Abstract
In the field of grammatical inference, the goal of any learning algorithm is to identify a target concept from a given class by having access to a specific type of information. The main learning settings are Gold's model of learning in the limit and Angluin's query learning. This dissertation is primarily concerned with the second approach.We thoroughly investigate a recently introduced, linguistic motivated, type of query called Correction Query (CQ). We consider three possible definitions, and for each of them we give necessary and sufficient conditions for a language class to be learnable with these types of queries. Furthermore, we compare the model of learning with CQs with other well-known Gold-style and query learning models when no efficiency constraints are imposed. Results are also obtained for the restricted version of the model of learning with CQs in polynomial time. Additionally, we discuss the learnability of deterministic finite automata (DFAs) with correction and equivalence queries. We design several learning algorithms and we present a comparison between our algorithms and the standard algorithm for learning DFAs with membership and equivalence queries. These results are furthermore extended from string languages to tree languages., Dentro del ámbito de la inferencia gramatical, el objetivo de cualquier algoritmo de aprendizaje consiste en identificar un concepto de una clase dada, teniendo acceso a un tipo específico de información. Los dos modelos de aprendizaje principales son: el modelo de aprendizaje en el límite propuesto por Gold y el modelo de aprendizaje a partir de preguntas de Angluin. Esta tesis se centra fundamentalmente en el segundo tipo de aprendizaje. En esta tesis doctoral investigamos a fondo un tipo de preguntas introducidas recientemente, por motivos lingüísticos, denominada PREGUNTA DE CORRECCIÓN. Nosotros consideramos tres posibles definiciones, y para cada uno de ellos presentamos condiciones necesarias y suficientes para que una clase de lenguajes sea identificable con estos tipos de preguntas. Además, comparamos estos tres modelos de aprendizaje a partir de preguntas de corrección con otros modelos de aprendizaje, en el caso general (sin ninguna restricción de tiempo) y también imponiendo que los algoritmos sean polinómicos.Además, investigamos el aprendizaje de autómata finitos deterministas (AFD) con preguntas de corrección y equivalencia. Diseñamos varios algoritmos de aprendizaje y presentamos una comparación entre nuestros algoritmos y el algoritmo estándar de aprendizaje a partir de preguntas de pertenencia y equivalencia. Estos resultados son, además, extendidos para los lenguajes de árboles.
- Published
- 2009
33. Communication in membrana Systems with symbol Objects
- Author
-
Alhazov, Artiom, Departament de Filologies Romàniques, Universitat Rovira i Virgili., Freund, Rudolf, Rogozhin, Yurii, Bel Enguix, Gemma, and Universitat Rovira i Virgili. Departament de Filologies Romàniques
- Subjects
004 - Informàtica ,membrane compating ,parallel multiset processing ,Psystems ,sympont - Abstract
Esta tesis está dedicada a los sistemas de membranas con objetos-símbolo como marco teórico de los sistemas paralelos y distribuidos de procesamiento de multiconjuntos.Una computación de parada puede aceptar, generar o procesar un número, un vector o una palabra; por tanto el sistema define globalmente (a través de los resultados de todas sus computaciones) un conjunto de números, de vectores, de palabras (es decir, un lenguaje), o bien una función. En esta tesis estudiamos la capacidad de estos sistemas para resolver problemas particulares, así como su potencia computacional. Por ejemplo, las familias de lenguajes definidas por diversas clases de estos sistemas se comparan con las familias clásicas, esto es, lenguajes regulares, independientes del contexto, generados por sistemas 0L tabulados extendidos, generados por gramáticas matriciales sin chequeo de apariciones, recursivamente enumerables, etc. Se prestará especial atención a la comunicación de objetos entre regiones y a las distintas formas de cooperación entre ellos.Se pretende (Sección 3.4) realizar una formalización los sistemas de membranas y construir una herramienta tipo software para la variante que usa cooperación no distribuida, el navegador de configuraciones, es decir, un simulador, en el cual el usuario selecciona la siguiente configuración entre todas las posibles, estando permitido volver hacia atrás. Se considerarán diversos modelos distribuidos. En el modelo de evolución y comunicación (Capítulo 4) separamos las reglas tipo-reescritura y las reglas de transporte (llamadas symport y antiport). Los sistemas de bombeo de protones (proton pumping, Secciones 4.8, 4.9) constituyen una variante de los sistemas de evolución y comunicación con un modo restrictivo de cooperación. Un modelo especial de computación con membranas es el modelo puramente comunicativo, en el cual los objetos traspasan juntos una membrana. Estudiamos la potencia computacional de las sistemas de membranas con symport/antiport de 2 o 3 objetos (Capítulo 5) y la potencia computacional de las sistemas de membranas con alfabeto limitado (Capítulo 6).El determinismo (Secciones 4.7, 5.5, etc.) es una característica especial (restrictiva) de los sistemas computacionales. Se pondrá especial énfasis en analizar si esta restricción reduce o no la potencia computacional de los mismos. Los resultados obtenidos para sistemas de bombeo del protones están transferidos (Sección 7.3) a sistemas con catalizadores bistabiles. Unos ejemplos de aplicación concreta de los sistemas de membranas (Secciones 7.1, 7.2) son la resolución de problemas NP-completos en tiempo polinomial y la resolución de problemas de ordenación., This thesis deals with membrane systems with symbol objects as a theoretical framework of distributed parallel multiset processing systems.A halting computation can accept, generate or process a number, a vector or a word, so the system globally defines (by the results of all its computations) a set of numbers or a set of vectors or a set of words, (i.e., a language), or a function. The ability of these systems to solve particular problems is investigated, as well as their computational power, e.g., the language families defined by different classes of these systems are compared to the classical ones, i.e., regular, context-free, languages generated by extended tabled 0L systems, languages generated by matrix grammars without appearance checking, recursively enumerable languages, etc. Special attention is paid to communication of objects between the regions and to the ways of cooperation between the objects.An attempt to formalize the membrane systems is made (Section 3.4), and a software tool is constructed for the non-distributed cooperative variant, the configuration browser, i.e., a simulator, where the user chooses the next configuration among the possible ones and can go back. Different distributed models are considered. In the evolution-communication model (Chapter 4) rewriting-like rules are separated from transport rules. Proton pumping systems (Sections 4.8, 4.9) are a variant of the evolution-communication systems with a restricted way of cooperation. A special membrane computing model is a purely communicative one: the objects are moved together through a membrane. We study the computational power of membrane systems with symport/antiport of 2 or 3 objects (Chapter 5) and the computational power of membrane systems with a limited alphabet (Chapter 6).Determinism (Sections 4.7, 5.5, etc.) is a special property of computational systems; the question of whether this restriction reduces the computational power is addressed. The results on proton pumping systems can be carried over (Section 7.3) to the systems with bi-stable catalysts. Some particular examples of membrane systems applications are solving NP-complete problems in polynomial time, and solving the sorting problem.
- Published
- 2006
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.