549 results on '"Martínez-Barco, Patricio"'
Search Results
2. Applying Human-in-the-Loop to construct a dataset for determining content reliability to combat fake news
- Author
-
Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete, Estela, Martínez-Barco, Patricio, Piad-Morffis, Alejandro, and Estevez-Velarde, Suilan
- Published
- 2023
- Full Text
- View/download PDF
3. A semi-automatic annotation methodology that combines Summarization and Human-In-The-Loop to create disinformation detection resources
- Author
-
Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete, Estela, and Martínez-Barco, Patricio
- Published
- 2023
- Full Text
- View/download PDF
4. Why are some social-media contents more popular than others? Opinion and association rules mining applied to virality patterns discovery
- Author
-
Saquete, Estela, Zubcoff, Jose, Gutiérrez, Yoan, Martínez-Barco, Patricio, and Fernández, Javi
- Published
- 2022
- Full Text
- View/download PDF
5. Exploiting discourse structure of traditional digital media to enhance automatic fake news detection
- Author
-
Bonet-Jover, Alba, Piad-Morffis, Alejandro, Saquete, Estela, Martínez-Barco, Patricio, and Ángel García-Cumbreras, Miguel
- Published
- 2021
- Full Text
- View/download PDF
6. Semi-Automatic Dataset Annotation Applied to Automatic Violent Message Detection
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Botella-Gil, Beatriz, Sepúlveda-Torres, Robiert, Bonet-Jover, Alba, Martínez-Barco, Patricio, Saquete Boró, Estela, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Botella-Gil, Beatriz, Sepúlveda-Torres, Robiert, Bonet-Jover, Alba, Martínez-Barco, Patricio, and Saquete Boró, Estela
- Abstract
Annotated corpora are indispensable tools to train computational models in Artificial Intelligence and Natural Language Processing. However, manual annotation is a costly, arduous, and time-consuming task, especially when the annotation is semantically complex. To address the problem, this work applies a methodology for semi-automatic annotation of datasets based on the Human-in-the-Loop paradigm. The methodology supports the building a resource, that benefits from a fine-grained annotation, to aid in the detection of Spanish violent messages sourced from social media (Twitter/X). After implementing the proposed methodology for semi-automatic violence annotation, a high quality resource was obtained (hereafter referred to as VILLANOS). The methodology consists of annotating the dataset incrementally, which delivers an increase in annotator efficiency, thereby validating the suitability of the proposal. Annotation time was reduced by 52% compared to manual annotation and performance, by training a model with the VILLANOS dataset, obtains an F 1 of 85.2%. These results demonstrate the efficiency and effectiveness of the methodology, evidencing its validity.
- Published
- 2024
7. SocialFairness: Assessing Fairness in Digital Media
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Ureña López, Luis Alfonso, Martín Valdivia, María Teresa, Saquete Boró, Estela, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Ureña López, Luis Alfonso, Martín Valdivia, María Teresa, Saquete Boró, Estela, and Martínez-Barco, Patricio
- Abstract
The proliferation of hoaxes on the Internet and toxic messages (with offensive or very negative content and high dissemination rates) constitute a current problem on the road to truthful and respectful information. Due to the enormous influence that social media have in the generation of opinion and as a channel of information for society, the efforts that various companies, organisations and institutions are making to detect and counteract the high volume of disinformation circulating on the networks are important. This project deals with the implementation of a proof of concept of a system for analysing the fairness of messages published through social media, built on the basis of various methods and algorithms from human language technologies. These methods and algorithms are the result of research that the participating groups have been working on for the last few years and are promising solutions for the determination of different levels of quality of publications in two fundamental aspects: their veracity and their toxicity. To address the proof of concept, activities aimed at the definition and integration of these technologies and their evaluation by stakeholders are proposed. This will make it possible to establish the responsiveness of these technologies to the needs of society and industry, as well as their viability to work towards higher levels of technological maturity.
- Published
- 2024
8. Overview of FLARES at IberLEF 2024: Fine-grained Language-based Reliability Detection in Spanish News.
- Author
-
Sepúlveda-Torres, Robiert, Bonet-Jover, Alba, Diab, Isam, Guillén-Pacho, Ibai, Cabrera-de Castro, Isabel, Badenes-Olmedo, Carlos, Saquete, Estela, Teresa Martín-Valdivia, M., Martínez-Barco, Patricio, and Alfonso Ureña López, L.
- Subjects
LANGUAGE models ,SPANISH language ,LINGUISTICS ,NATURAL languages ,INFORMATION processing - Abstract
Copyright of Procesamiento del Lenguaje Natural is the property of Sociedad Espanola para el Procesamiento del Lenguaje Natural and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
- Published
- 2024
- Full Text
- View/download PDF
9. Fighting post-truth using natural language processing: A review and open challenges
- Author
-
Saquete, Estela, Tomás, David, Moreda, Paloma, Martínez-Barco, Patricio, and Palomar, Manuel
- Published
- 2020
- Full Text
- View/download PDF
10. Semi-automatic dataset annotation applied to automatic violent message detection
- Author
-
Botella-Gil, Beatriz, primary, Sepúlveda-Torres, Robiert, additional, Bonet-Jover, Alba, additional, Martínez-Barco, Patricio, additional, and Saquete, Estela, additional
- Published
- 2024
- Full Text
- View/download PDF
11. Enhancing QA Systems with Complex Temporal Question Processing Capabilities
- Author
-
Saquete, Estela, Vicedo, Jose Luis, Martínez-Barco, Patricio, Muñoz, Rafael, and Llorens, Hector
- Subjects
Computer Science - Computation and Language ,Computer Science - Artificial Intelligence ,Computer Science - Information Retrieval - Abstract
This paper presents a multilayered architecture that enhances the capabilities of current QA systems and allows different types of complex questions or queries to be processed. The answers to these questions need to be gathered from factual information scattered throughout different documents. Specifically, we designed a specialized layer to process the different types of temporal questions. Complex temporal questions are first decomposed into simple questions, according to the temporal relations expressed in the original question. In the same way, the answers to the resulting simple questions are recomposed, fulfilling the temporal restrictions of the original complex question. A novel aspect of this approach resides in the decomposition which uses a minimal quantity of resources, with the final aim of obtaining a portable platform that is easily extensible to other languages. In this paper we also present a methodology for evaluation of the decomposition of the questions as well as the ability of the implemented temporal layer to perform at a multilingual level. The temporal layer was first performed for English, then evaluated and compared with: a) a general purpose QA system (F-measure 65.47% for QA plus English temporal layer vs. 38.01% for the general QA system), and b) a well-known QA system. Much better results were obtained for temporal questions with the multilayered system. This system was therefore extended to Spanish and very good results were again obtained in the evaluation (F-measure 40.36% for QA plus Spanish temporal layer vs. 22.94% for the general QA system).
- Published
- 2014
- Full Text
- View/download PDF
12. RUN-AS: a novel approach to annotate news reliability for disinformation detection
- Author
-
Bonet-Jover, Alba, primary, Sepúlveda-Torres, Robiert, additional, Saquete, Estela, additional, Martínez-Barco, Patricio, additional, and Nieto-Pérez, Mario, additional
- Published
- 2023
- Full Text
- View/download PDF
13. A semi-automatic annotation methodology that combines Summarization and Human-In-The-Loop to create disinformation detection resources
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, and Martínez-Barco, Patricio
- Abstract
Early detection of disinformation is one of the most challenging big-scale problems facing present day society. This is why the application of technologies such as Artificial Intelligence and Natural Language Processing is necessary. The vast majority of Artificial Intelligence approaches require annotated data, and generating these resources is very expensive. This proposal aims to improve the efficiency of the annotation process with a two-level semi-automatic annotation methodology. The first level extracts relevant information through summarization techniques. The second applies a Human-in-the-Loop strategy whereby the labels are pre-annotated by the machine, corrected by the human and reused by the machine to retrain the automatic annotator. After evaluating the system, the average annotation time per news item is reduced by 50%. In addition, a set of experiments on the semi-automatically annotated dataset that is generated are performed so as to demonstrate the effectiveness of the proposal. Although the dataset is annotated in terms of unreliable content, it is applied to the veracity detection task with very promising results (0.95 accuracy in reliability detection and 0.78 in veracity detection).
- Published
- 2023
14. Abordando el tratamiento automático de la desinformación: modelado de la confiabilidad en noticias mediante Procesamiento del Lenguaje Natural
- Author
-
Saquete Boró, Estela, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Bonet-Jover, Alba, Saquete Boró, Estela, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Bonet-Jover, Alba
- Abstract
La llegada de Internet y de las nuevas tecnologías dio lugar al nacimiento de la era de la información, una era que ha conectado a la sociedad de forma global y le ha permitido acceder libremente a la información digital. Con esta facilidad de acceso, cualquier persona, aún sin ser experta en la materia, puede publicar y acceder a la información sin ningún coste, lo que ha ocasionado un exceso de información no contrastada que muchas veces oculta intenciones como el engaño, la manipulación o los fines económicos. De esa forma, la era de la información se ha transformado en la era de la desinformación. La incesante necesidad de estar informados ha motivado que el consumo de la información se convierta en una rutina, ya sea siguiendo las últimas noticias en portales digitales o leyendo a diario publicaciones de personas afines. Antes, la información viajaba en forma de sonido a través de la radio o en forma de tinta a través de los periódicos, pero ahora una desmedida cantidad de información se propaga a través de algoritmos. Las tecnologías han propiciado la sobreabundancia de información, así como la propagación de noticias falsas y bulos, hasta tal punto que resulta imposible contrastar y procesar manualmente tales volúmenes de desinformación en tiempo real. No obstante, lo que se considera un problema puede convertirse en una solución, pues igual que los algoritmos y el entorno digital son los causantes de la viralización de la información falsa, estos pueden ser a su vez los detectores de la desinformación. Es aquí donde el Procesamiento del Lenguaje Natural desempeña un papel clave en la relación humano-máquina, modelando el lenguaje humano a través de la comprensión y generación automática del lenguaje, y entrenando modelos a través de la retroalimentación del experto. El trabajo coordinado entre la ingeniería computacional y la lingüística es decisivo a la hora de frenar el fenómeno de la desinformación. Son necesarias las dos perspectivas para abordar la dete
- Published
- 2023
15. Análisis de la influencia de las redes sociales en cotizaciones de valores bursátiles
- Author
-
Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Llopis Quereda, Fernando, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Llopis Quereda, Fernando
- Abstract
El objetivo de este trabajo es la creación de una plataforma que permita la monitorización del mercado bursátil y de criptomonedas de manera actualizada. Además, esta plataforma permitirá añadir propuestas en el mercado realizadas por expertos económicos para ver su evolución y si estos suelen acertar en sus pronósticos. Esto por otro lado, con los suficientes datos, podrá ayudar a evitar a aquellos que den consejos sin realmente tener conocimiento, ya que en estos últimos años ha habido un gran aumento de “influencers” que dan consejos sobre esto, pero realmente no tienen idea alguna. Todos estos datos serán proyectados de manera visual para su fácil entendimiento. Todos estos prescriptores, con sus correspondientes propuestas y valores, serán almacenados en una base de datos en la que se almacenarán además las fuentes de donde han sido creadas las recomendaciones, los índices bursátiles y como veremos a continuación, los datos obtenidos de la herramienta que será usada para analizar el efecto de las redes sociales en los valores. Para poder descargar la herramienta utilizada se habilitará un enlace en GitHub, el cual se encontrará en la bibliografía. Por otro lado, se realizará un análisis del efecto de las redes sociales en el mercado bursátil y de criptomonedas mediante una herramienta tecnológica llamada Social Analytics para saber si realmente hay alguna correlación entre ellas y poder determinar si basarse en las redes sociales sería una estrategia determinante para invertir. Este análisis será llevado a cabo con las medidas que dispone esta herramienta, en diferentes periodos de tiempo y podrán ser representados los resultados en la plataforma que ha sido creada con la que realizaré un análisis de los resultados. En este trabajo también se profundizarán sobre algunos ejemplos que han sucedido sobre la influencia de las redes sociales, como el famoso caso de Elon Musk y la criptomoneda DogeCoin que supuso todo un cambio en la cotización del precio de esta. Fi
- Published
- 2023
16. CLEAR.TEXT Enhancing the Modernization Public Sector Organizations by Deploying Natural Language Processing to Make Their Digital Content CLEARER to Those with Cognitive Disabilities
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Moreda, Paloma, Botella, Beatriz, Espinosa-Zaragoza, Isabel, Lloret, Elena, Martin, Tania Josephine, Martínez-Barco, Patricio, Suárez Cueto, Armando, Palomar, Manuel, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Moreda, Paloma, Botella, Beatriz, Espinosa-Zaragoza, Isabel, Lloret, Elena, Martin, Tania Josephine, Martínez-Barco, Patricio, Suárez Cueto, Armando, and Palomar, Manuel
- Abstract
The CLEAR.TEXT project (TED2021-130707B-I00) researches how natural language processing technology can support the authoring of accessible content in Spanish for people with cognitive disabilities. Our main objective is to research, implement, deploy, evaluate, and ultimately provide robust technologies for natural language processing to support the authoring of accessible Spanish content for public sector organisations (at local, regional and national level) that is intelligible to people with cognitive disability, thereby widening their inclusion and empowerment in Europe. It is expected to impact positively the quality of life of people with cognitive disabilities, facilitating their access to educational, vocational, cultural, and social opportunities in public sector organisations.
- Published
- 2023
17. Applying Human-in-the-Loop to construct a dataset for determining content reliability to combat fake news
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante. Instituto Universitario de Investigación Informática, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, Piad-Morffis, Alejandro, Estévez-Velarde, Suilan, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante. Instituto Universitario de Investigación Informática, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, Piad-Morffis, Alejandro, and Estévez-Velarde, Suilan
- Abstract
Annotated corpora are indispensable tools to train computational models in Natural Language Processing. However, in the case of more complex semantic annotation processes, it is a costly, arduous, and time-consuming task, resulting in a shortage of resources to train Machine Learning and Deep Learning algorithms. In consideration, this work proposes a methodology, based on the human-in-the-loop paradigm, for semi-automatic annotation of complex tasks. This methodology is applied in the construction of a reliability dataset of Spanish news so as to combat disinformation and fake news. We obtain a high quality resource by implementing the proposed methodology for semi-automatic annotation, increasing annotator efficacy and speed, with fewer examples. The methodology consists of three incremental phases and results in the construction of the RUN dataset. The annotation quality of the resource was evaluated through time-reduction (annotation time reduction of almost 64% with respect to the fully manual annotation), annotation quality (measuring consistency of annotation and inter-annotator agreement), and performance by training a model with RUN semi-automatic dataset (Accuracy 95% F1 95%), validating the suitability of the proposal.
- Published
- 2023
18. RUN-AS: a novel approach to annotate news reliability for disinformation detection
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante. Instituto Universitario de Investigación Informática, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, Nieto Pérez, Mario, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante. Instituto Universitario de Investigación Informática, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, and Nieto Pérez, Mario
- Abstract
The development of the internet and digital technologies has inadvertently facilitated the huge disinformation problem that faces society nowadays. This phenomenon impacts ideologies, politics and public health. The 2016 US presidential elections, the Brexit referendum, the COVID-19 pandemic and the Russia-Ukraine war have been ideal scenarios for the spreading of fake news and hoaxes, due to the massive dissemination of information. Assuming that fake news mixes reliable and unreliable information, we propose RUN-AS (Reliable and Unreliable Annotation Scheme), a fine-grained annotation scheme that enables the labelling of the structural parts and essential content elements of a news item and their classification into Reliable and Unreliable. This annotation proposal aims to detect disinformation patterns in text and to classify the global reliability of news. To this end, a dataset in Spanish was built and manually annotated with RUN-AS and several experiments using this dataset were conducted to validate the annotation scheme by using Machine Learning (ML) and Deep Learning (DL) algorithms. The experiments evidence the validity of the annotation scheme proposed, obtaining the best F1m, 0.948, with the Decision Tree algorithm.
- Published
- 2023
19. Violencia Identificada en el Lenguaje (VIL). Creación de recurso para mensajes violentos
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Botella, Beatriz, Sepúlveda-Torres, Robiert, Martínez-Barco, Patricio, Saquete Boró, Estela, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Botella, Beatriz, Sepúlveda-Torres, Robiert, Martínez-Barco, Patricio, and Saquete Boró, Estela
- Abstract
La sociedad avanza cargada de conocimientos nuevos y muy accesibles, que se publican en el mundo virtual. Es una realidad que las Tecnologías de la Información y la Comunicación (TIC) han traído muchos beneficios a nuestras vidas pero también vemos como año tras año aumenta el uso de violencia en plataformas digitales. Nuestro trabajo se enfoca en la creación de recursos que permitan la detección de mensajes violentos en la red social Twitter. Se parte de la creación de una guía de anotación de grano fino para anotar un corpus de mensajes violentos (VIL) con el fin de utilizar herramientas de aprendizaje automático que nos ayuden a detectar automáticamente el problema. Con este corpus se entrenan dos modelos de lenguaje (BETO y RoBERTa base) con los que se alcanza un valor en la métrica F1m de 97.03% y 96.51% clasificando si un tuit es o no violento., Society is moving forward full of new and very accessible knowledge, which is published in the virtual world. It is a reality that ICTs have brought many benefits to our lives but we also see how year after year the use of violence on digital platforms increases. Our work focuses on the detection of violent messages in the social network Twitter. Starting from the creation of a fine-grained annotation guide to obtain a corpus of violent messages (VIL) in order to use Machine Learning tools that help us to automatically detect the problem Two language models are trained with this corpus (BETO and RoBERTa base) with which a value of 97.03% and 96.51% is reached in the F1m metric, classifying whether or not a tweet is violent.
- Published
- 2023
20. Annotating reliability to enhance disinformation detection: annotation scheme, resource and evaluation
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, and Martínez-Barco, Patricio
- Abstract
Disinformation is a critical problem in our society. The COVID-19 pandemic and the Russia-Ukraine war have been key events for the spreading of fake news. Assuming that fake news mixes reliable and unreliable information, we propose RUN-AS (Reliable and Unreliable Annotation Scheme), a fine-grained annotation scheme that labels the structural parts and essential content elements of a news item to enable their classification into Reliable and Unreliable. This type of annotation will be used for training systems to automatically classify the reliability of a news item. To this end, RUN dataset in Spanish was built and annotated with RUN-AS. A set of experiments were conducted to validate the annotation scheme. The experiments evidence the validity of the annotation scheme proposed, obtaining the best F1m, i.e., 0.948., La desinformación es un problema crítico en nuestra sociedad. La pandemia de covid-19 y la guerra entre Rusia y Ucrania han sido escenarios clave para la difusión de noticias falsas. Partiendo de la base de que las noticias falsas mezclan información confiable y no confiable, proponemos RUN-AS (Reliable and Unreliable Annotation Scheme), un esquema de anotación de grano fino que etiqueta las partes estructurales y los elementos de contenido esenciales de una noticia y permite clasificarlos en Confiable y No confiable. Esta anotación será usada en el entrenamiento de sistemas para la clasificación automática de la confiabilidad de una noticia. Para ello, se construyó el corpus RUN en español y se anotó con RUN-AS. Se llevó a cabo un conjunto de experimentos para validar el esquema de anotación. Los experimentos evidencian la validez del esquema de anotación propuesto, obteniendo el mejor F1m 0,948.
- Published
- 2023
21. Generación y pesado de skipgrams y su aplicación al análisis de sentimientos
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Fernández Martínez, Javier, Gutiérrez, Yoan, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Fernández Martínez, Javier, Gutiérrez, Yoan, and Martínez-Barco, Patricio
- Abstract
El modelado de skipgrams es una técnica para la generación de términos multi-palabra que conserva parte de la secuencialidad y flexibilidad del lenguaje. Sin embargo, en algunos casos el número de skipgrams generados puede ser excesivo a medida que se aumenta la distancia entre palabras. Además, esta distancia no suele ser tenida en cuenta a la hora de valorar los términos que se generan. En este trabajo proponemos una técnica para la generación y filtrado eficientes de skipgrams y un esquema de pesado que tiene en cuenta la distancia entre los términos, dando más importancia a aquellos más cercanos. Aplicaremos y evaluaremos estas propuestas en la tarea de análisis de sentimientos., Skipgram modelling is a technique for generating multi-word terms that preserves some of the sequentiality and flexibility of the language. However, in some cases the number of skipgrams generated may become excessive as the distance between words increases. Moreover, this distance is often not taken into account when evaluating the terms that are generated. In this paper we propose a technique for efficient skipgram generation and filtering, and a weighing scheme that takes into account the distance between terms, giving more importance to those closer. We will apply and evaluate these proposals in the task of sentiment analysis.
- Published
- 2023
22. A novel concept-level approach for ultra-concise opinion summarization
- Author
-
Lloret, Elena, Boldrini, Ester, Vodolazova, Tatiana, Martínez-Barco, Patricio, Muñoz, Rafael, and Palomar, Manuel
- Published
- 2015
- Full Text
- View/download PDF
23. Annotating reliability to enhance disinformation detection: annotation scheme, resource and evaluation
- Author
-
Bonet-Jover, Alba, Sepúlveda-Torres, Robiert, Saquete Boró, Estela, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
- Subjects
Annotation Guideline ,Reliability Detection ,Procesamiento Lenguaje Natural ,Guía Anotación ,Anotación Corpus ,Disinformation Detection ,Detección Confiabilidad ,Dataset Annotation ,Detección Desinformación ,Natural Language Processing - Abstract
Disinformation is a critical problem in our society. The COVID-19 pandemic and the Russia-Ukraine war have been key events for the spreading of fake news. Assuming that fake news mixes reliable and unreliable information, we propose RUN-AS (Reliable and Unreliable Annotation Scheme), a fine-grained annotation scheme that labels the structural parts and essential content elements of a news item to enable their classification into Reliable and Unreliable. This type of annotation will be used for training systems to automatically classify the reliability of a news item. To this end, RUN dataset in Spanish was built and annotated with RUN-AS. A set of experiments were conducted to validate the annotation scheme. The experiments evidence the validity of the annotation scheme proposed, obtaining the best F1m, i.e., 0.948. La desinformación es un problema crítico en nuestra sociedad. La pandemia de covid-19 y la guerra entre Rusia y Ucrania han sido escenarios clave para la difusión de noticias falsas. Partiendo de la base de que las noticias falsas mezclan información confiable y no confiable, proponemos RUN-AS (Reliable and Unreliable Annotation Scheme), un esquema de anotación de grano fino que etiqueta las partes estructurales y los elementos de contenido esenciales de una noticia y permite clasificarlos en Confiable y No confiable. Esta anotación será usada en el entrenamiento de sistemas para la clasificación automática de la confiabilidad de una noticia. Para ello, se construyó el corpus RUN en español y se anotó con RUN-AS. Se llevó a cabo un conjunto de experimentos para validar el esquema de anotación. Los experimentos evidencian la validez del esquema de anotación propuesto, obteniendo el mejor F1m 0,948. This research work is funded by MCIN/AEI/10.13039/501100011033 and European Union NextGenerationEU/PRTR through the projects “TRIVIAL” (PID2021-122263OB-C22) and “SocialTrust” (PDC2022-133146-C22). It is also supported by Generalitat Valenciana through the project “NL4DISMIS” (CIPROM/2021/21) and Consellería de Innovación, Universidades, Ciencia y Sociedad Digital (ACIF/2020/177).
- Published
- 2023
24. Skipgrams Generation and Weighting and its Application to Sentiment Analysis
- Author
-
Fernández Martínez, Javier, Gutiérrez, Yoan, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
- Subjects
Generación de términos ,Análisis de sentimientos ,Sentiment analysis ,Term weighting ,Term generation ,Skipgrams ,Pesado de términos - Abstract
El modelado de skipgrams es una técnica para la generación de términos multi-palabra que conserva parte de la secuencialidad y flexibilidad del lenguaje. Sin embargo, en algunos casos el número de skipgrams generados puede ser excesivo a medida que se aumenta la distancia entre palabras. Además, esta distancia no suele ser tenida en cuenta a la hora de valorar los términos que se generan. En este trabajo proponemos una técnica para la generación y filtrado eficientes de skipgrams y un esquema de pesado que tiene en cuenta la distancia entre los términos, dando más importancia a aquellos más cercanos. Aplicaremos y evaluaremos estas propuestas en la tarea de análisis de sentimientos. Skipgram modelling is a technique for generating multi-word terms that preserves some of the sequentiality and flexibility of the language. However, in some cases the number of skipgrams generated may become excessive as the distance between words increases. Moreover, this distance is often not taken into account when evaluating the terms that are generated. In this paper we propose a technique for efficient skipgram generation and filtering, and a weighing scheme that takes into account the distance between terms, giving more importance to those closer. We will apply and evaluate these proposals in the task of sentiment analysis. Esta investigación ha sido financiada por la Universidad de Alicante, el Ministerio de Ciencia e Innovación de España, la Generalitat Valenciana y el Fondo Europeo de Desarrollo Regional (FEDER) a través de la siguiente financiación: a nivel nacional, se concedieron los proyectos TRIVIAL (PID2021-122263OB-C22), Social-Trust (PDC2022-133146-C22) y CLEARTEXT (TED2021-130707B-I00), financiados por MCIN/AEI/10.13039/501100011033 y European Union NextGenerationEU/PRTR; a nivel regional, la Generalitat Valenciana (Conselleria d’Educació, Investigació, Cultura i Esport), concedió financiación para NL4DISMIS (CIPROM/2021/21). Además, contó con el apoyo de dos acciones COST: CA19134 - “Distributed Knowledge Graphs” y CA19142 - “Leading Platform for European Citizens, Industries, Academia, and Policymakers in Media Accessibility”.
- Published
- 2023
25. Violence Identified in Language (VIL). Creation of a resource for the detection of violent messages
- Author
-
Botella, Beatriz, Sepúlveda-Torres, Robiert, Martínez-Barco, Patricio, Saquete Boró, Estela, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
- Subjects
Detección Mensajes Violentos ,Detection of Violent Messages ,Annotation Guideline ,Procesamiento Lenguaje Natural ,Guía Anotación ,Anotación Corpus ,Dataset Annotation ,Natural Language Processing - Abstract
La sociedad avanza cargada de conocimientos nuevos y muy accesibles, que se publican en el mundo virtual. Es una realidad que las Tecnologías de la Información y la Comunicación (TIC) han traído muchos beneficios a nuestras vidas pero también vemos como año tras año aumenta el uso de violencia en plataformas digitales. Nuestro trabajo se enfoca en la creación de recursos que permitan la detección de mensajes violentos en la red social Twitter. Se parte de la creación de una guía de anotación de grano fino para anotar un corpus de mensajes violentos (VIL) con el fin de utilizar herramientas de aprendizaje automático que nos ayuden a detectar automáticamente el problema. Con este corpus se entrenan dos modelos de lenguaje (BETO y RoBERTa base) con los que se alcanza un valor en la métrica F1m de 97.03% y 96.51% clasificando si un tuit es o no violento. Society is moving forward full of new and very accessible knowledge, which is published in the virtual world. It is a reality that ICTs have brought many benefits to our lives but we also see how year after year the use of violence on digital platforms increases. Our work focuses on the detection of violent messages in the social network Twitter. Starting from the creation of a fine-grained annotation guide to obtain a corpus of violent messages (VIL) in order to use Machine Learning tools that help us to automatically detect the problem Two language models are trained with this corpus (BETO and RoBERTa base) with which a value of 97.03% and 96.51% is reached in the F1m metric, classifying whether or not a tweet is violent. Esta investigación ha sido financiada por MCIN/AEI/ 10.13039/501100011033 y la Unión Europea NextGenerationEU/PRTR a través de los proyectos “TRIVIAL” (PID2021-122263OB-C22) and “SocialTrust” (PDC2022-133146-C22). También cuenta con el apoyo de la Generalitat Valenciana a través del proyecto “NL4DISMIS” (CIPROM/2021/21).
- Published
- 2023
26. Evaluating EmotiBlog Robustness for Sentiment Analysis Tasks
- Author
-
Fernández, Javi, Boldrini, Ester, Gómez, José Manuel, Martínez-Barco, Patricio, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Muñoz, Rafael, editor, Montoyo, Andrés, editor, and Métais, Elisabeth, editor
- Published
- 2011
- Full Text
- View/download PDF
27. IBQAst: A Question Answering System for Text Transcriptions
- Author
-
Pardiño, María, Gómez, José M., Llorens, Héctor, Muñoz-Terol, Rafael, Navarro-Colorado, Borja, Saquete, Estela, Martínez-Barco, Patricio, Moreda, Paloma, Palomar, Manuel, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Peters, Carol, editor, Deselaers, Thomas, editor, Ferro, Nicola, editor, Gonzalo, Julio, editor, Jones, Gareth J. F., editor, Kurimo, Mikko, editor, Mandl, Thomas, editor, Peñas, Anselmo, editor, and Petras, Vivien, editor
- Published
- 2009
- Full Text
- View/download PDF
28. Integrating Logic Forms and Anaphora Resolution in the AliQAn System
- Author
-
Muñoz-Terol, Rafael, Puchol-Blasco, Marcel, Pardiño, María, Gómez, José Manuel, Roger, Sandra, Vila, Katia, Ferrández, Antonio, Peral, Jesús, Martínez-Barco, Patricio, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, Peters, Carol, editor, Deselaers, Thomas, editor, Ferro, Nicola, editor, Gonzalo, Julio, editor, Jones, Gareth J. F., editor, Kurimo, Mikko, editor, Mandl, Thomas, editor, Peñas, Anselmo, editor, and Petras, Vivien, editor
- Published
- 2009
- Full Text
- View/download PDF
29. Evaluation of an Automatic Extension of Temporal Expression Treatment to Catalan
- Author
-
Saquete, Estela, Martínez-Barco, Patricio, Muñoz, Rafael, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Nierstrasz, Oscar, Series editor, Rangan, C. Pandu, Series editor, Steffen, Bernhard, Series editor, Sudan, Madhu, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Vardi, Moshe Y., Series editor, Weikum, Gerhard, Series editor, and Gelbukh, Alexander, editor
- Published
- 2007
- Full Text
- View/download PDF
30. Applying Logic Forms and Statistical Methods to CL-SR Performance
- Author
-
Terol, Rafael M., Martinez-Barco, Patricio, Palomar, Manuel, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Doug, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Peters, Carol, editor, Clough, Paul, editor, Gey, Fredric C., editor, Karlgren, Jussi, editor, Magnini, Bernardo, editor, Oard, Douglas W., editor, de Rijke, Maarten, editor, and Stempfhuber, Maximilian, editor
- Published
- 2007
- Full Text
- View/download PDF
31. Applying NLP Techniques and Biomedical Resources to Medical Questions in QA Performance
- Author
-
Terol, Rafael M., Martinez-Barco, Patricio, Palomar, Manuel, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Gelbukh, Alexander, editor, and Reyes-Garcia, Carlos Alberto, editor
- Published
- 2006
- Full Text
- View/download PDF
32. The University of Alicante at CL-SR Track
- Author
-
Terol, Rafael M., Palomar, Manuel, Martinez-Barco, Patricio, Llopis, Fernando, Muñoz, Rafael, Noguera, Elisa, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Peters, Carol, editor, Gey, Fredric C., editor, Gonzalo, Julio, editor, Müller, Henning, editor, Jones, Gareth J. F., editor, Kluck, Michael, editor, Magnini, Bernardo, editor, and de Rijke, Maarten, editor
- Published
- 2006
- Full Text
- View/download PDF
33. A Knowledge Based Strategy for Recognising Textual Entailment
- Author
-
Ferrández, Óscar, Terol, Rafael M., Muñoz, Rafael, Martínez-Barco, Patricio, Palomar, Manuel, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Sojka, Petr, editor, Kopeček, Ivan, editor, and Pala, Karel, editor
- Published
- 2006
- Full Text
- View/download PDF
34. A Study of the Influence of PoS Tagging on WSD
- Author
-
Moreno-Monteagudo, Lorenza, Izquierdo-Beviá, Rubén, Martínez-Barco, Patricio, Suárez, Armando, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Sojka, Petr, editor, Kopeček, Ivan, editor, and Pala, Karel, editor
- Published
- 2006
- Full Text
- View/download PDF
35. Deep vs. Shallow Semantic Analysis Applied to Textual Entailment Recognition
- Author
-
Ferrández, Óscar, Terol, Rafael Muñoz, Muñoz, Rafael, Martínez-Barco, Patricio, Palomar, Manuel, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Salakoski, Tapio, editor, Ginter, Filip, editor, Pyysalo, Sampo, editor, and Pahikkala, Tapio, editor
- Published
- 2006
- Full Text
- View/download PDF
36. LIVING-LANG: Tecnologías del lenguaje humano para entidades digitales vivas
- Author
-
Ureña López, Luis Alfonso, Saquete Boró, Estela, Martín Valdivia, María Teresa, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Procesamiento del Lenguaje y Sistemas de Información (GPLSI)
- Subjects
Emotion Mining ,Sentiment Enrichment ,Sentiment Analysis ,Natural Language Processing - Abstract
This project pursues the dynamic modeling at a spatial-temporal level of digital entities in social media for predicting their behavior. Firstly, digital entities are modelled by identifying the characteristics of individuals through their language and footprint on the network. Then, the extraction of relationships between digital entities is one of the nuclear challenges of the project. The proposal pursues this objective on a semantic level, structuring the information into representations of knowledge suitable for logical processing. Considering the heterogeneous nature of the sources to be dealt with, filtering of information is fundamental, using metrics and quality criteria. This spatial-temporal characterization, together with screening processes, will allow us to study high-performance predictive strategies in the evolution of digital entities. This project is coordinated by the SINAI and GPLSI research groups. This research work is funded by MCIN/AEI/10.13039/501100011033 and, as appropriate, by “ERDF A way of making Europe”, by the “European Union” or by the “European Union NextGenerationEU/PRTR” through the grant LIVING-LANG Project (RTI2018-094653-B-C21 / C22). It is a coordinated project with SINAI and GPLSI as participating research groups. It is also funded by Generalitat Valenciana through the project NL4DISMIS: Natural Language Technologies for dealing with dis-and misinformation (CIPROM/2021/21).
- Published
- 2022
37. An Application of NLP Rules to Spoken Document Segmentation Task
- Author
-
Terol, Rafael M., Martínez-Barco, Patricio, Llopis, Fernando, Martínez, Trinitario, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Montoyo, Andrés, editor, Muńoz, Rafael, editor, and Métais, Elisabeth, editor
- Published
- 2005
- Full Text
- View/download PDF
38. Semantic Annotation of a Natural Language Corpus for Knowledge Extraction
- Author
-
Navarro, Borja, Martínez-Barco, Patricio, Palomar, Manuel, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Montoyo, Andrés, editor, Muńoz, Rafael, editor, and Métais, Elisabeth, editor
- Published
- 2005
- Full Text
- View/download PDF
39. Spoken Document Retrieval Experiments with IR-n System
- Author
-
Llopis, Fernando, Martínez-Barco, Patricio, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Peters, Carol, editor, Gonzalo, Julio, editor, Braschler, Martin, editor, and Kluck, Michael, editor
- Published
- 2004
- Full Text
- View/download PDF
40. An Architecture for Spoken Document Retrieval
- Author
-
Terol, Rafael M., Martínez-Barco, Patricio, Palomar, Manuel, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Sojka, Petr, editor, Kopeček, Ivan, editor, and Pala, Karel, editor
- Published
- 2004
- Full Text
- View/download PDF
41. Event Ordering Using TERSEO System
- Author
-
Saquete, Estela, Muñoz, Rafael, Martínez-Barco, Patricio, Hutchison, David, editor, Kanade, Takeo, editor, Kittler, Josef, editor, Kleinberg, Jon M., editor, Mattern, Friedemann, editor, Mitchell, John C., editor, Naor, Moni, editor, Nierstrasz, Oscar, editor, Pandu Rangan, C., editor, Steffen, Bernhard, editor, Sudan, Madhu, editor, Terzopoulos, Demetri, editor, Tygar, Dough, editor, Vardi, Moshe Y., editor, Weikum, Gerhard, editor, Meziane, Farid, editor, and Métais, Elisabeth, editor
- Published
- 2004
- Full Text
- View/download PDF
42. The Role of Temporal Expressions in Word Sense Disambiguation
- Author
-
Vázquez, Sonia, Saquete, Estela, Montoyo, Andrés, Martínez-Barco, Patricio, Muñoz, Rafael, Goos, Gerhard, editor, Hartmanis, Juris, editor, van Leeuwen, Jan, editor, and Gelbukh, Alexander, editor
- Published
- 2004
- Full Text
- View/download PDF
43. An overview of the Applications of Natural Language to Information Systems
- Author
-
Martinez-Barco, Patricio, Métais, Elisabeth, Llopis, Fernando, and Moreda, Paloma
- Published
- 2013
- Full Text
- View/download PDF
44. A Grammar-Based System to Solve Temporal Expressions in Spanish Texts
- Author
-
Martínez-Barco, Patricio, Saquete, Estela, Muñoz, Rafael, Goos, G., editor, Hartmanis, J., editor, van Leeuwen, J., editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Ranchhod, Elisabete, editor, and Mamede, Nuno J., editor
- Published
- 2002
- Full Text
- View/download PDF
45. PHORA: A NLP System for Spanish
- Author
-
Palomar, Manuel, Saiz-Noeda, Maximiliano, Muñoz, Rafael, Suárez, Armando, Martínez-Barco, Patricio, Montoyo, Andrés, and Gelbukh, Alexander, editor
- Published
- 2001
- Full Text
- View/download PDF
46. Why are some social-media contents more popular than others? Opinion and association rules mining applied to virality patterns discovery
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante. Departamento de Ciencias del Mar y Biología Aplicada, Saquete Boró, Estela, Zubcoff, Jose, Gutiérrez, Yoan, Martínez-Barco, Patricio, Fernández Martínez, Javier, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Universidad de Alicante. Departamento de Ciencias del Mar y Biología Aplicada, Saquete Boró, Estela, Zubcoff, Jose, Gutiérrez, Yoan, Martínez-Barco, Patricio, and Fernández Martínez, Javier
- Abstract
Discovering the main features of virality patterns in Twitter is the focus of this research. Five trending topics related to the COVID-19 pandemic were selected for the study, with Spanish as the target language. To carry out the discovery of virality patterns, we applied opinion mining techniques that enable us to structure the information based on the polarity of the messages and the emotions they contain. After transforming the information from an unstructured textual representation to a structured one, data mining techniques were applied, specifically association rules mining. Message patterns with the highest virality (high shares and high likes), and at the same time the most relevant characteristics of the patterns with less impact were extracted. After an exhaustive analysis of the most relevant non-redundant rules, it can be concluded that messages with a high-negative polarity and a very high emotional charge, especially emotions that have intensified with the COVID-19 pandemic, such as fear, sadness, anger and surprise are more likely to go viral in social media. By contrast, messages with little news coverage in the media, few authors, and the absence of surprise are relevant features when it comes to seeing messages with very low dissemination in social media.
- Published
- 2022
47. Implementación de una plataforma web para venta al por mayor de productos alimenticios
- Author
-
Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Álvaro López, Álvaro, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, and Álvaro López, Álvaro
- Abstract
El proyecto tiene como finalidad el desarrollo del sistema de información y su posterior aplicación en una plataforma web, de un sistema para el control de la venta al por mayor de productos alimenticios entre el comercio en diferentes establecimientos, como supermercados y puestos en mercados, así como a particulares.
- Published
- 2022
48. LIVING-LANG: Living digital entities by human language technologies
- Author
-
Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Ureña López, Luis Alfonso, Saquete Boró, Estela, Martín Valdivia, María Teresa, Martínez-Barco, Patricio, Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos, Ureña López, Luis Alfonso, Saquete Boró, Estela, Martín Valdivia, María Teresa, and Martínez-Barco, Patricio
- Abstract
This project pursues the dynamic modeling at a spatial-temporal level of digital entities in social media for predicting their behavior. Firstly, digital entities are modelled by identifying the characteristics of individuals through their language and footprint on the network. Then, the extraction of relationships between digital entities is one of the nuclear challenges of the project. The proposal pursues this objective on a semantic level, structuring the information into representations of knowledge suitable for logical processing. Considering the heterogeneous nature of the sources to be dealt with, filtering of information is fundamental, using metrics and quality criteria. This spatial-temporal characterization, together with screening processes, will allow us to study high-performance predictive strategies in the evolution of digital entities. This project is coordinated by the SINAI and GPLSI research groups.
- Published
- 2022
49. Dialogue Structure Influence Over Anaphora Resolution
- Author
-
Martínez-Barco, Patricio, Palomar, Manuel, Goos, Gerhard, editor, Hartmanis, Juris, editor, van Leeuwen, Jan, editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Cairó, Osvaldo, editor, Sucar, L. Enrique, editor, and Cantu, Francisco J., editor
- Published
- 2000
- Full Text
- View/download PDF
50. An Annotation Scheme for Dialogues Applied to Anaphora Resolution Algorithms
- Author
-
Martínez-Barco, Patricio, Palomar, Manuel, Goos, G., editor, Hartmanis, J., editor, van Leeuwen, J., editor, Carbonell, Jaime G., editor, Siekmann, Jörg, editor, Sojka, Petr, editor, Kopeček, Ivan, editor, and Pala, Karel, editor
- Published
- 2000
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.