Author: "Altuna, Begoña" / Topic: computer science - computation and language - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Altuna, Begoña"' showing total 3 results

Start Over Author "Altuna, Begoña" Topic computer science - computation and language

3 results on '"Altuna, Begoña"'

1. A Hard Nut to Crack: Idiom Detection with Conversational Large Language Models

Author: Fornaciari, Francesca De Luca, Altuna, Begoña, Gonzalez-Dios, Itziar, and Melero, Maite
Subjects: Computer Science - Computation and Language
Abstract: In this work, we explore idiomatic language processing with Large Language Models (LLMs). We introduce the Idiomatic language Test Suite IdioTS, a new dataset of difficult examples specifically designed by language experts to assess the capabilities of LLMs to process figurative language at sentence level. We propose a comprehensive evaluation methodology based on an idiom detection task, where LLMs are prompted with detecting an idiomatic expression in a given English sentence. We present a thorough automatic and manual evaluation of the results and an extensive error analysis.
Published: 2024

2. NoticIA: A Clickbait Article Summarization Dataset in Spanish

Author: García-Ferrero, Iker and Altuna, Begoña
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickbait headlines, each paired with high-quality, single-sentence generative summarizations written by humans. This task demands advanced text understanding and summarization abilities, challenging the models' capacity to infer and connect diverse pieces of information to meet the user's informational needs generated by the clickbait headline. We evaluate the Spanish text comprehension capabilities of a wide range of state-of-the-art large language models. Additionally, we use the dataset to train ClickbaitFighter, a task-specific model that achieves near-human performance in this task., Comment: Accepted in the journal Procesamiento del Lenguaje Natural
Published: 2024

3. This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Author: García-Ferrero, Iker, Altuna, Begoña, Álvez, Javier, Gonzalez-Dios, Itziar, and Rigau, German
Subjects: Computer Science - Computation and Language
Abstract: Although large language models (LLMs) have apparently acquired a certain level of grammatical knowledge and the ability to make generalizations, they fail to interpret negation, a crucial step in Natural Language Processing. We try to clarify the reasons for the sub-optimal performance of LLMs understanding negation. We introduce a large semi-automatically generated dataset of circa 400,000 descriptive sentences about commonsense knowledge that can be true or false in which negation is present in about 2/3 of the corpus in different forms. We have used our dataset with the largest available open LLMs in a zero-shot approach to grasp their generalization and inference capability and we have also fine-tuned some of the models to assess whether the understanding of negation can be trained. Our findings show that, while LLMs are proficient at classifying affirmative sentences, they struggle with negative sentences and lack a deep understanding of negation, often relying on superficial cues. Although fine-tuning the models on negative sentences improves their performance, the lack of generalization in handling negation is persistent, highlighting the ongoing challenges of LLMs regarding negation understanding and generalization. The dataset and code are publicly available., Comment: Accepted in the The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Altuna, Begoña"'

1. A Hard Nut to Crack: Idiom Detection with Conversational Large Language Models

2. NoticIA: A Clickbait Article Summarization Dataset in Spanish

3. This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Publication Year Range

Publication Type

Database

3 results on '"Altuna, Begoña"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources