143 results on '"Fersini, Elisabetta"'
Search Results
2. Let the Models Respond: Interpreting Language Model Detoxification Through the Lens of Prompt Dependence
- Author
-
Scalena, Daniel, Sarti, Gabriele, Nissim, Malvina, and Fersini, Elisabetta
- Subjects
Computer Science - Computation and Language - Abstract
Due to language models' propensity to generate toxic or hateful responses, several techniques were developed to align model generations with users' preferences. Despite the effectiveness of such methods in improving the safety of model interactions, their impact on models' internal processes is still poorly understood. In this work, we apply popular detoxification approaches to several language models and quantify their impact on the resulting models' prompt dependence using feature attribution methods. We evaluate the effectiveness of counter-narrative fine-tuning and compare it with reinforcement learning-driven detoxification, observing differences in prompt reliance between the two methods despite their similar detoxification performances., Comment: 4 pages
- Published
- 2023
3. AI-UPV at EXIST 2023 -- Sexism Characterization Using Large Language Models Under The Learning with Disagreements Regime
- Author
-
de Paula, Angel Felipe Magnossão, Rizzi, Giulia, Fersini, Elisabetta, and Spina, Damiano
- Subjects
Computer Science - Computation and Language ,Computer Science - Computers and Society ,Computer Science - Machine Learning - Abstract
With the increasing influence of social media platforms, it has become crucial to develop automated systems capable of detecting instances of sexism and other disrespectful and hateful behaviors to promote a more inclusive and respectful online environment. Nevertheless, these tasks are considerably challenging considering different hate categories and the author's intentions, especially under the learning with disagreements regime. This paper describes AI-UPV team's participation in the EXIST (sEXism Identification in Social neTworks) Lab at CLEF 2023. The proposed approach aims at addressing the task of sexism identification and characterization under the learning with disagreements paradigm by training directly from the data with disagreements, without using any aggregated label. Yet, performances considering both soft and hard evaluations are reported. The proposed system uses large language models (i.e., mBERT and XLM-RoBERTa) and ensemble strategies for sexism identification and classification in English and Spanish. In particular, our system is articulated in three different pipelines. The ensemble approach outperformed the individual large language models obtaining the best performances both adopting a soft and a hard label evaluation. This work describes the participation in all the three EXIST tasks, considering a soft evaluation, it obtained fourth place in Task 2 at EXIST and first place in Task 3, with the highest ICM-Soft of -2.32 and a normalized ICM-Soft of 0.79. The source code of our approaches is publicly available at https://github.com/AngelFelipeMP/Sexism-LLM-Learning-With-Disagreement., Comment: 15 pages, 9 tables, 1 figures, conference
- Published
- 2023
4. One Configuration to Rule Them All? Towards Hyperparameter Transfer in Topic Models using Multi-Objective Bayesian Optimization
- Author
-
Terragni, Silvia, Harrando, Ismail, Lisena, Pasquale, Troncy, Raphael, and Fersini, Elisabetta
- Subjects
Computer Science - Computation and Language - Abstract
Topic models are statistical methods that extract underlying topics from document collections. When performing topic modeling, a user usually desires topics that are coherent, diverse between each other, and that constitute good document representations for downstream tasks (e.g. document classification). In this paper, we conduct a multi-objective hyperparameter optimization of three well-known topic models. The obtained results reveal the conflicting nature of different objectives and that the training corpus characteristics are crucial for the hyperparameter selection, suggesting that it is possible to transfer the optimal hyperparameter configurations between datasets., Comment: 17 pages, 7 figures
- Published
- 2022
5. Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content
- Author
-
Gasparini, Francesca, Rizzi, Giulia, Saibene, Aurora, and Fersini, Elisabetta
- Subjects
Computer Science - Artificial Intelligence - Abstract
In this paper we present a benchmark dataset generated as part of a project for automatic identification of misogyny within online content, which focuses in particular on memes. The benchmark here described is composed of 800 memes collected from the most popular social media platforms, such as Facebook, Twitter, Instagram and Reddit, and consulting websites dedicated to collection and creation of memes. To gather misogynistic memes, specific keywords that refer to misogynistic content have been considered as search criterion, considering different manifestations of hatred against women, such as body shaming, stereotyping, objectification and violence. In parallel, memes with no misogynist content have been manually downloaded from the same web sources. Among all the collected memes, three domain experts have selected a dataset of 800 memes equally balanced between misogynistic and non-misogynistic ones. This dataset has been validated through a crowdsourcing platform, involving 60 subjects for the labelling process, in order to collect three evaluations for each instance. Two further binary labels have been collected from both the experts and the crowdsourcing platform, for memes evaluated as misogynistic, concerning aggressiveness and irony. Finally for each meme, the text has been manually transcribed. The dataset provided is thus composed of the 800 memes, the labels given by the experts and those obtained by the crowdsourcing validation, and the transcribed texts. This data can be used to approach the problem of automatic detection of misogynistic content on the Web relying on both textual and visual cues, facing phenomenons that are growing every day such as cybersexism and technology-facilitated violence.
- Published
- 2021
- Full Text
- View/download PDF
6. Cross-lingual Contextualized Topic Models with Zero-shot Learning
- Author
-
Bianchi, Federico, Terragni, Silvia, Hovy, Dirk, Nozza, Debora, and Fersini, Elisabetta
- Subjects
Computer Science - Computation and Language - Abstract
Many data sets (e.g., reviews, forums, news, etc.) exist parallelly in multiple languages. They all cover the same content, but the linguistic differences make it impossible to use traditional, bag-of-word-based topic models. Models have to be either single-language or suffer from a huge, but extremely sparse vocabulary. Both issues can be addressed by transfer learning. In this paper, we introduce a zero-shot cross-lingual topic model. Our model learns topics on one language (here, English), and predicts them for unseen documents in different languages (here, Italian, French, German, and Portuguese). We evaluate the quality of the topic predictions for the same document in different languages. Our results show that the transferred topics are coherent and stable across languages, which suggests exciting future research directions., Comment: Updated version. Published as a conference paper at EACL2021
- Published
- 2020
7. The role of hyper-parameters in relational topic models: Prediction capabilities vs topic quality
- Author
-
Terragni, Silvia, Candelieri, Antonio, and Fersini, Elisabetta
- Published
- 2023
- Full Text
- View/download PDF
8. Soft metrics for evaluation with disagreements: an assessment
- Author
-
Rizzi, Giulia, Leonardelli, Elisa, Poesio, Massimo, Uma, Alexandra, Pavlovic, Maja, Paun, Silviu, Rosso, Paolo, Fersini, Elisabetta, Rizzi, Giulia, Leonardelli, Elisa, Poesio, Massimo, Uma, Alexandra, Pavlovic, Maja, Paun, Silviu, Rosso, Paolo, and Fersini, Elisabetta
- Abstract
The move towards preserving judgement disagreements in NLP requires the identification of adequate evaluation metrics. We identify a set of key properties that such metrics should have, and assess the extent to which natural candidates for soft evaluation such as Cross Entropy satisfy such properties. We employ a theoretical framework, supported by a visual approach, by practical examples, and by the analysis of a real case scenario. Our results indicate that Cross Entropy can result in fairly paradoxical results in some cases, whereas other measures Manhattan distance and Euclidean distance exhibit a more intuitive behavior, at least for the case of binary classification.
- Published
- 2024
9. Approximate Reasoning with Order-Sorted Feature Logic
- Author
-
Milanese, G, FERSINI, ELISABETTA, PASI, GABRIELLA, MILANESE, GIAN CARLO, Milanese, G, FERSINI, ELISABETTA, PASI, GABRIELLA, and MILANESE, GIAN CARLO
- Abstract
La logica Order-Sorted Feature (OSF) è un linguaggio di rappresentazione della conoscenza nato dal lavoro di ricerca di Hassan Aït-Kaci su un sistema logico per modellare le nozioni di sussunzione e unificazione nei formalismi basati sull'ereditarietà. La logica OSF è stata applicata alla linguistica computazionale e implementata in linguaggi di constraint logic programming e ragionatori automatici. Il linguaggio della logica OSF si basa su funzioni (attributi, o features), e su tipi (concetti, o sorts) ordinati in una relazione di sussunzione. In questa logica il ragionamento è fondato sull'unificazione di strutture chiamate termini OSF, un processo che mira a combinare i vincoli espressi da due termini OSF in un unico termine. Un vantaggio della logica OSF è che il suo algoritmo di unificazione tiene conto dell'ordinamento di sussunzione tra i tipi, che può consentire a una singola unificazione di sostituire diversi passi di inferenza, portando a calcoli più efficienti. Questa tesi si occupa dello sviluppo teorico del ragionamento approssimato con la logica OSF. Il primo contributo della tesi è la definizione della logica OSF fuzzy, una generalizzazione fuzzy della semantica della logica OSF in cui i tipi denotano insiemi fuzzy, consentendo di rappresentare concetti vaghi. Inoltre, i tipi della logica OSF fuzzy sono ordinati in una relazione di sussunzione fuzzy, fornendo maggiore flessibilità nella modellazione. La relazione di sussunzione fuzzy viene dotata di una semantica che generalizza la definizione di inclusione degli insiemi fuzzy di Zadeh. In questo lavoro indaghiamo se diverse proprietà semantiche e computazionali della logica OSF siano preservate nel contesto fuzzy. Dimostriamo, per esempio, che i termini OSF sono ordinati in un reticolo di sussunzione fuzzy che estende l'ordinamento fuzzy tra i tipi, e dimostriamo che l'unificazione di due termini OSF produce il loro estremo inferiore. Definiamo anche procedure per calcolare il grado di sus, Order-Sorted Feature (OSF) logic is a Knowledge Representation and Reasoning (KRR) language originating in Hassan Aït-Kaci's work on designing a calculus of partially ordered type structures. The language was developed to model the notions of subsumption and unification in inheritance-based KRR formalisms, and it has been applied in computational linguistics and implemented in constraint logic programming languages and automated reasoners. The language of OSF logic is based on function-denoting feature symbols and on set-denoting sort symbols ordered in a subsumption (is-a) lattice. Reasoning with OSF logic relies on the unification of set-denoting structures called OSF terms, a process that aims to combine the constraints expressed by two OSF terms into a single term. An advantage of OSF logic is that its unification algorithm takes into account the subsumption ordering between sorts, which may enable a single unification step to replace several inference steps, leading to more efficient computations. This thesis deals with the theoretical development of approximate reasoning within the framework of OSF logic. The first key contribution of the thesis is the definition of fuzzy OSF logic, a fuzzy generalization of the semantics of OSF logic where sorts denote fuzzy sets rather than crisp sets, allowing to represent vague concepts. Moreover, the sorts of fuzzy OSF logic are ordered in a fuzzy subsumption relation (formally a fuzzy lattice) rather than a crisp one, which provides more modeling flexibility by allowing to represent graded subsumption relations. The fuzzy sort subsumption relation is given a special semantics which generalizes Zadeh's definition of inclusion of fuzzy sets. We investigate whether several semantic and computational properties of crisp OSF logic are preserved in the fuzzy setting. For instance, we show that OSF terms are ordered in a fuzzy subsumption relation which extends the fuzzy ordering between sorts, and we prove that the unificatio
- Published
- 2024
10. Exploring Neural Topic Modeling on a Classical Latin Corpus
- Author
-
Calzolari, Nicoletta, Kan, Min-Yen, Hoste, Veronique, Lenci, Alessandro, Sakti, Sakriani, Xue, Nianwen, Martinelli, Ginevra, Impicciché, Paola, Fersini, Elisabetta, Mambrini, Francesco, Passarotti, Marco Carlo, Mambrini Francesco (ORCID:0000-0003-0834-7562), Passarotti Marco (ORCID:0000-0002-9806-7187), Calzolari, Nicoletta, Kan, Min-Yen, Hoste, Veronique, Lenci, Alessandro, Sakti, Sakriani, Xue, Nianwen, Martinelli, Ginevra, Impicciché, Paola, Fersini, Elisabetta, Mambrini, Francesco, Passarotti, Marco Carlo, Mambrini Francesco (ORCID:0000-0003-0834-7562), and Passarotti Marco (ORCID:0000-0002-9806-7187)
- Abstract
The large availability of processable textual resources for Classical Latin has made it possible to study Latin literature through methods and tools that support distant reading. This paper describes a number of experiments carried out to test the possibility of investigating the thematic distribution of the Classical Latin corpus Opera Latina by means of topic modeling. For this purpose, we train, optimize and compare two neural models, Product-of-Experts LDA (ProdLDA) and Embedded Topic Model (ETM), opportunely revised to deal with the textual data from a Classical Latin corpus, to evaluate which one performs better both on the basis of topic diversity and topic coherence metrics, and from a human judgment point of view. Our results show that the topics extracted by neural models are coherent and interpretable and that they are significant from the perspective of a Latin scholar. The source code of the proposed model is available at https://github.com/MIND-Lab/LatinProdLDA.
- Published
- 2024
11. CAGE: Constrained deep Attributed Graph Embedding
- Author
-
Nozza, Debora, Fersini, Elisabetta, and Messina, Enza
- Published
- 2020
- Full Text
- View/download PDF
12. Constrained Relational Topic Models
- Author
-
Terragni, Silvia, Fersini, Elisabetta, and Messina, Enza
- Published
- 2020
- Full Text
- View/download PDF
13. Named entity recognition using conditional random fields with non-local relational constraints
- Author
-
Cecchini, Flavio Massimiliano and Fersini, Elisabetta
- Subjects
Computer Science - Computation and Language - Abstract
We begin by introducing the Computer Science branch of Natural Language Processing, then narrowing the attention on its subbranch of Information Extraction and particularly on Named Entity Recognition, discussing briefly its main methodological approaches. It follows an introduction to state-of-the-art Conditional Random Fields under the form of linear chains. Subsequently, the idea of constrained inference as a way to model long-distance relationships in a text is presented, based on an Integer Linear Programming representation of the problem. Adding such relationships to the problem as automatically inferred logical formulas, translatable into linear conditions, we propose to solve the resulting more complex problem with the aid of Lagrangian relaxation, of which some technical details are explained. Lastly, we give some experimental results.
- Published
- 2013
14. Ensemble learning on visual and textual data for social image emotion classification
- Author
-
Corchs, Silvia, Fersini, Elisabetta, and Gasparini, Francesca
- Published
- 2019
- Full Text
- View/download PDF
15. Proceedings of the Eighth Italian Conference on Computational Linguistics CliC-it 2021
- Author
-
Achena, Flavia, Aggarwal, Akshay, Albanesi, Davide, Albertin, Giorgia, Alzetta, Chiara, Anselma, Luca, Aparaschivei, Lavinia, Archetti, Francesco, Barattieri di San Pietro, Chiara, Barrón-Cedeño, Alberto, Basile, Pierpaolo, Basile, Valerio, Basili, Roberto, Bellandi, Andrea, Bosco, Cristina, Bosque-Gil, Julia, Brunato, Dominique, Brutti, Alessio, Budeanu, Ancuta, Cabrio, Elena, Canazza, Sergio, Candelieri, Antonio, Caselli, Tommaso, Casola, Silvia, Cassotti, Pierluigi, Cattoni, Roldano, Cecchini, Flavio Massimiliano, Celli, Fabio, Cerulli, Aldo, Cettolo, Mauro, Chesi, Cristiano, Cignarella, Alessandra Teresa, Cimiano, Philipp, Cimino, Andrea, Colella, Annalisa, Croce, Danilo, Cutugno, Francesco, Dall’Acqua, Anna, Dangelo, Paolo, Dei Rossi, Stefano, Dell’Orletta, Felice, Demartini, Silvia, De Gemmis, Marco, De Mattei, Lorenzo, Dini, Irene, Di Bratto, Martina, di Buono, Maria Pia, Di Caro, Luigi, Di Liello, Luca, Di Maro, Maria, Dufaux, Alain, Duzha, Armend, Elahi, Mohammad Fazleh, Ell, Basil, Ernandes, Marco, Fabrizi, Samuel, Fallucchi, Francesca, Fantoli, Margherita, Fernández, Raquel, Ferrari, Pier Luigi, Ferrero, Dario, Ferri, Fabio, Fersini, Elisabetta, Florio, Komal, Flouris, Giorgos, Frenda, Simona, Frey, Jennifer-Carmen, Gattero, Valerio, Gemelli, Sara, Gemmis, Marco de, Giovannetti, Emiliano, Gismondi, Biancamaria, Grasso, Francesca, Grotto, Francesco, Iacono, Maria, Iovine, Andrea, Jansen, Lennert, Jezek, Elisabetta, Kaplan, Frédéric, Labruna, Tiziano, Lai, Mirko, Lavelli, Alberto, Lenci, Alessandro, Litta, Eleonora, Lombardi, Agnese, Lops, Pasquale, Louvan, Samuel, Luzietti, Roberta Bianca, Maffia, Marta, Magnini, Bernardo, Mahed Mousavi, Seyed, Mambrini, Francesco, Marchi, Simone, Marelli, Marco, Marra, Andrea, Martino, Maria De, Marziano, Giuseppe, Masini, Francesca, Mastromattei, Michele, Mattiola, Simone, Mazzei, Alessandro, Messina, Enza, Miaschi, Alessio, Minnema, Gosse, Montemagni, Simonetta, Moretti, Giovanni, Moschitti, Alessandro, Negri, Matteo, Negro, Roberto, Nicolas, Lionel, Nissim, Malvina, Nolano, Gennaro, Ògúnrẹ̀mí, Tolúlọpẹ, Onorati, Dario, Origlia, Antonio, Orrico, Riccardo, Paccosi, Teresa, Palmero Aprosio, Alessio, Papadakos, Panagiotis, Papantoniou, Katerina, Papi, Sara, Papini, Mafalda, Passaro, Lucia C., Passarotti, Marco, Patti, Viviana, Pezzelle, Sandro, Plank, Barbara, Plexousakis, Dimitris, Pretto, Niccolò, Puccinelli, Daniele, Ranaldi, Leonardo, Ravelli, Andrea Amelio, Reverberi, Carlo, Riccardi, Giuseppe, Rocchietti, Guido, Romani, Emma, Rosato, Luca, Rossi, Laura, Roth, Dan, Ruffolo, Paolo, Ruzzetti, Elena Sofia, Sabri, Nazanin, Salaris, Sara, Sanguinetti, Manuela, Schettino, Loredana, Schmalz, Veronica Juliana, Sciolette, Flavia, Semeraro, Giovanni, Siciliani, Lucia, Simi, Maria, Sinclair, Arabella, Sprugnoli, Rachele, Stemle, Egon W., Stranisci, Marco Antonio, Tamburini, Fabio, Terragni, Silvia, Tesei, Andrea, Tonelli, Sara, Tripodi, Rocco, Turchi, Marco, Uryupina, Olga, van der Goot, Margot J., Ventura, Viviana, Venturi, Giulia, Villata, Serena, Vorakitphan, Vorakit, Zanchi, Chiara, Zanzotto, Fabio Massimo, Zeman, Daniel, Zugarini, Andrea, Fersini, Elisabetta, Passarotti, Marco, and Patti, Viviana
- Subjects
Computational Linguistics ,linguistica ,linguistique computationelle ,Linguistics ,LAN009000 ,CF ,linguistica computazionale ,linguistique ,Language - Abstract
The eighth edition of the Italian Conference on Computational Linguistics (CLiC-it 2021) was held at Università degli Studi di Milano-Bicocca from 26th to 28th January 2022. After the edition of 2020, which was held in fully virtual mode due to the health emergency related to Covid-19, CLiC-it 2021 represented the first moment for the Italian research community of Computational Linguistics to meet in person after more than one year of full/partial lockdown.
- Published
- 2022
16. Safe Optimal Control of Dynamic Systems: Learning from Experts and Safely Exploring New Policies.
- Author
-
Candelieri, Antonio, Ponti, Andrea, Fersini, Elisabetta, Messina, Enza, and Archetti, Francesco
- Subjects
DYNAMICAL systems ,KRIGING ,GAUSSIAN processes ,REINFORCEMENT learning ,DIGITAL twins ,STATISTICAL learning ,SYSTEM dynamics - Abstract
Many real-life systems are usually controlled through policies replicating experts' knowledge, typically favouring "safety" at the expense of optimality. Indeed, these control policies are usually aimed at avoiding a system's disruptions or deviations from a target behaviour, leading to suboptimal performances. This paper proposes a statistical learning approach to exploit the historical safe experience—collected through the application of a safe control policy based on experts' knowledge— to "safely explore" new and more efficient policies. The basic idea is that performances can be improved by facing a reasonable and quantifiable risk in terms of safety. The proposed approach relies on Gaussian Process regression to obtain a probabilistic model of both a system's dynamics and performances, depending on the historical safe experience. The new policy consists of solving a constrained optimization problem, with two Gaussian Processes modelling, respectively, the safety constraints and the performance metric (i.e., objective function). As a probabilistic model, Gaussian Process regression provides an estimate of the target variable and the associated uncertainty; this property is crucial for dealing with uncertainty while new policies are safely explored. Another important benefit is that the proposed approach does not require any implementation of an expensive digital twin of the original system. Results on two real-life systems are presented, empirically proving the ability of the approach to improve performances with respect to the initial safe policy without significantly affecting safety. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
17. Deep Learning Representations in Automatic Misogyny Identification: What Do We Gain and What Do We Miss?
- Author
-
Fersini, Elisabetta, Rosato, Luca, Candelieri, Antonio, Archetti, Francesco, and Messina, Enza
- Subjects
Computational Linguistics ,linguistica ,linguistique computationelle ,Linguistics ,linguistica computazionale ,linguistique ,Language - Abstract
In this paper, we address the problem of automatic misogyny identification focusing on understanding the representation capabilities of widely adopted embeddings and addressing the problem of unintended bias. The proposed framework, grounded on Sentence Embeddings and Multi-Objective Bayesian Optimization, has been validated on an Italian dataset. We highlight capabilities and weaknesses related to the use of pre-trained language, as well as the contribution of Bayesian Optimization for mitigating the problem of biased predictions.
- Published
- 2022
18. Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content
- Author
-
Gasparini, Francesca, primary, Rizzi, Giulia, additional, Saibene, Aurora, additional, and Fersini, Elisabetta, additional
- Published
- 2022
- Full Text
- View/download PDF
19. Modeling Relational and Contextual Information into Topic Models and their Evaluation
- Author
-
MESSINA, VINCENZINA, Terragni, S, VIZZARI, GIUSEPPE, FERSINI, ELISABETTA, TERRAGNI, SILVIA, MESSINA, VINCENZINA, Terragni, S, VIZZARI, GIUSEPPE, FERSINI, ELISABETTA, and TERRAGNI, SILVIA
- Abstract
La conoscenza testuale è uno dei pilastri principali della nostra società. Infatti, la conoscenza umana è spesso trasmessa attraverso le parole. Dall'invenzione della scrittura, gli uomini hanno raccontato e descritto la loro esistenza con parole su pezzi di carta. Questa quantità di conoscenza si accumula fino a ciò che l'intera civiltà ha raccolto in più di 5000 anni. Gli storici e gli scienziati sociali e politici cercano modi per capire meglio questa vasta quantità di conoscenza collettiva che non può essere esplorata manualmente. A questo scopo, i ricercatori di machine learning, statistica e linguistica computazionale hanno sviluppato i topic model, una suite di algoritmi che mirano ad annotare grandi archivi di documenti con informazioni tematiche. La popolarità di questi modelli è dovuta al fatto che sono non supervisionati e che sono interpretabili. I topic model analizzano e riassumono i temi principali, anche detti topic, di grandi collezioni di documenti, presentando le informazioni in una forma compatta e comprensibile. La maggior parte dei topic model si concentrano solo sulle parole codificate nei documenti. Tuttavia, informazioni aggiuntive possono essere introdotte nei topic model per migliorare le loro prestazioni. Infatti, in molti casi del mondo reale, raramente abbiamo solo i semplici testi da analizzare. Invece, abbiamo informazioni aggiuntive o metadati relativi ai documenti, ad esempio, l'autore del documento, la data, collegamenti ipertestuali ad altri documenti, un insieme di hashtag, menzioni o etichette. Possiamo usare queste informazioni precedenti per aiutare un topic model a scoprire topic di qualità. Per esempio, sapere che un documento cita un altro documento aumenta la nostra fiducia che i documenti parlano degli stessi argomenti.Inoltre, i topic model spesso ignorano l'ordine delle parole e le informazioni contestuali, rendendo difficile dedurre gli topic coerenti e significativi. Un altro problema nel campo è legato a, Textual knowledge is one of the main pillars of our society. Indeed, human knowledge is often passed along using words. Since the invention of writing, humans have narrated and described their existence with words over pieces of papers. This amount of knowledge builds up to what the entire civilization has collected over more than 5'000 years. Historians and social and political scientists look for ways to understand better this vast amount of collective knowledge that cannot be manually explored. To this end, researchers from machine learning, statistics and computational linguistic have developed topic models, a suite of algorithms that aim to annotate large archives of documents with thematic information. The popularity of these models is due to the fact that they are unsupervised and that they are interpretable. Topic models analyze and summarize the main themes, or topics, of large collections of documents, presenting the information in a compact and understandable form. Most topic models focus only on the words encoded in the documents. However, additional information can be introduced into topic models to improve their performance. In fact, in many real-world cases, we seldom have only the mere texts to analyze. Instead, we have additional information or metadata related to the documents, e.g., the document's author, the date, hyperlinks to other documents, a set of hashtags, mentions or labels. We can use this prior information to help a topic model discover better topics. For example, knowing that a document cites another document increases our confidence that the documents talk about the same topics. Also, topic models often ignore word order and contextual information, making it difficult to infer high-quality topics. Another problem in the field is related to the hyperparameters used to train the topics models. The hyperparameters are often fixed in experimental settings and few researchers have tried to study their impact on the results. In this thesis
- Published
- 2022
20. Legal retrieval as support to eMediation: matching disputant’s case and court decisions
- Author
-
El Jelali, Soufiane, Fersini, Elisabetta, and Messina, Enza
- Published
- 2015
- Full Text
- View/download PDF
21. Overview of PAN 2022 : authorship verification, profiling irony and stereotype spreaders, style change detection, and trigger detection
- Author
-
Bevendorff, Janek, Chulvi, Berta, Fersini, Elisabetta, Heini, Annina, Kestemont, Mike, Kredens, Krzysztof, Mayerl, Maximilian, Ortega-Bueno, Reyner, Pezik, Piotr, Potthast, Martin, Rangel, Francisco, Rosso, Paolo, Stamatatos, Efstathios, Stein, Benno, Wiegmann, Matti, Wolska, Magdalena, and Zangerle, Eva
- Subjects
Computer. Automation - Abstract
The paper gives a brief overview of the four shared tasks to be organized at the PAN 2022 lab on digital text forensics and stylometry hosted at the CLEF 2022 conference. The tasks include authorship verification across discourse types, multi-author writing style analysis, author profiling, and content profiling. Some of the tasks continue and advance past editions (authorship verification and multi-author analysis) and some are new (profiling irony and stereotypes spreaders and trigger detection). The general goal of the PAN shared tasks is to advance the state of the art in text forensics and stylometry while ensuring objective evaluation on newly developed benchmark datasets.
- Published
- 2022
22. Profiling hate speech spreaders on twitter task at PAN 2021
- Author
-
Rangel, Francisco, Peña-Sarracén, Gretel Liz de la, Chulvi-Ferriols, María Alberta, Fersini, Elisabetta, and Rosso, Paolo
- Subjects
Hate speech spreaders ,Artificial intelligence ,Natural language processing ,Author profiling ,Hate speech ,LENGUAJES Y SISTEMAS INFORMATICOS - Abstract
[EN] This overview presents the Author Profiling shared task at PAN 2021. The focus of this year¿s task is on determining whether or not the author of a Twitter feed is keen to spread hate speech. The main aim is to show the feasibility of automatically identifying potential hate speech spreaders on Twitter. For this purpose a corpus with Twitter data has been provided, covering the English and Spanish languages. Altogether, the approaches of 66 participants have been evaluated., First of all, we thank the participants: again 66 this year, as the previous year on Profiling Fake News Spreaders! We have to thank also Martin Potthast, Matti Wiegmann, Nikolay Kolyada, and Magdalena Anna Wolska for their technical support with the TIRA platform. We thank Symanto for sponsoring again the award for the best performing system at the author profiling shared task. The work of Francisco Rangel was partially funded by the Centre for the Development of Industrial Technology (CDTI) of the Spanish Ministry of Science and Innovation under the research project IDI-20210776 on Proactive Profiling of Hate Speech Spreaders - PROHATER (Perfilador Proactivo de Difusores de Mensajes de Odio). The work of the researchers from Universitat Politècnica de València was partially funded by the Spanish MICINN under the project MISMIS-FAKEnHATE on MISinformation and MIScommunication in social media: FAKE news and HATE speech (PGC2018-096212-B-C31), and by the Generalitat Valenciana under the project DeepPattern (PROMETEO/2019/121). This article is also based upon work from the Dig-ForAsp COST Action 17124 on Digital Forensics: evidence analysis via intelligent systems and practices, supported by European Cooperation in Science and Technology.
- Published
- 2021
23. Named Entity Recognition: Resource Constrained Maximum Path
- Author
-
Di Puglia Pugliese Luigi, Fersini Elisabetta, Guerriero Francesca, and Messina Enza
- Subjects
Information technology ,T58.5-58.64 - Abstract
Information Extraction (IE) is a process focused on automatic extraction of structured information from unstructured text sources. One open research field of IE relates to Named Entity Recognition (NER), aimed at identifying and associating atomic elements in a given text to a predefined category such as names of persons, organizations, locations and so on. This problem can be formalized as the assignment of a finite sequence of semantic labels to a set of interdependent variables associated with text fragments, and can modelled through a stochastic process involving both hidden variables (semantic labels) and observed variables (textual cues). In this work we investigate one of the most promising model for NER based on Conditional Random Fields (CRFs). CRFs are enhanced in a two stages approach to include in the decision process logic rules that can be either extracted from data or defined by domain experts. The problem is defined as a Resource Constrained Maximum Path Problem (RCMPP) associating a resource with each logic rule. Proper resource Extension Functions (REFs) and upper bound on the resource consumptions are defined in order to model the logic rules as knapsack-like constraints. A well-tailored dynamic programming procedure is defined to address the RCMPP.
- Published
- 2017
- Full Text
- View/download PDF
24. EVALITA Evaluation of NLP and Speech Tools for Italian - December 17th, 2020
- Author
-
Agerri, Rodrigo, Aliprandi, Carlo, Alkhalifa, Rabab, Alzetta, Chiara, Angel, Jason, Anselmi, Guido, Appiah Balaji, Nitin Nikamanth, Aroyehun, Segun Taofeek, Artigas Herold, Maria Fernanda, Attanasio, Giuseppe, Attardi, Giuseppe, Badryzlova, Yulia, Bai, Yang, Baldissin, Gioia, Ballarè, Silvia, Barrón-Cedeño, Alberto, Bartle, Anna-Sophie, Basile, Pierpaolo, Basile, Valerio, Basili, Roberto, Belotti, Federico, Bennici, Mauro, Bharathi, B., Bhuvana, J., Bianchi, Federico, Bisconti, Elia, Bolanos, Luis, Bondielli, Alessandro, Bosco, Cristina, Breazzano, Claudia, Brivio, Matteo, Brunato, Dominique, Cafagna, Michele, Caputo, Annalina, Caselli, Tommaso, Cassotti, Pierluigi, Castañeda, Enrique, Castro Castro, Daniel, Centeno, Roberto, Cercel, Dumitru-Clementin, Cerruti, Massimo, Chandrabose, Aravindan, Chesi, Cristiano, Chiarello, Filippo, Cignarella, Alessandra Teresa, Cimino, Andrea, Comandini, Gloria, Croce, Danilo, Dai, Hongbing, Dascalu, Mihai, Dell’Orletta, Felice, Delmonte, Rodolfo, Deng, Tao, De Francesco, Nazareno, De Martino, Graziella, De Mattei, Lorenzo, Di Buccio, Emanuele, Di Maro, Maria, di Nuovo, Elisa, Di Rosa, Emanuele, dos S.R. da Silva, Adriano, Durante, Alberto, El Abassi, Samer, Espinosa, María S., Fabrizi, Samuel, Fantoni, Gualtiero, Ferilli, Stefano, Ferraccioli, Federico, Fersini, Elisabetta, Finos, Livio, Fiorucci, Stefano, Fontana, Michele, Frenda, Simona, Gambino, Giuseppe, Gatt, Albert, Gelbukh, Alexander, Giorgi, Giulia, Giorgioni, Simone, Girardi, Paolo, Goria, Eugenio, Gregori, Lorenzo, Hoffmann, Julia, Iacono, Maria, Iovine, Andrea, Izzi, Giovanni Luca, Jimenez, Sergio, Kaiser, Jens, Kayalvizhi, S., Kivlichan, Ian, Klaus, Svea, Koceva, Frosina, Kovács, György, Kruschwitz, Udo, Labadie Tamayo, Roberto, Lai, Mirko, Laicher, Severin, Lapesa, Gabriella, Lavergne, Eric, Lebani, Gianluca E., Lees, Alyssa, Lenci, Alessandro, Leonardelli, Elisa, Li, Hongling, Liakata, Maria, Lovetere, Marco, Madonna, Domenico, Massidda, Riccardo, Mattei, Lorenzo De, Mauri, Caterina, Mele, Francesco, Melucci, Massimo, Menini, Stefano, Miaschi, Alessio, Miliani, Martina, Moggio, Alessio, Montagnani, Matteo, Montefinese, Maria, Montemagni, Simonetta, Monti, Johanna, Moraca, Maurizio, Moretti, Giovanni, Morra, Simone, Murphy, Killian, Muti, Arianna, Nakov, Preslav, Nisioi, Sergiu, Nissim, Malvina, Nozza, Debora, Occhipinti, Daniela, Ortega Bueno, Reynier, Ou, Xiaozhi, Palmonari, Matteo, Parizzi, Andrea, Pascucci, Antonio, Passaro, Lucia C., Pastor, Eliana, Patti, Viviana, Pirrone, Roberto, Polignano, Marco, Politi, Marcello, Pont, Mattia Da, Pražák, Ondřej, Přibáň, Pavel, Proisl, Thomas, Puccetti, Giovanni, Radicioni, Daniele P., Rama, Ilir, Rambelli, Giulia, Ravelli, Andrea Amelio, Rodrigo, Alvaro, Rodriguez-Diaz, Carlos A., Rodriguez Cisnero, Mariano Jason, Roman, Norton T., Roman, Norton Trevisan, Rossmann, Daniela, Rosso, Paolo, Rotaru, Armand Stefan, Rubino, Edoardo, Russo, Irene, Sabella, Gianluca, Saini, Rajkumar, Salman, Samir, Sangati, Federico, Sanguinetti, Manuela, Sarti, Gabriele, Schlechtweg, Dominik, Schulte im Walde, Sabine, Sciandra, Andrea, Setpal, Jinen, Siciliani, Lucia, Solari, Dario, Sorensen, Jeffrey, Sorgente, Antonio, Sprugnoli, Rachele, Stranisci, Marco, Tamburini, Fabio, Taylor, Stephen, Tesei, Andrea, Thenmozhi, D., Tonelli, Sara, Torre, Ilaria, Tsakalidis, Adam, Varvara, Rossella, Venturi, Giulia, Vettigli, Giuseppe, Vlad, George-Alexandru, Wang, Benyou, Zaharia, George-Eduard, Zamparelli, Roberto, Zubiaga, Arkaitz, Basile, Valerio, Croce, Danilo, Maro, Maria, and Passaro, Lucia C.
- Subjects
Language & Linguistics ,Automatic Misogyny Identification ,AlBERTo ,BERT Model ,Language Game ``La Ghigliottina'' ,MEME management ,EVALITA ,Convolutional Neural Network ,LAN000000 ,Hate Speech Detection ,CBX ,linguistica computazionale ,Misogyny on Twitter Posts ,COVID-19 Infodemic ,Multimodal Meme Detection - Abstract
Welcome to EVALITA 2020! EVALITA is the evaluation campaign of Natural Language Processing and Speech Tools for Italian. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC, http://www.ai-lc.it) and it is endorsed by the Italian Association for Artificial Intelligence (AIxIA, http://www.aixia.it) and the Italian Association for Speech Sciences (AISV, http://www.aisv.it).
- Published
- 2021
25. Frame Semantics for Social NLP in Italian: Analyzing Responsibility Framing in Femicide News Reports
- Author
-
Minnema, Gosse, Gemelli, Sara, Zanchi, Chiara, Patti, Viviana, Caselli, Tommaso, Nissim, Malvina, Fersini, Elisabetta, Passarotti, Marco, and Computational Linguistics (CL)
- Subjects
computational linguistics ,perspectives ,frame semantics ,NLP - Abstract
We propose using a FrameNet-based approach for analyzing how socially relevant events are framed in media discourses. Taking femicides as an example, we perform a preliminary investigation on a large dataset of news reports and event data covering recent femicides in Italy. First, we revisit the EVALITA 2011 shared task on Italian frame labeling, and test a recent multilingual frame semantic parser against this benchmark. Then, we experiment with specializing this model for Italian and perform a human evaluation to test our model’s real-world applicability. We show how FrameNet-based analyses can help to identify linguistic constructions that background the agentivity and responsibility of femicide perpetrators in Italian news.
- Published
- 2021
26. OCTIS 2.0: Optimizing and comparing topic models in Italian is even simpler!
- Author
-
Terragni, Silvia, Fersini, Elisabetta, Fersini, E, Passarotti, M, Patti, V, and Terragni, S
- Subjects
Computational Linguistics ,linguistica ,Performance Evaluation ,linguistique computationelle ,Topic Model ,Linguistics ,linguistica computazionale ,Hyper-parameter ,linguistique ,Language - Abstract
OCTIS is an open-source framework for training, evaluating and comparing Topic Models. This tool uses single-objective Bayesian Optimization (BO) to optimize the hyper-parameters of the models and thus guarantee a fairer comparison. Yet, a single-objective approach disregards that a user may want to simultaneously optimize multiple objectives. We therefore propose OCTIS 2.0: the extension of OCTIS that addresses the problem of estimating the optimal hyper-parameter configurations for a topic model using multi-objective BO. Moreover, we also release and integrate two pre-processed Italian datasets, which can be easily used as benchmarks for the Italian language. OCTIS è un framework open-source per il training, la valutazione e la comparazione di Topic Models. Questo strumento utilizza l’ottimizzazione Bayesiana (BO) a singolo obiettivo per ottimizzare gli iperparametri dei modelli e quindi garantire una comparazione più equa. Tuttavia, questo approccio ignora che un utente potrebbe voler ottimizzare pi‘u di un obiettivo. Proponiamo perciò OCTIS 2.0: l’estensione di OCTIS che affronta il problema della stima delle configurazioni ottimali degli iperparametri di un topic model usando la BO multi-obiettivo. In aggiunta, rilasciamo e integriamo anche due nuovi dataset in italiano pre-processati, che possono essere facilmente utilizzati come benchmark per la lingua italiana.
- Published
- 2021
27. Leveraging Bias in Pre-Trained Word Embeddings for Unsupervised Microaggression Detection
- Author
-
Ògúnremí, Tolúlope', Sabri, Nazanin, Basile, Valerio, Caselli, Tommaso, Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, and Computational Linguistics (CL)
- Subjects
hate speech ,NLP ,micro-aggression - Abstract
Microaggressions are subtle manifestations of bias (Breitfeller et al., 2019). These demonstrations of bias can often be classified as a subset of abusive language. However, not as much focus has been placed on the recognition of these instances. As a result, limited data is available on the topic, and only in English. Being able to detect microaggressions without the need for labeled data would be advantageous since it would allow content moderation also for languages lacking annotated data. In this study, we introduce an unsupervised method to detect microaggressions in natural language expressions. The algorithm relies on pre-trained word-embeddings, leveraging the bias encoded in the model in order to detect microaggressions in unseen textual instances. We test the method on a dataset of racial and gender-based microaggressions, reporting promising results. We further run the algorithm on out-of-domain unseen data with the purpose of bootstrapping corpora of microaggressions “in the wild”, and discuss the benefits and drawbacks of our proposed method.
- Published
- 2021
28. Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021). Milan, Italy, January 26-28, 2022
- Author
-
Passarotti, Marco (ORCID:0000-0002-9806-7187), Fersini, Elisabetta, Passarotti, Marco Carlo, Patti, Viviana, Passarotti, Marco (ORCID:0000-0002-9806-7187), Fersini, Elisabetta, Passarotti, Marco Carlo, and Patti, Viviana
- Abstract
Proceedings of the Eighth Italian Conference on Computational Linguistics (CLiC-it 2021). Milan, Italy, January 26-28, 2022
- Published
- 2021
29. Preface
- Author
-
Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Passarotti, Marco Carlo, Passarotti Marco (ORCID:0000-0002-9806-7187), Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Passarotti, Marco Carlo, and Passarotti Marco (ORCID:0000-0002-9806-7187)
- Abstract
Preface of the volume
- Published
- 2021
30. Sentiment Analysis of Latin Poetry: First Experiments on the Odes of Horace
- Author
-
Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Sprugnoli, Rachele, Mambrini, Francesco, Passarotti, Marco Carlo, Moretti, Giovanni, Sprugnoli Rachele (ORCID:0000-0001-6861-5595), Mambrini Francesco (ORCID:0000-0003-0834-7562), Passarotti Marco (ORCID:0000-0002-9806-7187), Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Sprugnoli, Rachele, Mambrini, Francesco, Passarotti, Marco Carlo, Moretti, Giovanni, Sprugnoli Rachele (ORCID:0000-0001-6861-5595), Mambrini Francesco (ORCID:0000-0003-0834-7562), and Passarotti Marco (ORCID:0000-0002-9806-7187)
- Abstract
In this paper we present a set of annotated data and the results of a number of unsupervised experiments for the analysis of sentiment in Latin poetry. More specifically, we describe a small gold standard made of eight poems by Horace, in which each sentence is labeled manually for the sentiment using a four-value classification (positive, negative, neutral and mixed). Then, we report on how this gold standard has been used to evaluate two automatic approaches for sentiment classification: one is lexicon-based and the other adopts a zero-shot transfer approach.
- Published
- 2021
31. The Annotation of Liber Abbaci, a Domain-Specific Latin Resource
- Author
-
Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Grotto, Francesco, Sprugnoli, Rachele, Fantoli, Margherita, Simi, Maria, Cecchini, Flavio Massimiliano, Passarotti, Marco Carlo, Rachele Sprugnoli (ORCID:0000-0001-6861-5595), Flavio Massimiliano Cecchini (ORCID:0000-0001-9029-1822), Marco Passarotti (ORCID:0000-0002-9806-7187), Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Grotto, Francesco, Sprugnoli, Rachele, Fantoli, Margherita, Simi, Maria, Cecchini, Flavio Massimiliano, Passarotti, Marco Carlo, Rachele Sprugnoli (ORCID:0000-0001-6861-5595), Flavio Massimiliano Cecchini (ORCID:0000-0001-9029-1822), and Marco Passarotti (ORCID:0000-0002-9806-7187)
- Abstract
The Liber Abbaci (13th century) is a milestone in the history of mathematics and accounting. Due to the late stage of Latin, its features and its very specialized content, it also represents a unique resource for scholars working on Latin corpora. In this paper we present the annotation and linking work carried out in the frame of the project Fibonacci 1202-2021. A gold-standard lemmatization and part-ofspeech tagging allow us to elaborate some first observations on the linguistic and historical features of the text, and to link the text to the Lila Knowledge Base, that has as its goal to make distributed linguistic resources for Latin interoperable by following the principles of the Linked Data paradigm. Starting from this specific case, we discuss the importance of annotating and linking scientific and technical texts, in order to (a) compare and search them together with other (non-technical) Latin texts (b) train, apply and evaluate NLP resources on a non-standard variety of Latin. The paper also describes the fruitful interaction and coordination between NLP experts and traditional Latin scholars on a project requiring a large range of expertise.
- Published
- 2021
32. Linking the Lewis & Short Dictionary to the LiLa Knowledge Base of Interoperable Linguistic Resources for Latin
- Author
-
Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Mambrini, Francesco, Litta Modignani Picozzi, Eleonora Maria Gabriella, Passarotti, Marco Carlo, Ruffolo, Paolo, Mambrini Francesco (ORCID:0000-0003-0834-7562), Litta Eleonora (ORCID:0000-0002-0499-997X), Passarotti Marco (ORCID:0000-0002-9806-7187), Fersini, Elisabetta, Passarotti, Marco, Patti, Viviana, Mambrini, Francesco, Litta Modignani Picozzi, Eleonora Maria Gabriella, Passarotti, Marco Carlo, Ruffolo, Paolo, Mambrini Francesco (ORCID:0000-0003-0834-7562), Litta Eleonora (ORCID:0000-0002-0499-997X), and Passarotti Marco (ORCID:0000-0002-9806-7187)
- Abstract
This paper describes the steps taken to include data from the Lewis & Short bilingual Latin-English dictionary into the Knowledge Base of linguistic resources for Latin LiLa. First, data were extracted from the original XML and matched with entries in LiLa, overcoming ambiguities and structural inconsistencies in the source. Subsequently, senses were modelled using the Ontolex Lemon Lexicographic module (lexicog), so that they could be included in the LiLa Knowledge Base and thus made interoperable with the (meta)data of the linguistic resources for Latin therein interlinked.
- Published
- 2021
33. Constrained Relational Topic Models
- Author
-
Terragni, S, Fersini, E, Messina, E, Terragni, Silvia, Fersini, Elisabetta, Messina, Enza, Terragni, S, Fersini, E, Messina, E, Terragni, Silvia, Fersini, Elisabetta, and Messina, Enza
- Abstract
Relational topic models (RTM) have been widely used to discover hidden topics in a collection of networked documents. In this paper, we introduce the class of Constrained Relational Topic Models (CRTM), a semi-supervised extension of RTM that, apart from modeling the structure of the document network, explicitly models some available domain knowledge. We propose two instances of CRTM that incorporate prior knowledge in the form of document constraints. The models smooth the probability distribution of topics such that two constrained documents can either share the same topics or denote distinct themes. Experimental results on benchmark relational datasets show significant performances of CRTM on a semi-supervised document classification task.
- Published
- 2020
34. CAGE: Constrained deep Attributed Graph Embedding
- Author
-
Nozza, D, Fersini, E, Messina, E, Nozza, Debora, Fersini, Elisabetta, Messina, Enza, Nozza, D, Fersini, E, Messina, E, Nozza, Debora, Fersini, Elisabetta, and Messina, Enza
- Abstract
In this paper we deal with complex attributed graphs which can exhibit rich connectivity patterns and whose nodes are often associated with attributes, such as text or images. In order to analyze these graphs, the primary challenge is to find an effective way to represent them by preserving both structural properties and node attribute information. To create low-dimensional and meaningful embedded representations of these complex graphs, we propose a fully unsupervised model based on Deep Learning architectures, called Constrained Attributed Graph Embedding model (CAGE). The main contribution of the proposed model is the definition of a novel two-phase optimization problem that explicitly models node attributes to obtain a higher representation expressiveness while preserving the local and the global structural properties of the graph. We validated our approach on two different benchmark datasets for node classification. Experimental results demonstrate that this novel representation provides significant improvements compared to state of the art approaches, also showing higher robustness with respect to the size of the training data.
- Published
- 2020
35. EVALITA Evaluation of NLP and Speech Tools for Italian
- Author
-
Ahluwalia, Resham, Anderson, Jacob, Arslan, Pinar, Bai, Xiaoyu, Bakarov, Amir, Balaraman, Vevake, Barbieri, Francesco, Basile, Angelo, Basile, Pierpaolo, Basile, Valerio, Basili, Roberto, Bennici, Mauro, Bianchini, Giulio, Biondi, Giulio, Bonavita, Ilaria, Bosco, Cristina, Buscaldi, Davide, Cabrio, Elena, Callow, Edward, Cardiff, John, Caselli, Tommaso, Chiusaroli, Francesca, Cignarella, Alessandra Teresa, Cimino, Andrea, Cock, Martine De, Coman, Andrei Catalin, Corazza, Michele, Croce, Danilo, Cutugno, Francesco, Dell’Orletta, Felice, Delmonte, Rodolfo, De la Peña Sarracén, Gretel Liz, Di Bari, Gabriele, Di Maro, Maria, Di Rosa, Emanuele, Durante, Alberto, Dwyer, Gareth, Falcone, Sara, Ferri, Lorenzo, Fersini, Elisabetta, Fortuna, Paula, Frenda, Simona, Gallicchio, Claudio, Gemmis, Marco de, Ghanem, Bilal, Giorni, Tommaso, Girardi, Daniela, Giudice, Valentino, Guerini, Marco, Guzmán-Falcón, Estefanía, Magnini, Bernardo, Magnolini, Simone, Mattei, Lorenzo De, Medina Pagola, José E., Menini, Stefano, Merenda, Flavio, Micheli, Alessio, Milani, Alfredo, Mohammad, Saif M., Montes-y-Gómez, Manuel, Monti, Johanna, Muñiz Cuza, Carlos Enrique, Nascimento, Anderson, Nechaev, Yaroslav, Nicola, Giancarlo, Nissim, Malvina, Novielli, Nicole, Nozza, Debora, Nunes, Sérgio, Origlia, Antonio, Ortega-Bueno, Reynier, Pamungkas, Endang Wahyu, Pascucci, Antonio, Patti, Viviana, Poletto, Fabio, Polignano, Marco, Pons, Reynaldo Gil, Ronzano, Francesco, Rosso, Paolo, Rubagotti, Chiara, Sangati, Federico, Sanguinetti, Manuela, Santilli, Andrea, Santucci, Valentino, Sarli, Daniele Di, Seijas Portocarrero, Xileny, Semeraro, Giovanni, Shushkevich, Elena, Siciliani, Lucia, Soni, Himani, Spina, Stefania, Sprugnoli, Rachele, Squadrone, Luca, Tesconi, Maurizio, Tonelli, Sara, Villaseñor-Pineda, Luis, Villata, Serena, Zaghi, Claudia, Zara, Giacomo, Caselli, Tommaso, Novielli, Nicole, Patti, Viviana, and Rosso, Paolo
- Subjects
elaborazione del linguaggio naturale ,Language & Linguistics ,strumenti del linguaggio ,outils du langage ,Speech Tools ,LAN009000 ,CF ,traitement du langage naturel ,Natural Language Processing - Abstract
EVALITA is a periodic evaluation campaign of Natural Language Processing (NLP) and speech tools for the Italian language. The general objective of EVALITA is to promote the development of language and speech technologies for the Italian language, providing a shared framework where different systems and approaches can be evaluated in a consistent manner. The diffusion of shared tasks and shared evaluation practices is a crucial step towards the development of resources and tools for NLP and speech sciences. The good response obtained by EVALITA, both in the number of participants and in the quality of results, showed that it is worth pursuing such goals for the Italian language. As a side effect of the evaluation campaign, both training and test data are available to the scientific community as benchmarks for future improvements. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC) and it is endorsed by the Italian Association for Artificial Intelligence (AI*IA) and the Italian Association for Speech Sciences (AISV).
- Published
- 2019
36. Editorial to Computers & operations research
- Author
-
Messina, Enza, Fersini, Elisabetta, Vigo, Daniele, Guerriero, Francesca, Messina, Enza, Fersini, Elisabetta, Vigo, Daniele, and Guerriero, Francesca
- Published
- 2019
- Full Text
- View/download PDF
37. Sentiment Analysis in Social Networks
- Author
-
Fersini, Elisabetta, Fersini, Elisabetta, Messina, Enza, Pozzi, Federico Alberto, Fersini, Elisabetta, Fersini, Elisabetta, Messina, Enza, and Pozzi, Federico Alberto
- Abstract
The aim of Sentiment Analysis is to define automatic tools able to extract subjective information from texts in natural language, such as opinions and sentiments, in order to create structured and actionable knowledge to be used by either a decision support system or a decision maker. Sentiment analysis has gained even more value with the advent and growth of social networking. Sentiment Analysis in Social Networks begins with an overview of the latest research trends in the field. It then discusses the sociological and psychological processes underling social network interactions. The book explores both semantic and machine learning models and methods that address context-dependent and dynamic text in online social networks, showing how social network streams pose numerous challenges due to their large-scale, short, noisy, context- dependent and dynamic nature. Further, this volume: Takes an interdisciplinary approach from a number of computing domains, including natural language processing, machine learning, big data, and statistical methodologiesProvides insights into opinion spamming, reasoning, and social network analysisShows how to apply sentiment analysis tools for a particular application and domain, and how to get the best results for understanding the consequencesServes as a one-stop reference for the state-of-the-art in social media analyticsTakes an interdisciplinary approach from a number of computing domains, including natural language processing, big data, and statistical methodologiesProvides insights into opinion spamming, reasoning, and social network miningShows how to apply opinion mining tools for a particular application and domain, and how to get the best results for understanding the consequencesServes as a one-stop reference for the state-of-the-art in social media analytics
- Published
- 2016
38. Computers & operations research
- Author
-
Messina, Enza, primary, Fersini, Elisabetta, additional, Vigo, Daniele, additional, and Guerriero, Francesca, additional
- Published
- 2019
- Full Text
- View/download PDF
39. Deep Learning for Feature Representation in Natural Language Processing
- Author
-
FERSINI, ELISABETTA, Nozza, D, MESSINA, VINCENZINA, VIZZARI, GIUSEPPE, NOZZA, DEBORA, FERSINI, ELISABETTA, Nozza, D, MESSINA, VINCENZINA, VIZZARI, GIUSEPPE, and NOZZA, DEBORA
- Abstract
La mole di dati generata dagli utenti sul Web è esponenzialmente cresciuta negli ultimi dieci anni, creando nuove e rilevanti opportunità per ogni tipo di dominio applicativo. Per risolvere i problemi derivanti dall’eccessiva quantità di dati, la ricerca nell’ambito dell’elaborazione del linguaggio naturale si è mossa verso lo sviluppo di modelli computazionali capaci di capirlo ed interpretarlo senza (o quasi) alcun intervento umano. Recentemente, questo campo di studi è stato testimone di un incremento sia in termini di efficienza computazionale che di risultati, per merito dell’avvento di una nuova linea di ricerca nell’apprendimento automatico chiamata Deep Learning. Questa tesi si focalizza in modo particolare su una specifica classe di modelli di Deep Learning atta ad apprendere rappresentazioni di alto livello, e conseguentemente più significative, dei dati di input in ambiente non supervisionato. Nelle tecniche di Deep Learning, queste rappresentazioni sono ottenute tramite multiple trasformazioni non lineari di complessità e astrazione crescente a partire dai dati di input. Questa fase, in cui vengono elaborate le sopracitate rappresentazioni, è un processo cruciale per l’elaborazione del linguaggio naturale in quanto include la procedura di trasformazione da simboli discreti (es. lettere) a una rappresentazione vettoriale che può essere facilmente trattata da un elaboratore. Inoltre, questa rappresentazione deve anche essere in grado di codificare la sintattica e la semantica espressa nel linguaggio utilizzato nei dati. La prima direzione di ricerca di questa tesi mira ad evidenziare come i modelli di elaborazione del linguaggio naturale possano essere potenziati dalle rappresentazioni ottenute con metodi non supervisionati di Deep Learning al fine di conferire un senso agli ingenti contenuti generati dagli utenti. Nello specifico, questa tesi si focalizza su diversi ambiti che sono considerati cruciali per capire di cosa il testo tratti (Named En, The huge amount of textual user-generated content on the Web has incredibly grown in the last decade, creating new relevant opportunities for different real-world applications and domains. To overcome the difficulties of dealing with this large volume of unstructured data, the research field of Natural Language Processing has provided efficient solutions developing computational models able to understand and interpret human natural language without any (or almost any) human intervention. This field has gained in further computational efficiency and performance from the advent of the recent machine learning research lines concerned with Deep Learning. In particular, this thesis focuses on a specific class of Deep Learning models devoted to learning high-level and meaningful representations of input data in unsupervised settings, by computing multiple non-linear transformations of increasing complexity and abstraction. Indeed, learning expressive representations from the data is a crucial step in Natural Language Processing, because it involves the transformation from discrete symbols (e.g. characters) to a machine-readable representation as real-valued vectors, which should encode semantic and syntactic meanings of the language units. The first research direction of this thesis is aimed at giving evidence that enhancing Natural Language Processing models with representations obtained by unsupervised Deep Learning models can significantly improve the computational abilities of making sense of large volume of user-generated text. In particular, this thesis addresses tasks that were considered crucial for understanding what the text is talking about, by extracting and disambiguating the named entities (Named Entity Recognition and Linking), and which opinion the user is expressing, dealing also with irony (Sentiment Analysis and Irony Detection). For each task, this thesis proposes a novel Natural Language Processing model enhanced by the data representation obtained by
- Published
- 2018
40. A comparison of graph-based word sense induction clustering algorithms in a pseudoword evaluation framework
- Author
-
Cecchini, F, Riedl, M, Fersini, E, Biemann, C, Cecchini, Flavio Massimiliano, Riedl, Martin, Fersini, Elisabetta, Biemann, Chris, Cecchini, F, Riedl, M, Fersini, E, Biemann, C, Cecchini, Flavio Massimiliano, Riedl, Martin, Fersini, Elisabetta, and Biemann, Chris
- Abstract
This article presents a comparison of different Word Sense Induction (wsi) clustering algorithms on two novel pseudoword data sets of semantic-similarity and co-occurrence-based word graphs, with a special focus on the detection of homonymic polysemy. We follow the original definition of a pseudoword as the combination of two monosemous terms and their contexts to simulate a polysemous word. The evaluation is performed comparing the algorithm’s output on a pseudoword’s ego word graph (i.e., a graph that represents the pseudoword’s context in the corpus) with the known subdivision given by the components corresponding to the monosemous source words forming the pseudoword. The main contribution of this article is to present a self-sufficient pseudoword-based evaluation framework for wsi graph-based clustering algorithms, thereby defining a new evaluation measure (top2) and a secondary clustering process (hyperclustering). To our knowledge, we are the first to conduct and discuss a large-scale systematic pseudoword evaluation targeting the induction of coarse-grained homonymous word senses across a large number of graph clustering algorithms.
- Published
- 2018
41. A comparison of graph-based word sense induction clustering algorithms in a pseudoword evaluation framework
- Author
-
Cecchini, Flavio Massimiliano, Riedl, Martin, Fersini, Elisabetta, Biemann, Chris, Cecchini, Flavio Massimiliano (ORCID:0000-0001-9029-1822), Cecchini, Flavio Massimiliano, Riedl, Martin, Fersini, Elisabetta, Biemann, Chris, and Cecchini, Flavio Massimiliano (ORCID:0000-0001-9029-1822)
- Abstract
This article presents a comparison of different Word Sense Induction (wsi) clustering algorithms on two novel pseudoword data sets of semantic-similarity and co-occurrence-based word graphs, with a special focus on the detection of homonymic polysemy. We follow the original definition of a pseudoword as the combination of two monosemous terms and their contexts to simulate a polysemous word. The evaluation is performed comparing the algorithm’s output on a pseudoword’s ego word graph (i.e., a graph that represents the pseudoword’s context in the corpus) with the known subdivision given by the components corresponding to the monosemous source words forming the pseudoword. The main contribution of this article is to present a self-sufficient pseudoword-based evaluation framework for wsi graph-based clustering algorithms, thereby defining a new evaluation measure (top2) and a secondary clustering process (hyperclustering). To our knowledge, we are the first to conduct and discuss a large-scale systematic pseudoword evaluation targeting the induction of coarse-grained homonymous word senses across a large number of graph clustering algorithms.
- Published
- 2018
42. EVALITA. Evaluation of NLP and Speech Tools for Italian
- Author
-
Alzetta, Chiara, Attardi, Giuseppe, Badino, Leonardo, Barbieri, Francesco, Basile, Pierpaolo, Basile, Valerio, Basili, Roberto, Bentham, Jereemi, Bhardwaj, Divyanshu, Bolioli, Andrea, Bondielli, Alessandro, Bosco, Cristina, Buscaldi, Davide, Caputo, Annalina, Caselli, Tommaso, Castellucci, Giuseppe, Cecchini, Flavio Massimiliano, Cieliebak, Mark, Cimino, Andrea, Corcoglioniti, Francesco, Cosi, Piero, Covella, Vito Vincenzo, Cozza, Vittoria, Croce, Danilo, Cutugno, Franco, Daelemans, Walter, Dell’Orletta, Felice, Deriu, Jan, De Carolis, Berardina, de Gemmis, Marco, Di Noia, Tommaso, Di Rosa, Emanuele, Durante, Alberto, Feltracco, Anna, Ferilli, Stefano, Fersini, Elisabetta, Fonseca, Erick R., Frenda, Simona, Gelbukh, Alexander, Gentile, Anna Lisa, Giuliano, Claudio, Graff, Mario, Hernandez-Farias, Delia Irazù, Horsmann, Tobias, La Bruna, Wanda, Lenci, Alessandro, Lops, Pasquale, Lovecchio, Francesco, Magnini, Bernardo, Magnolini, Simone, Majumder, Goutam, Manchanda, Pikakshi, Manzari, Vito, Mazzei, Alessandro, Messina, Enza, Minard, Anne-Lyse, Miranda-Jiménez, Sabino, Moctezuma, Daniela, Monachini, Monica, Nechaev, Yaroslav, Nissim, Malvina, Novielli, Nicole, Nozza, Debora, Paci, Giulio, Pakray, Partha, Palmero Aprosio, Alessio, Palmonari, Matteo, Passaro, Lucia C., Patti, Viviana, Pipitone, Arianna, Pirrone, Roberto, Plank, Barbara, Qwaider, Mohammed R. H., Redavid, Domenico, Rizzo, Giuseppe, Russo, Irene, Saha, Saurav, Sartiano, Daniele, Sas, Cezar, Semplici, Federica, Simi, Maria, Speranza, Manuela, Sprugnoli, Rachele, Stemle, Egon W., Sucameli, Irene, Tamburini, Fabio, Tellez, Eric S., Tirone, Giuseppe, Zesch, Torsten, Basile, Pierpaolo, Cutugno, Franco, Nissim, Malvina, Patti, Viviana, and Sprugnoli, Rachele
- Subjects
reconnaissance téléphonique articulatoire ,entité appelée rEcognition et liens dans le tweets italien ,sentiment polarity classification ,event factuality annotation ,articulatory phone recognition ,Linguistics ,CF ,annotazione fattualità degli eventi ,etichettare per messaggi social media ,riconoscimento telefonico articolare ,classificazione polarità sentimenti ,tagging for italian social media texts ,computational linguistics ,named entity rEcognition and linking in italian tweets ,linguistique computationelle ,classement polarité sentiments ,LAN009000 ,entità chiamata rEcognition e collegamenti nei tweet italiani ,linguistica computazionale ,étiqueter les messages des médias sociaux ,annotation de facturation de l'événement - Abstract
EVALITA is the evaluation campaign of Natural Language Processing and Speech Tools for the Italian language: since 2007 shared tasks have been proposed covering the analysis of both written and spoken language with the aim of enhancing the development and dissemination of resources and technologies for Italian. EVALITA is an initiative of the Italian Association for Computational Linguistics (AILC, http://www.ai-lc.it/) and it is supported by the NLP Special Interest Group of the Italian Association for Artificial Intelligence (AI*IA, http://www.aixia.it/) and by the Italian Association of Speech Science (AISV, http://www.aisv.it/). In this volume, we collect the reports of the tasks’ organisers and of the participants to all of the EVALITA 2016’s tasks, which are the following: ArtiPhone - Articulatory Phone Recognition; FactA - Event Factuality Annotation; NEEL-IT - Named Entity rEcognition and Linking in Italian Tweets; PoSTWITA - POS tagging for Italian Social Media Texts; QA4FAQ - Question Answering for Frequently Asked Questions; SENTIPOLC - SENTIment POLarity Classification. Notice that the volume does not include reports related to the IBM Watson Services Challenge organised by IBM Italy, but information can be found at http://www.evalita.it/2016/tasks/ibm-challenge. Before the task and participant reports, we also include an overview to the campaign that describes the tasks in more detail, provides figures on the participants, and, especially, highlights the innovations introduced at this year’s edition. An additional report presents a reflection on the outcome of two questionnaires filled by past participants and organisers of EVALITA, and of the panel “Raising Interest and Collecting Suggestions on the EVALITA Evaluation Campaign” held at CLIC-it 2015.
- Published
- 2017
43. Actively Learning to Rank Semantic Associations for Personalized Contextual Exploration of Knowledge Graphs
- Author
-
Bianchi, F, Palmonari, M, Cremaschi, M, Fersini, E, BIANCHI, FEDERICO, PALMONARI, MATTEO LUIGI, CREMASCHI, MARCO, FERSINI, ELISABETTA, Bianchi, F, Palmonari, M, Cremaschi, M, Fersini, E, BIANCHI, FEDERICO, PALMONARI, MATTEO LUIGI, CREMASCHI, MARCO, and FERSINI, ELISABETTA
- Abstract
Knowledge Graphs (KG) represent a large amount of Semantic Associations (SAs), i.e., chains of relations that may reveal interesting and unknown connections between different types of entities. Applications for the contextual exploration of KGs help users explore information extracted from a KG, including SAs, while they are reading an input text. Because of the large number of SAs that can be extracted from a text, a first challenge in these applications is to effectively determine which SAs are most interesting to the users, defining a suitable ranking function over SAs. However, since different users may have different interests, an additional challenge is to personalize this ranking function to match individual users’ preferences. In this paper we introduce a novel active learning to rank model to let a user rate small samples of SAs, which are used to iteratively learn a personalized ranking function. Experiments conducted with two data sets show that the approach is able to improve the quality of the ranking function with a limited number of user interactions.
- Published
- 2017
44. Named Entity Recognition: Resource Constrained Maximum Path
- Author
-
Di Puglia Pugliese, L, Fersini, E, Guerriero, F, Messina, V, FERSINI, ELISABETTA, MESSINA, VINCENZINA, Di Puglia Pugliese, L, Fersini, E, Guerriero, F, Messina, V, FERSINI, ELISABETTA, and MESSINA, VINCENZINA
- Abstract
Information Extraction (IE) is a process focused on automatic extraction of structured information from unstructured text sources. One open research field of IE relates to Named Entity Recognition (NER), aimed at identifying and associating atomic elements in a given text to a predefined category such as names of persons, organizations, locations and so on. This problem can be formalized as the assignment of a finite sequence of semantic labels to a set of interdependent variables associated with text fragments, and can modelled through a stochastic process involving both hidden variables (semantic labels) and observed variables (textual cues). In this work we investigate one of the most promising model for NER based on Conditional Random Fields (CRFs). CRFs are enhanced in a two stages approach to include in the decision process logic rules that can be either extracted from data or defined by domain experts. The problem is defined as a Resource Constrained Maximum Path Problem (RCMPP) associating a resource with each logic rule. Proper resource Extension Functions (REFs) and upper bound on the resource consumptions are defined in order to model the logic rules as knapsack-like constraints. A well-tailored dynamic programming procedure is defined to address the RCMPP
- Published
- 2017
45. A multi-view sentiment corpus
- Author
-
Nozza, D, Fersini, E, Messina, V, NOZZA, DEBORA, FERSINI, ELISABETTA, MESSINA, VINCENZINA, Nozza, D, Fersini, E, Messina, V, NOZZA, DEBORA, FERSINI, ELISABETTA, and MESSINA, VINCENZINA
- Abstract
Sentiment Analysis is a broad task that involves the analysis of various aspect of the natural language text. However, most of the approaches in the state of the art usually investigate independently each aspect, i.e. Subjectivity Classification, Sentiment Polarity Classification, Emotion Recognition, Irony Detection. In this paper we present a Multi-View Sentiment Corpus (MVSC), which comprises 3000 English microblog posts related the movie domain. Three independent annotators manually labelled MVSC, following a broad annotation schema about different aspects that can be grasped from natural language text coming from social networks. The contribution is therefore a corpus that comprises five different views for each message, i.e. subjective/objective, sentiment polarity, implicit/explicit, irony, emotion. In order to allow a more detailed investigation on the human labelling behaviour, we provide the annotations of each human annotator involved.
- Published
- 2017
46. Graph-based Clustering Algorithms for Word Sense Induction
- Author
-
BIEMANN, CHRISTIAN, Cecchini, F, DE PAOLI, FLAVIO MARIA, FERSINI, ELISABETTA, CECCHINI, FLAVIO MASSIMILIANO, BIEMANN, CHRISTIAN, Cecchini, F, DE PAOLI, FLAVIO MARIA, FERSINI, ELISABETTA, and CECCHINI, FLAVIO MASSIMILIANO
- Abstract
La presente tesi si occupa dell’induzione dei significati delle parole (ISP o word sense induction - WSI), una branca dell’elaborazione dei linguaggi naturali il cui scopo è individuare ed elencare automaticamente e in modo non supervisionato i possibili significati o sensi che un termine può assumere relativamente ai differenti contesti in cui esso viene impiegato, senza ricorrere a risorse esterne quali dizionari od ontologie. Fra i vari approcci alla ISP esistenti, ci siamo concentrati in particolare su quelli che modellano il contesto di una parola tramite un grafo, su cui viene fatto operare un algoritmo per partizionarlo: la partizione che ne risulta ha come interpretazione l’implicita descrizione dei possibili sensi di quella parola. Le nozioni fondamentali dell’ISP, alcuni concetti base e approcci alla ISP scelti fra la letteratura del campo vengono presentati e analizzati nella prima parte del lavoro. Nella seconda parte introduciamo il nostro triplice contributo. Per prima cosa definiamo ed esploriamo la distanza di Jaccard pesata (assieme a una sua versione non pesata), cioè una distanza sui nodi di un grafo non orientato con pesi positivi, che usiamo per ottenere relazioni di secondo ordine dalle relazioni di primo ordine eventualmente modellate dal grafo (p.e. cooccorrenze). Inoltre, definiamo la nozione correlata di “arco passerella”, un arco “separatore” il cui peso è superiore alla media dei pesi sugli archi uscenti da una delle sue due estremità, e definiamo anche una nuova interpretazione sintetica delle curvatura di un grafo, vista come la differenza delle distanze di Jaccard pesata e non pesata sugli archi. La distanza di Jaccard da noi definita è alla base del nostro secondo contributo: tre nuovi algoritmi di clustering su grafo espressamente creati per l’ISP, rispettivamente l’algoritmo di clustering per passerelle, un algoritmo di clustering aggregativo e uno basato sulla curvatura. Il terzo contributo di questa tesi è un nuovo sistema, This dissertation is about Word Sense Induction (WSI), a branch of Natural Language Processing concerned with the automated, unsupervised detection and listing of the possible senses that a word can assume relative to the different contexts in which it appears. To this end, no external resources like dictionaries or ontologies are used. Among the many existing approaches to WSI, we focus specifically on modelling the context of a word through a graph and on running a clustering algorithm on it: the resulting clusters are interpreted as implicitly describing the possible senses of the word. Fundamental notions of WSI, basic concepts and some WSI approaches selected from literature are presented and examined in the first part of this work. In the second part, we introduce our threefold contribution. Firstly, we define and explore a weighted (together with an unweighted) Jaccard distance, i.e. a distance on the nodes of a positively weighted undirected graph which we use to obtain second-order relations from the first-order ones modelled by the graph (e.g. co-occurrences). Moreover, we define the related notion of gangplank edge, a separator edge with weight greater than the mean weights of the edges incident to either of its two ends, and finally a new synthetic interpretation of the curvature on a graph, seen as the difference between weighted and unweighted Jaccard distances between node couples. Our Jaccard distance is at the basis of the second contribution: three novel graph-based clustering algorithms expressly created for the task of WSI, respectively the gangplank clustering algorithm, an aggregative clustering algorithm and a curvature-based clustering algorithm. The third contribution is a novel evaluation framework for graph-based clustering algorithms for WSI, consisting of two word graph data sets (one for co-occurrences and one for semantic similarities) and a new ad hoc evaluation measure built around pseudowords. A pseudoword is the artificial conflati
- Published
- 2017
47. Towards Adaptation of Named Entity Recognition and Linking Frameworks
- Author
-
PALMONARI, MATTEO LUIGI, Manchanda, P, FERSINI, ELISABETTA, MAURI, GIANCARLO, MANCHANDA, PIKAKSHI, PALMONARI, MATTEO LUIGI, Manchanda, P, FERSINI, ELISABETTA, MAURI, GIANCARLO, and MANCHANDA, PIKAKSHI
- Abstract
L'estrazione di informazioni strutturate a partire dal “web non strutturato”, ha suscitato un notevole interesse da parte delle comunità scientifiche che si occupano di elaborazione del linguaggio naturale e di sistemi basati sulla conoscenza per sviluppare a pieno la visione del “web semantico”. Nell'era moderna, l'uso pervasivo e diffuso delle reti sociali ha portato alla produzione di un flusso continuo di informazioni su piattaforme quali Twitter o Facebook, definite anche piattaforme di microblogging. Tali sorgenti informative, accessibili in tempo reale, producono informazioni caratterizzate dalla presenza costante di rumore e ambiguità linguistiche che rendono particolarmente difficoltoso il compito di estrarre informazioni strutturate. Tale estrazione è tuttavia cruciale per poter arricchire grandi basi di conoscenza, oggi usate in molte applicazioni industriali e di ricerca, con informazioni nuove e rilevanti. Ne risulta che nell'ultimo decennio sono aumentati significativamente gli sforzi della ricerca nel campo dell’elaborazione del linguaggio naturale per l'estrazione di informazioni da piattaforme di microblogging, con particolare attenzione nei confronti dell’estrazione e identificazione di entità nominali (anche Named Entity Extraction and Linking o NEEL). Oggigiorno esistono numerosi sistemi di NEEL, di cui la maggior parte però creati a scopo commerciale. La calibrazione dei componenti di un sistema di NEEL, cioè dei componenti per la rilevazione, la disambiguazione e l'identificazione di entità nominali, nel caso di piattaforme di microblogging come Twitter e Facebook è difficile in particolare a causa delle tipologie di testo considerato. Mancano approcci di ricerca sistematici volti a guidare l'utilizzo e il miglioramento di tali componenti, per la realizzazione di sistemi più robusti, in grado di meglio adattarsi all’emergere di nuove informazioni, e nuovi interessi (ad esempio, a estrarre tipi di entità nuovi rispetto a quelli conside, Natural Language Processing and Knowledge Base Experts are actively involved in extracting structured information from the Unstructured Web in order to realize the Semantic Web Vision. Diverse forms of unstructured information is easily available today to research scientists from social media platforms such as Twitter and Facebook in real time. %Knowledge discovery from the diverse forms of unstructured information available today from social media platforms such as Twitter and Facebook in real time, thus, plays a key role for this goal. The comprehensive and widespread use of such platforms in the modern age has led to a continuous stream of evolving information along with a constant presence of noise, and ambiguity which makes the task of extracting structured information difficult. An essential step is therefore identification of relevant information from the point of view of knowledge base enrichment. As a result, research efforts towards Information Extraction and Natural Language Processing Frameworks have increased significantly over the past decade, Named Entity Extraction and Linking (NEEL) Frameworks being one of the very prevalent ones. Numerous NEEL frameworks exist today, however, mostly for commercial purposes. The orchestration of components of a NEEL framework, i.e., named entity recognition component, named entity disambiguation and named entity linking component, for microblogging platforms such as Twitter and Facebook is difficult in particular due to the type of text under consideration. As a result, there is little research in the use and improvement of such components towards a more robust framework that can be adapted to emerging information in real time. This thesis discusses the challenges faced by conventional NEEL frameworks when faced with textual formats such as tweets and investigates several approaches to improve the performance of the components and of the NEEL framework as a whole. A key hypothesis is that the performance of such a f
- Published
- 2017
48. TWINE: A real-time system for TWeet analysis via INformation extraction
- Author
-
Nozza, D, Ristagno, F, Palmonari, M, Fersini, E, Manchanda, P, Messina, V, NOZZA, DEBORA, PALMONARI, MATTEO LUIGI, FERSINI, ELISABETTA, MANCHANDA, PIKAKSHI, MESSINA, VINCENZINA, Nozza, D, Ristagno, F, Palmonari, M, Fersini, E, Manchanda, P, Messina, V, NOZZA, DEBORA, PALMONARI, MATTEO LUIGI, FERSINI, ELISABETTA, MANCHANDA, PIKAKSHI, and MESSINA, VINCENZINA
- Abstract
In the recent years, the amount of user generated contents shared on the Web has significantly increased, especially in social media environment, e.g. Twitter, Facebook, Google+. This large quantity of data has generated the need of reactive and sophisticated systems for capturing and understanding the underlying information enclosed in them. In this paper we present TWINE, a real-time system for the big data analysis and exploration of information extracted from Twitter streams. The proposed system based on a Named Entity Recognition and Linking pipeline and a multi-dimensional spatial geo-localization is managed by a scalable and flexible architecture for an interactive visualization of micropost streams insights. The demo is available at http://twinemind. cloudapp.net/streaming1,2.
- Published
- 2017
49. Towards adaptation of Named Entity classification
- Author
-
Manchanda, P, Fersini, E, Palmonari, M, Nozza, D, Messina, V, MANCHANDA, PIKAKSHI, FERSINI, ELISABETTA, PALMONARI, MATTEO LUIGI, NOZZA, DEBORA, MESSINA, VINCENZINA, Manchanda, P, Fersini, E, Palmonari, M, Nozza, D, Messina, V, MANCHANDA, PIKAKSHI, FERSINI, ELISABETTA, PALMONARI, MATTEO LUIGI, NOZZA, DEBORA, and MESSINA, VINCENZINA
- Abstract
Numerous state-of-the-art Named Entity Recognition (NER) systems use different classification schemas/ontologies. Comparisons and integration among NER systems, thus, becomes complex. In this paper, we propose a transfer-learning approach where we use supervised learning methods to automatically learn mappings between ontologies of NER systems, where an input probability distribution over a set of entity types defined in a source ontology is mapped to a target distribution over the entity types defined for a target ontology. Experiments conducted with benchmark data show valuable re-classification performance of entity mentions, suggesting our approach as a promising one for domain adaptation of NER systems.
- Published
- 2017
50. Deep learning and ensemble methods for domain adaptation
- Author
-
Bourbakis, N, Esposito, A, Mali, A, Alamaniotis, M, Nozza, D, Fersini, E, Messina, V, NOZZA, DEBORA, FERSINI, ELISABETTA, MESSINA, VINCENZINA, Bourbakis, N, Esposito, A, Mali, A, Alamaniotis, M, Nozza, D, Fersini, E, Messina, V, NOZZA, DEBORA, FERSINI, ELISABETTA, and MESSINA, VINCENZINA
- Abstract
Real world applications of machine learning in natural language processing can span many different domains and usually require a huge effort for the annotation of domain specific training data. For this reason, domain adaptation techniques have gained a lot of attention in the last years. In order to derive an effective domain adaptation, a good feature representation across domains is crucial as well as the generalisation ability of the predictive model. In this paper we address the problem of domain adaptation for sentiment classification by combining deep learning, for acquiring a cross-domain high-level feature representation, and ensemble methods, for reducing the cross-domain generalization error. The proposed adaptation framework has been evaluated on a benchmark dataset composed of reviews of four different Amazon category of products, significantly outperforming the state of the art methods.
- Published
- 2017
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.