Author: "Schlichtkrull, Michael" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Schlichtkrull, Michael"' showing total 35 results

Start Over Author "Schlichtkrull, Michael"

35 results on '"Schlichtkrull, Michael"'

1. Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking

Author: Akhtar, Mubashara, Schlichtkrull, Michael, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Information Retrieval, Computer Science - Machine Learning
Abstract: Current automated fact-checking (AFC) approaches commonly evaluate evidence either implicitly via the predicted verdicts or by comparing retrieved evidence with a predefined closed knowledge source, such as Wikipedia. However, these methods suffer from limitations, resulting from their reliance on evaluation metrics developed for different purposes and constraints imposed by closed knowledge sources. Recent advances in natural language generation (NLG) evaluation offer new possibilities for evidence assessment. In this work, we introduce Ev2R, an evaluation framework for AFC that comprises three types of approaches for evidence evaluation: reference-based, proxy-reference, and reference-less. We evaluate their effectiveness through agreement with human ratings and adversarial tests, and demonstrate that prompt-based scorers, particularly those leveraging LLMs and reference evidence, outperform traditional evaluation approaches., Comment: 10 pages
Published: 2024

2. The Automated Verification of Textual Claims (AVeriTeC) Shared Task

Author: Schlichtkrull, Michael, Chen, Yulong, Whitehouse, Chenxi, Deng, Zhenyun, Akhtar, Mubashara, Aly, Rami, Guo, Zhijiang, Christodoulopoulos, Christos, Cocarascu, Oana, Mittal, Arpit, Thorne, James, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language
Abstract: The Automated Verification of Textual Claims (AVeriTeC) shared task asks participants to retrieve evidence and predict veracity for real-world claims checked by fact-checkers. Evidence can be found either via a search engine, or via a knowledge store provided by the organisers. Submissions are evaluated using AVeriTeC score, which considers a claim to be accurately verified if and only if both the verdict is correct and retrieved evidence is considered to meet a certain quality threshold. The shared task received 21 submissions, 18 of which surpassed our baseline. The winning team was TUDA_MAI with an AVeriTeC score of 63%. In this paper we describe the shared task, present the full results, and highlight key takeaways from the shared task.
Published: 2024

3. Generating Media Background Checks for Automated Source Critical Reasoning

Author: Schlichtkrull, Michael
Subjects: Computer Science - Computation and Language
Abstract: Not everything on the internet is true. This unfortunate fact requires both humans and models to perform complex reasoning about credibility when working with retrieved information. In NLP, this problem has seen little attention. Indeed, retrieval-augmented models are not typically expected to distrust retrieved documents. Human experts overcome the challenge by gathering signals about the context, reliability, and tendency of source documents - that is, they perform source criticism. We propose a novel NLP task focused on finding and summarising such signals. We introduce a new dataset of 6,709 "media background checks" derived from Media Bias / Fact Check, a volunteer-run website documenting media bias. We test open-source and closed-source LLM baselines with and without retrieval on this dataset, finding that retrieval greatly improves performance. We furthermore carry out human evaluation, demonstrating that 1) media background checks are helpful for humans, and 2) media background checks are helpful for retrieval-augmented models.
Published: 2024

4. Document-level Claim Extraction and Decontextualisation for Fact-Checking

Author: Deng, Zhenyun, Schlichtkrull, Michael, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language
Abstract: Selecting which claims to check is a time-consuming task for human fact-checkers, especially from documents consisting of multiple sentences and containing multiple claims. However, existing claim extraction approaches focus more on identifying and extracting claims from individual sentences, e.g., identifying whether a sentence contains a claim or the exact boundaries of the claim within a sentence. In this paper, we propose a method for document-level claim extraction for fact-checking, which aims to extract check-worthy claims from documents and decontextualise them so that they can be understood out of context. Specifically, we first recast claim extraction as extractive summarization in order to identify central sentences from documents, then rewrite them to include necessary context from the originating document through sentence decontextualisation. Evaluation with both automatic metrics and a fact-checking professional shows that our method is able to extract check-worthy claims from documents more accurately than previous work, while also improving evidence retrieval., Comment: Accepted to ACL 2024
Published: 2024

5. Multimodal Automated Fact-Checking: A Survey

Author: Akhtar, Mubashara, Schlichtkrull, Michael, Guo, Zhijiang, Cocarascu, Oana, Simperl, Elena, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition
Abstract: Misinformation is often conveyed in multiple modalities, e.g. a miscaptioned image. Multimodal misinformation is perceived as more credible by humans, and spreads faster than its text-only counterparts. While an increasing body of research investigates automated fact-checking (AFC), previous surveys mostly focus on text. In this survey, we conceptualise a framework for AFC including subtasks unique to multimodal misinformation. Furthermore, we discuss related terms used in different communities and map them to our framework. We focus on four modalities prevalent in real-world fact-checking: text, image, audio, and video. We survey benchmarks and models, and discuss limitations and promising directions for future research, Comment: The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP): Findings
Published: 2023

6. AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web

Author: Schlichtkrull, Michael, Guo, Zhijiang, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language
Abstract: Existing datasets for automated fact-checking have substantial limitations, such as relying on artificial claims, lacking annotations for evidence and intermediate reasoning, or including evidence published after the claim. In this paper we introduce AVeriTeC, a new dataset of 4,568 real-world claims covering fact-checks by 50 different organizations. Each claim is annotated with question-answer pairs supported by evidence available online, as well as textual justifications explaining how the evidence combines to produce a verdict. Through a multi-round annotation process, we avoid common pitfalls including context dependence, evidence insufficiency, and temporal leakage, and reach a substantial inter-annotator agreement of $\kappa=0.619$ on verdicts. We develop a baseline as well as an evaluation scheme for verifying claims through several question-answering steps against the open web., Comment: Accepted to NeurIPS 2023 Datasets & Benchmarks Track
Published: 2023

7. The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who

Author: Schlichtkrull, Michael, Ousidhoum, Nedjma, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language
Abstract: Automated fact-checking is often presented as an epistemic tool that fact-checkers, social media consumers, and other stakeholders can use to fight misinformation. Nevertheless, few papers thoroughly discuss how. We document this by analysing 100 highly-cited papers, and annotating epistemic elements related to intended use, i.e., means, ends, and stakeholders. We find that narratives leaving out some of these aspects are common, that many papers propose inconsistent means and ends, and that the feasibility of suggested strategies rarely has empirical backing. We argue that this vagueness actively hinders the technology from reaching its goals, as it encourages overclaiming, limits criticism, and prevents stakeholder feedback. Accordingly, we provide several recommendations for thinking and writing about the use of fact-checking artefacts., Comment: Accepted to the Findings of EMNLP 2023
Published: 2023

8. A Survey on Automated Fact-Checking

Author: Guo, Zhijiang, Schlichtkrull, Michael, and Vlachos, Andreas
Subjects: Computer Science - Computation and Language
Abstract: Fact-checking has become increasingly important due to the speed with which both information and misinformation can spread in the modern media ecosystem. Therefore, researchers have been exploring how fact-checking can be automated, using techniques based on natural language processing, machine learning, knowledge representation, and databases to automatically predict the veracity of claims. In this paper, we survey automated fact-checking stemming from natural language processing, and discuss its connections to related tasks and disciplines. In this process, we present an overview of existing datasets and models, aiming to unify the various definitions given and identify common concepts. Finally, we highlight challenges for future research., Comment: Accepted at TACL 2022, 28 pages
Published: 2021

9. FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

Author: Aly, Rami, Guo, Zhijiang, Schlichtkrull, Michael, Thorne, James, Vlachos, Andreas, Christodoulopoulos, Christos, Cocarascu, Oana, and Mittal, Arpit
Subjects: Computer Science - Computation and Language
Abstract: Fact verification has attracted a lot of attention in the machine learning and natural language processing communities, as it is one of the key methods for detecting misinformation. Existing large-scale benchmarks for this task have focused mostly on textual sources, i.e. unstructured information, and thus ignored the wealth of information available in structured formats, such as tables. In this paper we introduce a novel dataset and benchmark, Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS), which consists of 87,026 verified claims. Each claim is annotated with evidence in the form of sentences and/or cells from tables in Wikipedia, as well as a label indicating whether this evidence supports, refutes, or does not provide enough information to reach a verdict. Furthermore, we detail our efforts to track and minimize the biases present in the dataset and could be exploited by models, e.g. being able to predict the label without using evidence. Finally, we develop a baseline for verifying claims against text and tables which predicts both the correct evidence and verdict for 18% of the claims., Comment: Accepted at NeurIPS 2021 Datasets and Benchmarks Track
Published: 2021

10. NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

Author: Min, Sewon, Boyd-Graber, Jordan, Alberti, Chris, Chen, Danqi, Choi, Eunsol, Collins, Michael, Guu, Kelvin, Hajishirzi, Hannaneh, Lee, Kenton, Palomaki, Jennimaria, Raffel, Colin, Roberts, Adam, Kwiatkowski, Tom, Lewis, Patrick, Wu, Yuxiang, Küttler, Heinrich, Liu, Linqing, Minervini, Pasquale, Stenetorp, Pontus, Riedel, Sebastian, Yang, Sohee, Seo, Minjoon, Izacard, Gautier, Petroni, Fabio, Hosseini, Lucas, De Cao, Nicola, Grave, Edouard, Yamada, Ikuya, Shimaoka, Sonse, Suzuki, Masatoshi, Miyawaki, Shumpei, Sato, Shun, Takahashi, Ryo, Suzuki, Jun, Fajcik, Martin, Docekal, Martin, Ondrej, Karel, Smrz, Pavel, Cheng, Hao, Shen, Yelong, Liu, Xiaodong, He, Pengcheng, Chen, Weizhu, Gao, Jianfeng, Oguz, Barlas, Chen, Xilun, Karpukhin, Vladimir, Peshterliev, Stan, Okhonko, Dmytro, Schlichtkrull, Michael, Gupta, Sonal, Mehdad, Yashar, and Yih, Wen-tau
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage contestants to explore the trade-off between storing retrieval corpora or the parameters of learned models. In this report, we describe the motivation and organization of the competition, review the best submissions, and analyze system predictions to inform a discussion of evaluation for open-domain QA., Comment: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track
Published: 2020

11. Joint Verification and Reranking for Open Fact Checking Over Tables

Author: Schlichtkrull, Michael, Karpukhin, Vladimir, Oğuz, Barlas, Lewis, Mike, Yih, Wen-tau, and Riedel, Sebastian
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Structured information is an important knowledge source for automatic verification of factual claims. Nevertheless, the majority of existing research into this task has focused on textual data, and the few recent inquiries into structured data have been for the closed-domain setting where appropriate evidence for each claim is assumed to have already been retrieved. In this paper, we investigate verification over structured data in the open-domain setting, introducing a joint reranking-and-verification model which fuses evidence documents in the verification component. Our open-domain model achieves performance comparable to the closed-domain state-of-the-art on the TabFact dataset, and demonstrates performance gains from the inclusion of multiple tables as well as a significant improvement over a heuristic retrieval baseline.
Published: 2020
Full Text: View/download PDF

12. UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering

Author: Oguz, Barlas, Chen, Xilun, Karpukhin, Vladimir, Peshterliev, Stan, Okhonko, Dmytro, Schlichtkrull, Michael, Gupta, Sonal, Mehdad, Yashar, and Yih, Scott
Subjects: Computer Science - Computation and Language
Abstract: We study open-domain question answering with structured, unstructured and semi-structured knowledge sources, including text, tables, lists and knowledge bases. Departing from prior work, we propose a unifying approach that homogenizes all sources by reducing them to text and applies the retriever-reader model which has so far been limited to text sources only. Our approach greatly improves the results on knowledge-base QA tasks by 11 points, compared to latest graph-based methods. More importantly, we demonstrate that our unified knowledge (UniK-QA) model is a simple and yet effective way to combine heterogeneous sources of knowledge, advancing the state-of-the-art results on two popular question answering benchmarks, NaturalQuestions and WebQuestions, by 3.5 and 2.6 points, respectively. The code of UniK-QA is available at: https://github.com/facebookresearch/UniK-QA., Comment: NAACL-HLT 2022 Findings
Published: 2020

13. Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking

Author: Schlichtkrull, Michael Sejr, De Cao, Nicola, and Titov, Ivan
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Graph neural networks (GNNs) have become a popular approach to integrating structural inductive biases into NLP models. However, there has been little work on interpreting them, and specifically on understanding which parts of the graphs (e.g. syntactic trees or co-reference structures) contribute to a prediction. In this work, we introduce a post-hoc method for interpreting the predictions of GNNs which identifies unnecessary edges. Given a trained GNN model, we learn a simple classifier that, for every edge in every layer, predicts if that edge can be dropped. We demonstrate that such a classifier can be trained in a fully differentiable fashion, employing stochastic gates and encouraging sparsity through the expected $L_0$ norm. We use our technique as an attribution method to analyze GNN models for two tasks -- question answering and semantic role labeling -- providing insights into the information flow in these models. We show that we can drop a large proportion of edges without deteriorating the performance of the model, while we can analyse the remaining edges for interpreting model predictions.
Published: 2020

14. Evaluating for Diversity in Question Generation over Text

Author: Schlichtkrull, Michael Sejr and Cheng, Weiwei
Subjects: Computer Science - Computation and Language, Statistics - Machine Learning
Abstract: Generating diverse and relevant questions over text is a task with widespread applications. We argue that commonly-used evaluation metrics such as BLEU and METEOR are not suitable for this task due to the inherent diversity of reference questions, and propose a scheme for extending conventional metrics to reflect diversity. We furthermore propose a variational encoder-decoder model for this task. We show through automatic and human evaluation that our variational model improves diversity without loss of quality, and demonstrate how our evaluation scheme reflects this improvement.
Published: 2020

15. How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking

Author: De Cao, Nicola, Schlichtkrull, Michael, Aziz, Wilker, and Titov, Ivan
Subjects: Computer Science - Computation and Language, Statistics - Machine Learning
Abstract: Attribution methods assess the contribution of inputs to the model prediction. One way to do so is erasure: a subset of inputs is considered irrelevant if it can be removed without affecting the prediction. Though conceptually simple, erasure's objective is intractable and approximate search remains expensive with modern deep NLP models. Erasure is also susceptible to the hindsight bias: the fact that an input can be dropped does not mean that the model `knows' it can be dropped. The resulting pruning is over-aggressive and does not reflect how the model arrives at the prediction. To deal with these challenges, we introduce Differentiable Masking. DiffMask learns to mask-out subsets of the input while maintaining differentiability. The decision to include or disregard an input token is made with a simple model based on intermediate hidden layers of the analyzed model. First, this makes the approach efficient because we predict rather than search. Second, as with probing classifiers, this reveals what the network `knows' at the corresponding layers. This lets us not only plot attribution heatmaps but also analyze how decisions are formed across network layers. We use DiffMask to study BERT models on sentiment classification and question answering., Comment: Accepted at the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Source code available at https://github.com/nicola-decao/diffmask . 18 pages, 15 figures, 4 tables
Published: 2020

16. Modeling Relational Data with Graph Convolutional Networks

Author: Schlichtkrull, Michael, Kipf, Thomas N., Bloem, Peter, Berg, Rianne van den, Titov, Ivan, and Welling, Max
Subjects: Statistics - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Databases, Computer Science - Learning
Abstract: Knowledge graphs enable a wide variety of applications, including question answering and information retrieval. Despite the great effort invested in their creation and maintenance, even the largest (e.g., Yago, DBPedia or Wikidata) remain incomplete. We introduce Relational Graph Convolutional Networks (R-GCNs) and apply them to two standard knowledge base completion tasks: Link prediction (recovery of missing facts, i.e. subject-predicate-object triples) and entity classification (recovery of missing entity attributes). R-GCNs are related to a recent class of neural networks operating on graphs, and are developed specifically to deal with the highly multi-relational data characteristic of realistic knowledge bases. We demonstrate the effectiveness of R-GCNs as a stand-alone model for entity classification. We further show that factorization models for link prediction such as DistMult can be significantly improved by enriching them with an encoder model to accumulate evidence over multiple inference steps in the relational graph, demonstrating a large improvement of 29.8% on FB15k-237 over a decoder-only baseline.
Published: 2017

17. Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages

Author: Schlichtkrull, Michael Sejr and Søgaard, Anders
Subjects: Computer Science - Computation and Language
Abstract: In cross-lingual dependency annotation projection, information is often lost during transfer because of early decoding. We present an end-to-end graph-based neural network dependency parser that can be trained to reproduce matrices of edge scores, which can be directly projected across word alignments. We show that our approach to cross-lingual dependency parsing is not only simpler, but also achieves an absolute improvement of 2.25% averaged across 10 languages compared to the previous state of the art., Comment: To be published at EACL 2017
Published: 2017

18. Modeling Relational Data with Graph Convolutional Networks

Author: Schlichtkrull, Michael, Kipf, Thomas N., Bloem, Peter, van den Berg, Rianne, Titov, Ivan, Welling, Max, Hutchison, David, Series Editor, Kanade, Takeo, Series Editor, Kittler, Josef, Series Editor, Kleinberg, Jon M., Series Editor, Mattern, Friedemann, Series Editor, Mitchell, John C., Series Editor, Naor, Moni, Series Editor, Pandu Rangan, C., Series Editor, Steffen, Bernhard, Series Editor, Terzopoulos, Demetri, Series Editor, Tygar, Doug, Series Editor, Weikum, Gerhard, Series Editor, Gangemi, Aldo, editor, Navigli, Roberto, editor, Vidal, Maria-Esther, editor, Hitzler, Pascal, editor, Troncy, Raphaël, editor, Hollink, Laura, editor, Tordai, Anna, editor, and Alam, Mehwish, editor
Published: 2018
Full Text: View/download PDF

19. Modeling Relational Data with Graph Convolutional Networks

Author: Schlichtkrull, Michael, primary, Kipf, Thomas N., additional, Bloem, Peter, additional, van den Berg, Rianne, additional, Titov, Ivan, additional, and Welling, Max, additional
Published: 2018
Full Text: View/download PDF

20. Are Embedded Potatoes Still Vegetables? On the Limitations of WordNet Embeddings for Lexical Semantics

Author: Cheng, Xuyou, primary, Schlichtkrull, Michael, additional, and Emerson, Guy, additional
Published: 2023
Full Text: View/download PDF

21. The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who

Author: Schlichtkrull, Michael, primary, Ousidhoum, Nedjma, additional, and Vlachos, Andreas, additional
Published: 2023
Full Text: View/download PDF

22. Multimodal Automated Fact-Checking: A Survey

Author: Akhtar, Mubashara, primary, Schlichtkrull, Michael, additional, Guo, Zhijiang, additional, Cocarascu, Oana, additional, Simperl, Elena, additional, and Vlachos, Andreas, additional
Published: 2023
Full Text: View/download PDF

23. UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering

Author: Oguz, Barlas, Chen, Xilun, Karpukhin, Vladimir, Peshterliev, Stan, Okhonko, Dmytro, Schlichtkrull, Michael, Gupta, Sonal, Mehdad, Yashar, and Yih, Scott
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Computation and Language (cs.CL)
Abstract: We study open-domain question answering with structured, unstructured and semi-structured knowledge sources, including text, tables, lists and knowledge bases. Departing from prior work, we propose a unifying approach that homogenizes all sources by reducing them to text and applies the retriever-reader model which has so far been limited to text sources only. Our approach greatly improves the results on knowledge-base QA tasks by 11 points, compared to latest graph-based methods. More importantly, we demonstrate that our unified knowledge (UniK-QA) model is a simple and yet effective way to combine heterogeneous sources of knowledge, advancing the state-of-the-art results on two popular question answering benchmarks, NaturalQuestions and WebQuestions, by 3.5 and 2.6 points, respectively. The code of UniK-QA is available at: https://github.com/facebookresearch/UniK-QA., NAACL-HLT 2022 Findings
Published: 2022

24. A Survey on Automated Fact-Checking

Author: Guo, Zhijiang, primary, Schlichtkrull, Michael, additional, and Vlachos, Andreas, additional
Published: 2022
Full Text: View/download PDF

25. NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

Author: Min, Sewon, Boyd-Graber, Jordan, Alberti, Chris, Chen, Danqi, Choi, Eunsol, Collins, Michael, Guu, Kelvin, Hajishirzi, Hannaneh, Lee, Kenton, Palomaki, Jennimaria, Raffel, Colin, Roberts, Adam, Kwiatkowski, Tom, Lewis, Patrick, Wu, Yuxiang, Küttler, Heinrich, Liu, Linqing, Minervini, Pasquale, Stenetorp, Pontus, Riedel, Sebastian, Yang, Sohee, Seo, Minjoon, Izacard, Gautier, Petroni, Fabio, Hosseini, Lucas, De Cao, Nicola, Grave, Edouard, Yamada, Ikuya, Shimaoka, Sonse, Suzuki, Masatoshi, Miyawaki, Shumpei, Sato, Shun, Takahashi, Ryo, Suzuki, Jun, Fajcik, Martin, Docekal, Martin, Ondrej, Karel, Smrz, Pavel, Cheng, Hao, Shen, Yelong, Liu, Xiaodong, He, Pengcheng, Chen, Weizhu, Gao, Jianfeng, Oguz, Barlas, Chen, Xilun, Karpukhin, Vladimir, Peshterliev, Stan, Okhonko, Dmytro, Schlichtkrull, Michael, Gupta, Sonal, Mehdad, Yashar, and Yih, Wen-tau
Subjects: FOS: Computer and information sciences, Computer Science - Computation and Language, Artificial Intelligence (cs.AI), Computer Science - Artificial Intelligence, Computation and Language (cs.CL)
Abstract: We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain question answering (QA), where systems take natural language questions as input and return natural language answers. The aim of the competition was to build systems that can predict correct answers while also satisfying strict on-disk memory budgets. These memory budgets were designed to encourage contestants to explore the trade-off between storing retrieval corpora or the parameters of learned models. In this report, we describe the motivation and organization of the competition, review the best submissions, and analyze system predictions to inform a discussion of evaluation for open-domain QA., Comment: 26 pages; Published in Proceedings of Machine Learning Research (PMLR), NeurIPS 2020 Competition and Demonstration Track
Published: 2021
Full Text: View/download PDF

26. The Fact Extraction and VERification Over Unstructured and Structuredinformation (FEVEROUS) Shared Task

Author: Aly, Rami, Guo, Zhijiang, Schlichtkrull, Michael Sejr, Thorne, James, Vlachos, Andreas, Christodoulopoulos, Christos, Cocarascu, Oana, and Mittal, Arpit
Published: 2021
Full Text: View/download PDF

27. Joint Verification and Reranking for Open Fact Checking Over Tables

Author: Schlichtkrull, Michael Sejr, primary, Karpukhin, Vladimir, additional, Oguz, Barlas, additional, Lewis, Mike, additional, Yih, Wen-tau, additional, and Riedel, Sebastian, additional
Published: 2021
Full Text: View/download PDF

28. The Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS) Shared Task

Author: Aly, Rami, primary, Guo, Zhijiang, additional, Schlichtkrull, Michael Sejr, additional, Thorne, James, additional, Vlachos, Andreas, additional, Christodoulopoulos, Christos, additional, Cocarascu, Oana, additional, and Mittal, Arpit, additional
Published: 2021
Full Text: View/download PDF

29. How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking

Author: De Cao, Nicola, primary, Schlichtkrull, Michael Sejr, additional, Aziz, Wilker, additional, and Titov, Ivan, additional
Published: 2020
Full Text: View/download PDF

30. Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages

Author: Schlichtkrull, Michael, primary and Søgaard, Anders, additional
Published: 2017
Full Text: View/download PDF

31. Cross-Lingual Dependency Parsing with Tensor-LSTM

Author: Søgaard, Anders Østerskov, Sejr Schlichtkrull, Michael, Søgaard, Anders Østerskov, and Sejr Schlichtkrull, Michael
Abstract: Jeg udvikler og evaluerer en komplet graf-baseret dependensparser bygget på en neural netværk arkitektur. Jeg formulerer dependensparsing som et matrixtransformationsproblem, og gør det derved muligt at implementere en fuldstændig parser ved at træne en dyb LSTM til at reproducere tensorer – Tensor-LSTM. Den fleksible model gør parseren idéel til at lære fra både træbanke, fra ikke-velformet data, og fra partielt annoteret data. Jeg eksperimenterer med to varianter af parseren. Den første er udviklet til at lære fra velformede træbanke, og bruge en matrix-baseret cross entropy tabsfunktion under træningsfasen. Parserens product er en matrix hvori rækkerne svarer til sandsynlighedsfordelinger over dependensrelationer. Den anden variant er udviklet til at lære fra matricer bestående af bedømmelser fra andre parsere, eller fra projekterede annotationer. Denne version trænes med mean square error som tabsfunktion, og giver forudsigelser svarende til reelle matricer. I begge tilfælde afkodes matricerne med Chu-Liu-Edmonds algoritme for at producere velformede træer. Jeg gennemgår først den teoretiske baggrund for parseren. Derefter evaluerer jeg cross entropy-varianten på en monolingual parsing-opgave. Jeg udfører en numerisk test på syv sprog fra Universal Dependencies datasættet, og sammenligner resultaterne mod MaltParser, TurboParser, og Parsito. Tensor-LSTM giver en gennemsnitlig unlabelled attachment score på 83.70, hvilket er lidt bedre end TurboParser, ækvivalent med MaltParser, og lidt dårligere end Parsito. Jeg benytter derefter mean squared error-varianten på en kryds-lingual parsing-opgave. I Agic et al., 2016 demonstrerede forfatterne, hvordan bedre resultater kan opnås ved at undgå et afkodningsskridt før de projekterede annotationer udregnes. Ved at udnytte Tensor-LSTM kan jeg omgå endnu et afkodningsskridt efter projektionsprocessen. For at demonstrere effektiviteten af denne metode evaluerer jeg den kryds-linguale Tensor- LSTM på ti sprog
Published: 2016

32. MSejrKu at SemEval-2016 Task 14: Taxonomy Enrichment by Evidence Ranking

Author: Schlichtkrull, Michael, primary and Martínez Alonso, Héctor, additional
Published: 2016
Full Text: View/download PDF

33. Learning affective projections for emoticons on Twitter

Author: Schlichtkrull, Michael Sejr, primary
Published: 2015
Full Text: View/download PDF

34. Evidence Selection as a Token-Level Prediction Task

Author: Dominik Stammbach, Aly, Rami, Christodoulopoulos, Christos, Cocarascu, Oana, Guo, Zhijiang, Mittal, Arpit, Schlichtkrull, Michael, Thorne, James, and Vlachos, Andreas
Abstract: In Automated Claim Verification, we retrieve evidence from a knowledge base to determine the veracity of a claim. Intuitively, the retrieval of the correct evidence plays a crucial role in this process. Often, evidence selection is tackled as a pairwise sentence classification task, i.e., we train a model to predict for each sentence individually whether it is evidence for a claim. In this work, we fine-tune document level transformers to extract all evidence from a Wikipedia document at once. We show that this approach performs better than a comparable model classifying sentences individually on all relevant evidence selection metrics in FEVER. Our complete pipeline building on this evidence selection procedure produces a new state-of-the-art result on FEVER, a popular claim verification benchmark., Proceedings of the Fourth Workshop on Fact Extraction and VERification (FEVER), ISBN:978-1-955917-03-2
Published: 2021

35. Stance Detection in German News Articles

Author: Laura Mascarell, Tatyana Ruzsics, Christian Schneebeli, Philippe Schlattner, Luca Campanella, Severin Klingler, Cristina Kadar, Aly, Rami, Christodoulopoulos, Christos, Cocarascu, Oana, Guo, Zhijiang, Mittal, Arpit, Schlichtkrull, Michael, Thorne, James, and Vlachos, Andreas
Subjects: GeneralLiterature_REFERENCE(e.g.,dictionaries,encyclopedias,glossaries)
Abstract: Proceedings of the Fourth Workshop on Fact Extraction and VERification (FEVER), ISBN:978-1-955917-03-2
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

35 results on '"Schlichtkrull, Michael"'

1. Ev2R: Evaluating Evidence Retrieval in Automated Fact-Checking

2. The Automated Verification of Textual Claims (AVeriTeC) Shared Task

3. Generating Media Background Checks for Automated Source Critical Reasoning

4. Document-level Claim Extraction and Decontextualisation for Fact-Checking

5. Multimodal Automated Fact-Checking: A Survey

6. AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web

7. The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who

8. A Survey on Automated Fact-Checking

9. FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

10. NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

11. Joint Verification and Reranking for Open Fact Checking Over Tables

12. UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering

13. Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking

14. Evaluating for Diversity in Question Generation over Text

15. How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking

16. Modeling Relational Data with Graph Convolutional Networks

17. Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages

18. Modeling Relational Data with Graph Convolutional Networks

19. Modeling Relational Data with Graph Convolutional Networks

20. Are Embedded Potatoes Still Vegetables? On the Limitations of WordNet Embeddings for Lexical Semantics

21. The Intended Uses of Automated Fact-Checking Artefacts: Why, How and Who

22. Multimodal Automated Fact-Checking: A Survey

23. UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering

24. A Survey on Automated Fact-Checking

25. NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

26. The Fact Extraction and VERification Over Unstructured and Structuredinformation (FEVEROUS) Shared Task

27. Joint Verification and Reranking for Open Fact Checking Over Tables

28. The Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS) Shared Task

29. How do Decisions Emerge across Layers in Neural Models? Interpretation with Differentiable Masking

30. Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages

31. Cross-Lingual Dependency Parsing with Tensor-LSTM

32. MSejrKu at SemEval-2016 Task 14: Taxonomy Enrichment by Evidence Ranking

33. Learning affective projections for emoticons on Twitter

34. Evidence Selection as a Token-Level Prediction Task

35. Stance Detection in German News Articles

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

35 results on '"Schlichtkrull, Michael"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources