Author: "Weissweiler, Leonie" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Weissweiler, Leonie"' showing total 22 results

Start Over Author "Weissweiler, Leonie"

22 results on '"Weissweiler, Leonie"'

1. Derivational Morphology Reveals Analogical Generalization in Large Language Models

Author: Hofmann, Valentin, Weissweiler, Leonie, Mortensen, David, Schütze, Hinrich, and Pierrehumbert, Janet
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: What mechanisms underlie linguistic generalization in large language models (LLMs)? This question has attracted considerable attention, with most studies analyzing the extent to which the language skills of LLMs resemble rules. As of yet, it is not known whether linguistic generalization in LLMs could equally well be explained as the result of analogical processes, which can be formalized as similarity operations on stored exemplars. A key shortcoming of prior research is its focus on linguistic phenomena with a high degree of regularity, for which rule-based and analogical approaches make the same predictions. Here, we instead examine derivational morphology, specifically English adjective nominalization, which displays notable variability. We introduce a new method for investigating linguistic generalization in LLMs: focusing on GPT-J, we fit cognitive models that instantiate rule-based and analogical learning to the LLM training data and compare their predictions on a set of nonce adjectives with those of the LLM, allowing us to draw direct conclusions regarding underlying mechanisms. As expected, rule-based and analogical models explain the predictions of GPT-J equally well for adjectives with regular nominalization patterns. However, for adjectives with variable nominalization patterns, the analogical model provides a much better match. Furthermore, GPT-J's behavior is sensitive to the individual word frequencies, even for regular forms, a behavior that is consistent with an analogical account of regular forms but not a rule-based one. These findings refute the hypothesis that GPT-J's linguistic generalization on adjective nominalization involves rules, suggesting similarity operations on stored exemplars as the underlying mechanism. Overall, our study suggests that analogical processes play a bigger role in the linguistic generalization of LLMs than previously thought.
Published: 2024

2. Models Can and Should Embrace the Communicative Nature of Human-Generated Math

Author: Boguraev, Sasha, Lipkin, Ben, Weissweiler, Leonie, and Mahowald, Kyle
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Math is constructed by people for people: just as natural language corpora reflect not just propositions but the communicative goals of language users, the math data that models are trained on reflects not just idealized mathematical entities but rich communicative intentions. While there are important advantages to treating math in a purely symbolic manner, we here hypothesize that there are benefits to treating math as situated linguistic communication and that language models are well suited for this goal, in ways that are not fully appreciated. We illustrate these points with two case studies. First, we ran an experiment in which we found that language models interpret the equals sign in a humanlike way -- generating systematically different word problems for the same underlying equation arranged in different ways. Second, we found that language models prefer proofs to be ordered in naturalistic ways, even though other orders would be logically equivalent. We advocate for AI systems that learn from and represent the communicative intentions latent in human-generated math.
Published: 2024

3. SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

Author: Zhao, Raoyuan, Köksal, Abdullatif, Liu, Yihong, Weissweiler, Leonie, Korhonen, Anna, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Traditional benchmarking in NLP typically involves using static held-out test sets. However, this approach often results in an overestimation of performance and lacks the ability to offer comprehensive, interpretable, and dynamic assessments of NLP models. Recently, works like DynaBench (Kiela et al., 2021) and CheckList (Ribeiro et al., 2020) have addressed these limitations through behavioral testing of NLP models with test types generated by a multistep human-annotated pipeline. Unfortunately, manually creating a variety of test types requires much human labor, often at prohibitive cost. In this work, we propose SYNTHEVAL, a hybrid behavioral testing framework that leverages large language models (LLMs) to generate a wide range of test types for a comprehensive evaluation of NLP models. SYNTHEVAL first generates sentences via LLMs using controlled generation, and then identifies challenging examples by comparing the predictions made by LLMs with task-specific NLP models. In the last stage, human experts investigate the challenging examples, manually design templates, and identify the types of failures the taskspecific models consistently exhibit. We apply SYNTHEVAL to two classification tasks, sentiment analysis and toxic language detection, and show that our framework is effective in identifying weaknesses of strong models on these tasks. We share our code in https://github.com/Loreley99/SynthEval_CheckList., Comment: EMNLP 2024 - Findings
Published: 2024

4. Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs

Author: Mortensen, David R., Izrailevitch, Valentina, Xiao, Yunze, Schütze, Hinrich, and Weissweiler, Leonie
Subjects: Computer Science - Computation and Language
Abstract: Lexical-syntactic flexibility, in the form of conversion (or zero-derivation) is a hallmark of English morphology. In conversion, a word with one part of speech is placed in a non-prototypical context, where it is coerced to behave as if it had a different part of speech. However, while this process affects a large part of the English lexicon, little work has been done to establish the degree to which language models capture this type of generalization. This paper reports the first study on the behavior of large language models with reference to conversion. We design a task for testing lexical-syntactic flexibility -- the degree to which models can generalize over words in a construction with a non-prototypical part of speech. This task is situated within a natural language inference paradigm. We test the abilities of five language models -- two proprietary models (GPT-3.5 and GPT-4), three open-source models (Mistral 7B, Falcon 40B, and Llama 2 70B). We find that GPT-4 performs best on the task, followed by GPT-3.5, but that the open source language models are also able to perform it and that the 7B parameter Mistral displays as little difference between its baseline performance on the natural language inference task and the non-prototypical syntactic category task, as the massive GPT-4., Comment: LREC-COLING 2024
Published: 2024

5. Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons

Author: Zhou, Shijia, Weissweiler, Leonie, He, Taiqi, Schütze, Hinrich, Mortensen, David R., and Levin, Lori
Subjects: Computer Science - Computation and Language
Abstract: In this paper, we make a contribution that can be understood from two perspectives: from an NLP perspective, we introduce a small challenge dataset for NLI with large lexical overlap, which minimises the possibility of models discerning entailment solely based on token distinctions, and show that GPT-4 and Llama 2 fail it with strong bias. We then create further challenging sub-tasks in an effort to explain this failure. From a Computational Linguistics perspective, we identify a group of constructions with three classes of adjectives which cannot be distinguished by surface features. This enables us to probe for LLM's understanding of these constructions in various ways, and we find that they fail in a variety of ways to distinguish between them, suggesting that they don't adequately represent their meaning or capture the lexical properties of phrasal heads., Comment: LREC-COLING 2024
Published: 2024

6. UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies

Author: Weissweiler, Leonie, Böbel, Nina, Guiller, Kirian, Herrera, Santiago, Scivetti, Wesley, Lorenzi, Arthur, Melnik, Nurit, Bhatia, Archna, Schütze, Hinrich, Levin, Lori, Zeldes, Amir, Nivre, Joakim, Croft, William, and Schneider, Nathan
Subjects: Computer Science - Computation and Language
Abstract: The Universal Dependencies (UD) project has created an invaluable collection of treebanks with contributions in over 140 languages. However, the UD annotations do not tell the full story. Grammatical constructions that convey meaning through a particular combination of several morphosyntactic elements -- for example, interrogative sentences with special markers and/or word orders -- are not labeled holistically. We argue for (i) augmenting UD annotations with a 'UCxn' annotation layer for such meaning-bearing grammatical constructions, and (ii) approaching this in a typologically informed way so that morphosyntactic strategies can be compared across languages. As a case study, we consider five construction families in ten languages, identifying instances of each construction in UD treebanks through the use of morphosyntactic patterns. In addition to findings regarding these particular constructions, our study yields important insights on methodology for describing and identifying constructions in language-general and language-particular ways, and lays the foundation for future constructional enrichment of UD treebanks., Comment: LREC-COLING 2024
Published: 2024

7. Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena

Author: Weissweiler, Leonie, Köksal, Abdullatif, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Argument Structure Constructions (ASCs) are one of the most well-studied construction groups, providing a unique opportunity to demonstrate the usefulness of Construction Grammar (CxG). For example, the caused-motion construction (CMC, ``She sneezed the foam off her cappuccino'') demonstrates that constructions must carry meaning, otherwise the fact that ``sneeze'' in this context causes movement cannot be explained. We form the hypothesis that this remains challenging even for state-of-the-art Large Language Models (LLMs), for which we devise a test based on substituting the verb with a prototypical motion verb. To be able to perform this test at statistically significant scale, in the absence of adequate CxG corpora, we develop a novel pipeline of NLP-assisted collection of linguistically annotated text. We show how dependency parsing and GPT-3.5 can be used to significantly reduce annotation cost and thus enable the annotation of rare phenomena at scale. We then evaluate GPT, Gemini, Llama2 and Mistral models for their understanding of the CMC using the newly collected corpus. We find that all models struggle with understanding the motion component that the CMC adds to a sentence.
Published: 2024

8. Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

Author: Weissweiler, Leonie, Hofmann, Valentin, Kantharuban, Anjali, Cai, Anna, Dutt, Ritam, Hengle, Amey, Kabra, Anubha, Kulkarni, Atharva, Vijayakumar, Abhishek, Yu, Haofei, Schütze, Hinrich, Oflazer, Kemal, and Mortensen, David R.
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LLMs) have recently reached an impressive level of linguistic capability, prompting comparisons with human language skills. However, there have been relatively few systematic inquiries into the linguistic capabilities of the latest generation of LLMs, and those studies that do exist (i) ignore the remarkable ability of humans to generalize, (ii) focus only on English, and (iii) investigate syntax or semantics and overlook other capabilities that lie at the heart of human language, like morphology. Here, we close these gaps by conducting the first rigorous analysis of the morphological capabilities of ChatGPT in four typologically varied languages (specifically, English, German, Tamil, and Turkish). We apply a version of Berko's (1958) wug test to ChatGPT, using novel, uncontaminated datasets for the four examined languages. We find that ChatGPT massively underperforms purpose-built systems, particularly in English. Overall, our results -- through the lens of morphology -- cast a new light on the linguistic capabilities of ChatGPT, suggesting that claims of human-like language skills are premature and misleading., Comment: EMNLP 2023
Published: 2023

9. How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives

Author: Wang, Xinpeng, Weissweiler, Leonie, Schütze, Hinrich, and Plank, Barbara
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Recently, various intermediate layer distillation (ILD) objectives have been shown to improve compression of BERT models via Knowledge Distillation (KD). However, a comprehensive evaluation of the objectives in both task-specific and task-agnostic settings is lacking. To the best of our knowledge, this is the first work comprehensively evaluating distillation objectives in both settings. We show that attention transfer gives the best performance overall. We also study the impact of layer choice when initializing the student from the teacher layers, finding a significant impact on the performance in task-specific distillation. For vanilla KD and hidden states transfer, initialisation with lower layers of the teacher gives a considerable improvement over higher layers, especially on the task of QNLI (up to an absolute percentage change of 17.8 in accuracy). Attention transfer behaves consistently under different initialisation settings. We release our code as an efficient transformer-based model distillation framework for further studies., Comment: ACL 2023
Published: 2023

10. Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs

Author: Liu, Yihong, Ye, Haotian, Weissweiler, Leonie, Pei, Renhao, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In comparative linguistics, colexification refers to the phenomenon of a lexical form conveying two or more distinct meanings. Existing work on colexification patterns relies on annotated word lists, limiting scalability and usefulness in NLP. In contrast, we identify colexification patterns of more than 2,000 concepts across 1,335 languages directly from an unannotated parallel corpus. We then propose simple and effective methods to build multilingual graphs from the colexification patterns: ColexNet and ColexNet+. ColexNet's nodes are concepts and its edges are colexifications. In ColexNet+, concept nodes are additionally linked through intermediate nodes, each representing an ngram in one of 1,334 languages. We use ColexNet+ to train $\overrightarrow{\mbox{ColexNet+}}$, high-quality multilingual embeddings that are well-suited for transfer learning. In our experiments, we first show that ColexNet achieves high recall on CLICS, a dataset of crosslingual colexifications. We then evaluate $\overrightarrow{\mbox{ColexNet+}}$ on roundtrip translation, sentence retrieval and sentence classification and show that our embeddings surpass several transfer learning baselines. This demonstrates the benefits of using colexification as a source of information in multilingual NLP., Comment: EMNLP 2023 Findings
Published: 2023

11. A Crosslingual Investigation of Conceptualization in 1335 Languages

Author: Liu, Yihong, Ye, Haotian, Weissweiler, Leonie, Wicke, Philipp, Pei, Renhao, Zangenfeind, Robert, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Languages differ in how they divide up the world into concepts and words; e.g., in contrast to English, Swahili has a single concept for `belly' and `womb'. We investigate these differences in conceptualization across 1,335 languages by aligning concepts in a parallel corpus. To this end, we propose Conceptualizer, a method that creates a bipartite directed alignment graph between source language concepts and sets of target language strings. In a detailed linguistic analysis across all languages for one concept (`bird') and an evaluation on gold standard data for 32 Swadesh concepts, we show that Conceptualizer has good alignment accuracy. We demonstrate the potential of research on conceptualization in NLP with two experiments. (1) We define crosslingual stability of a concept as the degree to which it has 1-1 correspondences across languages, and show that concreteness predicts stability. (2) We represent each language by its conceptualization pattern for 83 concepts, and define a similarity measure on these representations. The resulting measure for the conceptual similarity of two languages is complementary to standard genealogical, typological, and surface similarity measures. For four out of six language families, we can assign languages to their correct family based on conceptual similarity with accuracy between 54% and 87%., Comment: ACL 2023
Published: 2023

12. Construction Grammar Provides Unique Insight into Neural Language Models

Author: Weissweiler, Leonie, He, Taiqi, Otani, Naoki, Mortensen, David R., Levin, Lori, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Construction Grammar (CxG) has recently been used as the basis for probing studies that have investigated the performance of large pretrained language models (PLMs) with respect to the structure and meaning of constructions. In this position paper, we make suggestions for the continuation and augmentation of this line of research. We look at probing methodology that was not designed with CxG in mind, as well as probing methodology that was designed for specific constructions. We analyse selected previous work in detail, and provide our view of the most important challenges and research questions that this promising new field faces., Comment: GURT 2023
Published: 2023

13. The Better Your Syntax, the Better Your Semantics? Probing Pretrained Language Models for the English Comparative Correlative

Author: Weissweiler, Leonie, Hofmann, Valentin, Köksal, Abdullatif, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: Construction Grammar (CxG) is a paradigm from cognitive linguistics emphasising the connection between syntax and semantics. Rather than rules that operate on lexical items, it posits constructions as the central building blocks of language, i.e., linguistic units of different granularity that combine syntax and semantics. As a first step towards assessing the compatibility of CxG with the syntactic and semantic knowledge demonstrated by state-of-the-art pretrained language models (PLMs), we present an investigation of their capability to classify and understand one of the most commonly studied constructions, the English comparative correlative (CC). We conduct experiments examining the classification accuracy of a syntactic probe on the one hand and the models' behaviour in a semantic application task on the other, with BERT, RoBERTa, and DeBERTa as the example PLMs. Our results show that all three investigated PLMs are able to recognise the structure of the CC but fail to use its meaning. While human-like performance of PLMs on many NLP tasks has been alleged, this indicates that PLMs still suffer from substantial shortcomings in central domains of linguistic knowledge., Comment: EMNLP 2022
Published: 2022

14. CaMEL: Case Marker Extraction without Labels

Author: Weissweiler, Leonie, Hofmann, Valentin, Sabet, Masoud Jalili, and Schütze, Hinrich
Subjects: Computer Science - Computation and Language
Abstract: We introduce CaMEL (Case Marker Extraction without Labels), a novel and challenging task in computational morphology that is especially relevant for low-resource languages. We propose a first model for CaMEL that uses a massively multilingual corpus to extract case markers in 83 languages based only on a noun phrase chunker and an alignment system. To evaluate CaMEL, we automatically construct a silver standard from UniMorph. The case markers extracted by our model can be used to detect and visualise similarities and differences between the case systems of different languages as well as to annotate fine-grained deep cases in languages in which they are not overtly marked., Comment: ACL 2022
Published: 2022

15. Developing a Stemmer for German Based on a Comparative Analysis of Publicly Available Stemmers

Author: Weissweiler, Leonie, Fraser, Alexander, Hutchison, David, Series editor, Kanade, Takeo, Series editor, Kittler, Josef, Series editor, Kleinberg, Jon M., Series editor, Mattern, Friedemann, Series editor, Mitchell, John C., Series editor, Naor, Moni, Series editor, Pandu Rangan, C., Series editor, Steffen, Bernhard, Series editor, Terzopoulos, Demetri, Series editor, Tygar, Doug, Series editor, Weikum, Gerhard, Series editor, Rehm, Georg, editor, and Declerck, Thierry, editor
Published: 2018
Full Text: View/download PDF

16. Explaining pretrained language models' understanding of linguistic structures using construction grammar

Author: Weissweiler, Leonie, primary, Hofmann, Valentin, additional, Köksal, Abdullatif, additional, and Schütze, Hinrich, additional
Published: 2023
Full Text: View/download PDF

17. Counting the Bugs in ChatGPT’s Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

Author: Weissweiler, Leonie, primary, Hofmann, Valentin, additional, Kantharuban, Anjali, additional, Cai, Anna, additional, Dutt, Ritam, additional, Hengle, Amey, additional, Kabra, Anubha, additional, Kulkarni, Atharva, additional, Vijayakumar, Abhishek, additional, Yu, Haofei, additional, Schuetze, Hinrich, additional, Oflazer, Kemal, additional, and Mortensen, David, additional
Published: 2023
Full Text: View/download PDF

18. A Crosslingual Investigation of Conceptualization in 1335 Languages

Author: Liu, Yihong, primary, Ye, Haotian, additional, Weissweiler, Leonie, additional, Wicke, Philipp, additional, Pei, Renhao, additional, Zangenfeind, Robert, additional, and Schütze, Hinrich, additional
Published: 2023
Full Text: View/download PDF

19. Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs

Author: Liu, Yihong, primary, Ye, Haotian, additional, Weissweiler, Leonie, additional, Pei, Renhao, additional, and Schuetze, Hinrich, additional
Published: 2023
Full Text: View/download PDF

20. How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives

Author: Wang, Xinpeng, primary, Weissweiler, Leonie, additional, Schütze, Hinrich, additional, and Plank, Barbara, additional
Published: 2023
Full Text: View/download PDF

21. The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative

Author: Weissweiler, Leonie, primary, Hofmann, Valentin, additional, Köksal, Abdullatif, additional, and Schütze, Hinrich, additional
Published: 2022
Full Text: View/download PDF

22. CaMEL: Case Marker Extraction without Labels

Author: Weissweiler, Leonie, primary, Hofmann, Valentin, additional, Jalili Sabet, Masoud, additional, and Schuetze, Hinrich, additional
Published: 2022
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

22 results on '"Weissweiler, Leonie"'

1. Derivational Morphology Reveals Analogical Generalization in Large Language Models

2. Models Can and Should Embrace the Communicative Nature of Human-Generated Math

3. SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists

4. Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs

5. Constructions Are So Difficult That Even Large Language Models Get Them Right for the Wrong Reasons

6. UCxn: Typologically Informed Annotation of Constructions Atop Universal Dependencies

7. Hybrid Human-LLM Corpus Construction and LLM Evaluation for Rare Linguistic Phenomena

8. Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

9. How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives

10. Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs

11. A Crosslingual Investigation of Conceptualization in 1335 Languages

12. Construction Grammar Provides Unique Insight into Neural Language Models

13. The Better Your Syntax, the Better Your Semantics? Probing Pretrained Language Models for the English Comparative Correlative

14. CaMEL: Case Marker Extraction without Labels

15. Developing a Stemmer for German Based on a Comparative Analysis of Publicly Available Stemmers

16. Explaining pretrained language models' understanding of linguistic structures using construction grammar

17. Counting the Bugs in ChatGPT’s Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model

18. A Crosslingual Investigation of Conceptualization in 1335 Languages

19. Crosslingual Transfer Learning for Low-Resource Languages Based on Multilingual Colexification Graphs

20. How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives

21. The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative

22. CaMEL: Case Marker Extraction without Labels

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

22 results on '"Weissweiler, Leonie"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources