Author: "A A, Tsvetkov" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"A A, Tsvetkov"' showing total 15,728 results

Start Over Author "A A, Tsvetkov"

15,728 results on '"A A, Tsvetkov"'

1. ComPO: Community Preferences for Language Model Personalization

Author: Kumar, Sachin, Park, Chan Young, Tsvetkov, Yulia, Smith, Noah A., and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language
Abstract: Conventional algorithms for training language models (LMs) with human feedback rely on preferences that are assumed to account for an "average" user, disregarding subjectivity and finer-grained variations. Recent studies have raised concerns that aggregating such diverse and often contradictory human feedback to finetune models results in generic models that generate outputs not preferred by many user groups, as they tend to average out styles and norms. To address this issue, we draw inspiration from recommendation systems and propose ComPO, a method to personalize preference optimization in LMs by contextualizing the probability distribution of model outputs with the preference provider. Focusing on group-level preferences rather than individuals, we collect and release ComPRed, a question answering dataset with community-level preferences from Reddit. This dataset facilitates studying diversity in preferences without incurring privacy concerns associated with individual feedback. Our experiments reveal that conditioning language models on a community identifier (i.e., subreddit name) during preference tuning substantially enhances model performance. Conversely, replacing this context with random subreddit identifiers significantly diminishes performance, highlighting the effectiveness of our approach in tailoring responses to communities' preferences.
Published: 2024

2. Model Swarms: Collaborative Search to Adapt LLM Experts via Swarm Intelligence

Author: Feng, Shangbin, Wang, Zifeng, Wang, Yike, Ebrahimi, Sayna, Palangi, Hamid, Miculicich, Lesly, Kulshrestha, Achin, Rauschmayr, Nathalie, Choi, Yejin, Tsvetkov, Yulia, Lee, Chen-Yu, and Pfister, Tomas
Subjects: Computer Science - Computation and Language
Abstract: We propose Model Swarms, a collaborative search algorithm to adapt LLMs via swarm intelligence, the collective behavior guiding individual systems. Specifically, Model Swarms starts with a pool of LLM experts and a utility function. Guided by the best-found checkpoints across models, diverse LLM experts collaboratively move in the weight space and optimize a utility function representing model adaptation objectives. Compared to existing model composition approaches, Model Swarms offers tuning-free model adaptation, works in low-data regimes with as few as 200 examples, and does not require assumptions about specific experts in the swarm or how they should be composed. Extensive experiments demonstrate that Model Swarms could flexibly adapt LLM experts to a single task, multi-task domains, reward models, as well as diverse human interests, improving over 12 model composition baselines by up to 21.0% across tasks and contexts. Further analysis reveals that LLM experts discover previously unseen capabilities in initial checkpoints and that Model Swarms enable the weak-to-strong transition of experts through the collaborative search process.
Published: 2024

3. Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only

Author: Yao, Jihan, Ding, Wenxuan, Feng, Shangbin, Wang, Lucy Lu, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: In the absence of abundant reliable annotations for challenging tasks and contexts, how can we expand the frontier of LLM capabilities with potentially wrong answers? We focus on two research questions: (1) Can LLMs generate reliable preferences among wrong options? And if so, (2) Would alignment with such wrong-over-wrong preferences be helpful? We employ methods based on self-consistency, token probabilities, and LLM-as-a-judge to elicit wrong-over-wrong preferences, and fine-tune language models with preference optimization approaches using these synthesized preferences. Extensive experiments with seven LLMs and eight datasets demonstrate that (1) LLMs do have preliminary capability in distinguishing various shades of wrong, achieving up to 20.9% higher performance than random guess; (2) Alignment with wrong-over-wrong preferences helps LLMs to produce less wrong and sometimes even outright correct answers, while overall improving model calibration.
Published: 2024

4. Biased AI can Influence Political Decision-Making

Author: Fisher, Jillian, Feng, Shangbin, Aron, Robert, Richardson, Thomas, Choi, Yejin, Fisher, Daniel W., Pan, Jennifer, Tsvetkov, Yulia, and Reinecke, Katharina
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Artificial Intelligence
Abstract: As modern AI models become integral to everyday tasks, concerns about their inherent biases and their potential impact on human decision-making have emerged. While bias in models are well-documented, less is known about how these biases influence human decisions. This paper presents two interactive experiments investigating the effects of partisan bias in AI language models on political decision-making. Participants interacted freely with either a biased liberal, biased conservative, or unbiased control model while completing political decision-making tasks. We found that participants exposed to politically biased models were significantly more likely to adopt opinions and make decisions aligning with the AI's bias, regardless of their personal political partisanship. However, we also discovered that prior knowledge about AI could lessen the impact of the bias, highlighting the possible importance of AI education for robust bias mitigation. Our findings not only highlight the critical effects of interacting with biased AI and its ability to impact public discourse and political conduct, but also highlights potential techniques for mitigating these risks in the future.
Published: 2024

5. Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia

Author: Samir, Farhan, Park, Chan Young, Field, Anjalie, Shwartz, Vered, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: To explain social phenomena and identify systematic biases, much research in computational social science focuses on comparative text analyses. These studies often rely on coarse corpus-level statistics or local word-level analyses, mainly in English. We introduce the InfoGap method -- an efficient and reliable approach to locating information gaps and inconsistencies in articles at the fact level, across languages. We evaluate InfoGap by analyzing LGBT people's portrayals, across 2.7K biography pages on English, Russian, and French Wikipedias. We find large discrepancies in factual coverage across the languages. Moreover, our analysis reveals that biographical facts carrying negative connotations are more likely to be highlighted in Russian Wikipedia. Crucially, InfoGap both facilitates large scale analyses, and pinpoints local document- and fact-level information gaps, laying a new foundation for targeted and nuanced comparative language analysis at scale., Comment: 15 pages, 3 figures. To appear at EMNLP'24
Published: 2024

6. CulturalBench: a Robust, Diverse and Challenging Benchmark on Measuring the (Lack of) Cultural Knowledge of LLMs

Author: Chiu, Yu Ying, Jiang, Liwei, Lin, Bill Yuchen, Park, Chan Young, Li, Shuyue Stella, Ravi, Sahithya, Bhatia, Mehar, Antoniak, Maria, Tsvetkov, Yulia, Shwartz, Vered, and Choi, Yejin
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: To make large language models (LLMs) more helpful across diverse cultures, it is essential to have effective cultural knowledge benchmarks to measure and track our progress. Effective benchmarks need to be robust, diverse, and challenging. We introduce CulturalBench: a set of 1,227 human-written and human-verified questions for effectively assessing LLMs' cultural knowledge, covering 45 global regions including the underrepresented ones like Bangladesh, Zimbabwe, and Peru. Questions - each verified by five independent annotators - span 17 diverse topics ranging from food preferences to greeting etiquettes. We evaluate models on two setups: CulturalBench-Easy and CulturalBench-Hard which share the same questions but asked differently. We find that LLMs are sensitive to such difference in setups (e.g., GPT-4o with 27.3% difference). Compared to human performance (92.6% accuracy), CulturalBench-Hard is more challenging for frontier LLMs with the best performing model (GPT-4o) at only 61.5% and the worst (Llama3-8b) at 21.4%. Moreover, we find that LLMs often struggle with tricky questions that have multiple correct answers (e.g., What utensils do the Chinese usually use?), revealing a tendency to converge to a single answer. Our results also indicate that OpenAI GPT-4o substantially outperform other proprietary and open source models in questions related to all but one region (Oceania). Nonetheless, all models consistently underperform on questions related to South America and the Middle East., Comment: Preprint. Under review
Published: 2024

7. Evidence for spin droplets (ferrons) formation in the heavy fermion metal CeB$_6$ with dynamic charge stripes

Author: Azarevich, A. N., Khrykina, O. N., Bolotina, N. B., Gridchina, V. G., Bogach, A. V., Demishev, S. V., Krasnorussky, V. N., Gavrilkin, S. Yu., Tsvetkov, A. Yu., Shitsevalova, N. Yu., Voronov, V. V., Kugel, K. I., Rakhmanov, A. L., Gabani, S., Flachbart, K., and Sluchanko, N. E.
Subjects: Condensed Matter - Strongly Correlated Electrons
Abstract: The presented studies of resistivity (R), thermal conductivity (k) and specific heat (C) at low temperature 1.8-7 K in magnetic field up to 90 kOe made it possible to detect for the first time the exponential field dependences R(H), 1/k(H), $C(H) \sim \exp(-{\mu}_{\rm eff}H/kBT)$ of the charge transport and thermal characteristics in the so-called antiferroquadrupole (AFQ) phase of the archetypal heavy-fermion CeB$_6$ hexaboride. From magnetoresistance measurements it is shown that in the AFQ state the effective magnetic moment varies in the range ${\mu}_{\rm eff}(T) = 1.4$-1.9$\mu$B, and its value is very close to ${\mu}_{\rm eff}(\tau)(T) = 2{\mu}$B, derived from the field dependence of the relaxation time $|tau(H)$ observed in the heat capacity and thermal conductivity experiments. The phenomenological model proposed here allowed us to attribute the magnetic moments to spin droplets (ferrons), that appear in the bulk AFQ phase of CeB$_6$ crystals. The relevant electron phase separation at the nanoscale, manifested by dynamic charge stripes, that leads to the formation of ferrons, was revealed from the analysis of low-temperature X-ray diffraction experiments using the maximum entropy method. We argue that the Jahn-Teller collective mode of B$_6$ clusters is responsible for the charge stripe formation, which subsequently induces transverse quasi-local vibrations of Ce ions in the form of pairs and triples. These lead to 4f-5d spin fluctuations providing spin-polarons (ferrons) in the CeB$_6$ matrix., Comment: 25 pages, 9 figires and 2 tables
Published: 2024

8. JPEG-LM: LLMs as Image Generators with Canonical Codec Representations

Author: Han, Xiaochuang, Ghazvininejad, Marjan, Koh, Pang Wei, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language, Computer Science - Computer Vision and Pattern Recognition, Computer Science - Machine Learning
Abstract: Recent work in image and video generation has been adopting the autoregressive LLM architecture due to its generality and potentially easy integration into multi-modal systems. The crux of applying autoregressive training in language generation to visual generation is discretization -- representing continuous data like images and videos as discrete tokens. Common methods of discretizing images and videos include modeling raw pixel values, which are prohibitively lengthy, or vector quantization, which requires convoluted pre-hoc training. In this work, we propose to directly model images and videos as compressed files saved on computers via canonical codecs (e.g., JPEG, AVC/H.264). Using the default Llama architecture without any vision-specific modifications, we pretrain JPEG-LM from scratch to generate images (and AVC-LM to generate videos as a proof of concept), by directly outputting compressed file bytes in JPEG and AVC formats. Evaluation of image generation shows that this simple and straightforward approach is more effective than pixel-based modeling and sophisticated vector quantization baselines (on which our method yields a 31% reduction in FID). Our analysis shows that JPEG-LM has an especial advantage over vector quantization models in generating long-tail visual elements. Overall, we show that using canonical codec representations can help lower the barriers between language generation and visual generation, facilitating future research on multi-modal language/image/video LLMs.
Published: 2024

9. Know Your Limits: A Survey of Abstention in Large Language Models

Author: Wen, Bingbing, Yao, Jihan, Feng, Shangbin, Xu, Chenjun, Tsvetkov, Yulia, Howe, Bill, and Wang, Lucy Lu
Subjects: Computer Science - Computation and Language
Abstract: Abstention, the refusal of large language models (LLMs) to provide an answer, is increasingly recognized for its potential to mitigate hallucinations and enhance safety in LLM systems. In this survey, we introduce a framework to examine abstention from three perspectives: the query, the model, and human values. We organize the literature on abstention methods, benchmarks, and evaluation metrics using this framework, and discuss merits and limitations of prior work. We further identify and motivate areas for future work, centered around whether abstention can be achieved as a meta-capability that transcends specific tasks or domains, while still providing opportunities to optimize abstention abilities based on context., Comment: preprint
Published: 2024

10. MAGNET: Improving the Multilingual Fairness of Language Models with Adaptive Gradient-Based Tokenization

Author: Ahia, Orevaoghene, Kumar, Sachin, Gonen, Hila, Hoffman, Valentin, Limisiewicz, Tomasz, Tsvetkov, Yulia, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: In multilingual settings, non-Latin scripts and low-resource languages are usually disadvantaged in terms of language models' utility, efficiency, and cost. Specifically, previous studies have reported multiple modeling biases that the current tokenization algorithms introduce to non-Latin script languages, the main one being over-segmentation. In this work, we propose MAGNET; multilingual adaptive gradient-based tokenization to reduce over-segmentation via adaptive gradient-based subword tokenization. MAGNET learns to predict segment boundaries between byte tokens in a sequence via sub-modules within the model, which act as internal boundary predictors (tokenizers). Previous gradient-based tokenization methods aimed for uniform compression across sequences by integrating a single boundary predictor during training and optimizing it end-to-end through stochastic reparameterization alongside the next token prediction objective. However, this approach still results in over-segmentation for non-Latin script languages in multilingual settings. In contrast, MAGNET offers a customizable architecture where byte-level sequences are routed through language-script-specific predictors, each optimized for its respective language script. This modularity enforces equitable segmentation granularity across different language scripts compared to previous methods. Through extensive experiments, we demonstrate that in addition to reducing segmentation disparities, MAGNET also enables faster language modelling and improves downstream utility.
Published: 2024

11. The Art of Saying No: Contextual Noncompliance in Language Models

Author: Brahman, Faeze, Kumar, Sachin, Balachandran, Vidhisha, Dasigi, Pradeep, Pyatkin, Valentina, Ravichander, Abhilasha, Wiegreffe, Sarah, Dziri, Nouha, Chandu, Khyathi, Hessel, Jack, Tsvetkov, Yulia, Smith, Noah A., Choi, Yejin, and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction
Abstract: Chat-based language models are designed to be helpful, yet they should not comply with every user request. While most existing work primarily focuses on refusal of "unsafe" queries, we posit that the scope of noncompliance should be broadened. We introduce a comprehensive taxonomy of contextual noncompliance describing when and how models should not comply with user requests. Our taxonomy spans a wide range of categories including incomplete, unsupported, indeterminate, and humanizing requests (in addition to unsafe requests). To test noncompliance capabilities of language models, we use this taxonomy to develop a new evaluation suite of 1000 noncompliance prompts. We find that most existing models show significantly high compliance rates in certain previously understudied categories with models like GPT-4 incorrectly complying with as many as 30% of requests. To address these gaps, we explore different training strategies using a synthetically-generated training set of requests and expected noncompliant responses. Our experiments demonstrate that while direct finetuning of instruction-tuned models can lead to both over-refusal and a decline in general capabilities, using parameter efficient methods like low rank adapters helps to strike a good balance between appropriate noncompliance and other capabilities.
Published: 2024

12. ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions

Author: Park, Chan Young, Li, Shuyue Stella, Jung, Hayoung, Volkova, Svitlana, Mitra, Tanushree, Jurgens, David, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: This study introduces ValueScope, a framework leveraging language models to quantify social norms and values within online communities, grounded in social science perspectives on normative structures. We employ ValueScope to dissect and analyze linguistic and stylistic expressions across 13 Reddit communities categorized under gender, politics, science, and finance. Our analysis provides a quantitative foundation showing that even closely related communities exhibit remarkably diverse norms. This diversity supports existing theories and adds a new dimension--community preference--to understanding community interactions. ValueScope not only delineates differing social norms among communities but also effectively traces their evolution and the influence of significant external events like the U.S. presidential elections and the emergence of new sub-communities. The framework thus highlights the pivotal role of social norms in shaping online interactions, presenting a substantial advance in both the theory and application of social norm studies in digital spaces., Comment: First three authors contributed equally. Accepted at EMNLP Findings 2024
Published: 2024

13. Voices Unheard: NLP Resources and Models for Yor\`ub\'a Regional Dialects

Author: Ahia, Orevaoghene, Aremu, Anuoluwapo, Abagyan, Diana, Gonen, Hila, Adelani, David Ifeoluwa, Abolade, Daud, Smith, Noah A., and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: Yor\`ub\'a an African language with roughly 47 million speakers encompasses a continuum with several dialects. Recent efforts to develop NLP technologies for African languages have focused on their standard dialects, resulting in disparities for dialects and varieties for which there are little to no resources or tools. We take steps towards bridging this gap by introducing a new high-quality parallel text and speech corpus YOR\`ULECT across three domains and four regional Yor\`ub\'a dialects. To develop this corpus, we engaged native speakers, travelling to communities where these dialects are spoken, to collect text and speech data. Using our newly created corpus, we conducted extensive experiments on (text) machine translation, automatic speech recognition, and speech-to-text translation. Our results reveal substantial performance disparities between standard Yor\`ub\'a and the other dialects across all tasks. However, we also show that with dialect-adaptive finetuning, we are able to narrow this gap. We believe our dataset and experimental analysis will contribute greatly to developing NLP tools for Yor\`ub\'a and its dialects, and potentially for other African languages, by improving our understanding of existing challenges and offering a high-quality dataset for further development. We release YOR\`ULECT dataset and models publicly under an open license.
Published: 2024

14. Can LLM Graph Reasoning Generalize beyond Pattern Memorization?

Author: Zhang, Yizhuo, Wang, Heng, Feng, Shangbin, Tan, Zhaoxuan, Han, Xiaochuang, He, Tianxing, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language, I.2.7
Abstract: Large language models (LLMs) demonstrate great potential for problems with implicit graphical structures, while recent works seek to enhance the graph reasoning capabilities of LLMs through specialized instruction tuning. The resulting 'graph LLMs' are evaluated with in-distribution settings only, thus it remains underexplored whether LLMs are learning generalizable graph reasoning skills or merely memorizing patterns in the synthetic training data. To this end, we propose the NLGift benchmark, an evaluation suite of LLM graph reasoning generalization: whether LLMs could go beyond semantic, numeric, structural, reasoning patterns in the synthetic training data and improve utility on real-world graph-based tasks. Extensive experiments with two LLMs across four graph reasoning tasks demonstrate that while generalization on simple patterns (semantic, numeric) is somewhat satisfactory, LLMs struggle to generalize across reasoning and real-world patterns, casting doubt on the benefit of synthetic graph tuning for real-world tasks with underlying network structures. We explore three strategies to improve LLM graph reasoning generalization, and we find that while post-training alignment is most promising for real-world tasks, empowering LLM graph reasoning to go beyond pattern memorization remains an open research question., Comment: 17 pages, 6 figures. EMNLP 2024 Findings. Code and data is publicly available at https://github.com/MatthewYZhang/NLGift
Published: 2024

15. Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration

Author: Feng, Shangbin, Sorensen, Taylor, Liu, Yuhan, Fisher, Jillian, Park, Chan Young, Choi, Yejin, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: While existing alignment paradigms have been integral in developing large language models (LLMs), LLMs often learn an averaged human preference and struggle to model diverse preferences across cultures, demographics, and communities. We propose Modular Pluralism, a modular framework based on multi-LLM collaboration for pluralistic alignment: it "plugs into" a base LLM a pool of smaller but specialized community LMs, where models collaborate in distinct modes to flexibility support three modes of pluralism: Overton, steerable, and distributional. Modular Pluralism is uniquely compatible with black-box LLMs and offers the modular control of adding new community LMs for previously underrepresented communities. We evaluate Modular Pluralism with six tasks and four datasets featuring questions/instructions with value-laden and perspective-informed responses. Extensive experiments demonstrate that Modular Pluralism advances the three pluralism objectives across six black-box and open-source LLMs. Further analysis reveals that LLMs are generally faithful to the inputs from smaller community LLMs, allowing seamless patching by adding a new community LM to better cover previously underrepresented communities., Comment: EMNLP 2024
Published: 2024

16. Teaching LLMs to Abstain across Languages via Multilingual Feedback

Author: Feng, Shangbin, Shi, Weijia, Wang, Yike, Ding, Wenxuan, Ahia, Orevaoghene, Li, Shuyue Stella, Balachandran, Vidhisha, Sitaram, Sunayana, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: Multilingual LLMs often have knowledge disparities across languages, with larger gaps in under-resourced languages. Teaching LLMs to abstain in the face of knowledge gaps is thus a promising strategy to mitigate hallucinations in multilingual settings. However, previous studies on LLM abstention primarily focus on English; we find that directly applying existing solutions beyond English results in up to 20.5% performance gaps between high and low-resource languages, potentially due to LLMs' drop in calibration and reasoning beyond a few resource-rich languages. To this end, we propose strategies to enhance LLM abstention by learning from multilingual feedback, where LLMs self-reflect on proposed answers in one language by generating multiple feedback items in related languages: we show that this helps identifying the knowledge gaps across diverse languages, cultures, and communities. Extensive experiments demonstrate that our multilingual feedback approach outperforms various strong baselines, achieving up to 9.2% improvement for low-resource languages across three black-box and open models on three datasets, featuring open-book, closed-book, and commonsense QA. Further analysis reveals that multilingual feedback is both an effective and a more equitable abstain strategy to serve diverse language speakers, and cultural factors have great impact on language selection and LLM abstention behavior, highlighting future directions for multilingual and multi-cultural reliable language modeling., Comment: EMNLP 2024
Published: 2024

17. MediQ: Question-Asking LLMs and a Benchmark for Reliable Interactive Clinical Reasoning

Author: Li, Shuyue Stella, Balachandran, Vidhisha, Feng, Shangbin, Ilgen, Jonathan S., Pierson, Emma, Koh, Pang Wei, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Users typically engage with LLMs interactively, yet most existing benchmarks evaluate them in a static, single-turn format, posing reliability concerns in interactive scenarios. We identify a key obstacle towards reliability: LLMs are trained to answer any question, even with incomplete context or insufficient knowledge. In this paper, we propose to change the static paradigm to an interactive one, develop systems that proactively ask questions to gather more information and respond reliably, and introduce an benchmark - MediQ - to evaluate question-asking ability in LLMs. MediQ simulates clinical interactions consisting of a Patient System and an adaptive Expert System; with potentially incomplete initial information, the Expert refrains from making diagnostic decisions when unconfident, and instead elicits missing details via follow-up questions. We provide a pipeline to convert single-turn medical benchmarks into an interactive format. Our results show that directly prompting state-of-the-art LLMs to ask questions degrades performance, indicating that adapting LLMs to proactive information-seeking settings is nontrivial. We experiment with abstention strategies to better estimate model confidence and decide when to ask questions, improving diagnostic accuracy by 22.3%; however, performance still lags compared to an (unrealistic in practice) upper bound with complete information upfront. Further analyses show improved interactive performance with filtering irrelevant contexts and reformatting conversations. Overall, we introduce a novel problem towards LLM reliability, an interactive MediQ benchmark and a novel question-asking system, and highlight directions to extend LLMs' information-seeking abilities in critical domains., Comment: 29 pages, 12 figures
Published: 2024

18. Learning Syntax Without Planting Trees: Understanding When and Why Transformers Generalize Hierarchically

Author: Ahuja, Kabir, Balachandran, Vidhisha, Panwar, Madhur, He, Tianxing, Smith, Noah A., Goyal, Navin, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: Transformers trained on natural language data have been shown to learn its hierarchical structure and generalize to sentences with unseen syntactic structures without explicitly encoding any structural bias. In this work, we investigate sources of inductive bias in transformer models and their training that could cause such generalization behavior to emerge. We extensively experiment with transformer models trained on multiple synthetic datasets and with different training objectives and show that while other objectives e.g. sequence-to-sequence modeling, prefix language modeling, often failed to lead to hierarchical generalization, models trained with the language modeling objective consistently learned to generalize hierarchically. We then conduct pruning experiments to study how transformers trained with the language modeling objective encode hierarchical structure. When pruned, we find joint existence of subnetworks within the model with different generalization behaviors (subnetworks corresponding to hierarchical structure and linear order). Finally, we take a Bayesian perspective to further uncover transformers' preference for hierarchical generalization: We establish a correlation between whether transformers generalize hierarchically on a dataset and whether the simplest explanation of that dataset is provided by a hierarchical grammar compared to regular grammars exhibiting linear generalization., Comment: Code now available: https://github.com/kabirahuja2431/transformers-hg
Published: 2024

19. CulturalTeaming: AI-Assisted Interactive Red-Teaming for Challenging LLMs' (Lack of) Multicultural Knowledge

Author: Chiu, Yu Ying, Jiang, Liwei, Antoniak, Maria, Park, Chan Young, Li, Shuyue Stella, Bhatia, Mehar, Ravi, Sahithya, Tsvetkov, Yulia, Shwartz, Vered, and Choi, Yejin
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction
Abstract: Frontier large language models (LLMs) are developed by researchers and practitioners with skewed cultural backgrounds and on datasets with skewed sources. However, LLMs' (lack of) multicultural knowledge cannot be effectively assessed with current methods for developing benchmarks. Existing multicultural evaluations primarily rely on expensive and restricted human annotations or potentially outdated internet resources. Thus, they struggle to capture the intricacy, dynamics, and diversity of cultural norms. LLM-generated benchmarks are promising, yet risk propagating the same biases they are meant to measure. To synergize the creativity and expert cultural knowledge of human annotators and the scalability and standardizability of LLM-based automation, we introduce CulturalTeaming, an interactive red-teaming system that leverages human-AI collaboration to build truly challenging evaluation dataset for assessing the multicultural knowledge of LLMs, while improving annotators' capabilities and experiences. Our study reveals that CulturalTeaming's various modes of AI assistance support annotators in creating cultural questions, that modern LLMs fail at, in a gamified manner. Importantly, the increased level of AI assistance (e.g., LLM-generated revision hints) empowers users to create more difficult questions with enhanced perceived creativity of themselves, shedding light on the promises of involving heavier AI assistance in modern evaluation dataset creation procedures. Through a series of 1-hour workshop sessions, we gather CULTURALBENCH-V0.1, a compact yet high-quality evaluation dataset with users' red-teaming attempts, that different families of modern LLMs perform with accuracy ranging from 37.7% to 72.2%, revealing a notable gap in LLMs' multicultural proficiency., Comment: Preprint (under review)
Published: 2024

20. DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

Author: Faisal, Fahim, Ahia, Orevaoghene, Srivastava, Aarohi, Ahuja, Kabir, Chiang, David, Tsvetkov, Yulia, and Anastasopoulos, Antonios
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Language technologies should be judged on their usefulness in real-world use cases. An often overlooked aspect in natural language processing (NLP) research and evaluation is language variation in the form of non-standard dialects or language varieties (hereafter, varieties). Most NLP benchmarks are limited to standard language varieties. To fill this gap, we propose DIALECTBENCH, the first-ever large-scale benchmark for NLP on varieties, which aggregates an extensive set of task-varied variety datasets (10 text-level tasks covering 281 varieties). This allows for a comprehensive evaluation of NLP system performance on different language varieties. We provide substantial evidence of performance disparities between standard and non-standard language varieties, and we also identify language clusters with large performance divergence across tasks. We believe DIALECTBENCH provides a comprehensive view of the current state of NLP for language varieties and one step towards advancing it further. Code/data: https://github.com/ffaisal93/DialectBench, Comment: Equal contribution: Fahim Faisal, Orevaoghene Ahia
Published: 2024

21. Alpaca against Vicuna: Using LLMs to Uncover Memorization of LLMs

Author: Kassem, Aly M., Mahmoud, Omar, Mireshghallah, Niloofar, Kim, Hyunwoo, Tsvetkov, Yulia, Choi, Yejin, Saad, Sherif, and Rana, Santu
Subjects: Computer Science - Computation and Language
Abstract: In this paper, we introduce a black-box prompt optimization method that uses an attacker LLM agent to uncover higher levels of memorization in a victim agent, compared to what is revealed by prompting the target model with the training data directly, which is the dominant approach of quantifying memorization in LLMs. We use an iterative rejection-sampling optimization process to find instruction-based prompts with two main characteristics: (1) minimal overlap with the training data to avoid presenting the solution directly to the model, and (2) maximal overlap between the victim model's output and the training data, aiming to induce the victim to spit out training data. We observe that our instruction-based prompts generate outputs with 23.7% higher overlap with training data compared to the baseline prefix-suffix measurements. Our findings show that (1) instruction-tuned models can expose pre-training data as much as their base-models, if not more so, (2) contexts other than the original training data can lead to leakage, and (3) using instructions proposed by other LLMs can open a new avenue of automated attacks that we should further study and explore. The code can be found at https://github.com/Alymostafa/Instruction_based_attack .
Published: 2024

22. Autocollimation Optical Doppler Velocimeter: Velocity Measurement of Hard-to-Access Objects

Author: Pavlov, A. L., Pyrkov, Yu. N., and Tsvetkov, V. B.
Published: 2024
Full Text: View/download PDF

23. Gain-Switched Ytterbium Fiber Laser Operating at a Wavelength of 1127 nm

Author: Filatova, S. A., Kamynin, V. A., Makeeva, A. R., Rybaltovsky, A. A., Fale, A. E., Fedoseev, A. I., and Tsvetkov, V. B.
Published: 2024
Full Text: View/download PDF

24. How random immigration impacts order–chaos transformations and extinction in population dynamics

Author: Ryashko, Lev and Tsvetkov, Ivan
Published: 2024
Full Text: View/download PDF

25. Extracting Lexical Features from Dialects via Interpretable Dialect Classifiers

Author: Xie, Roy, Ahia, Orevaoghene, Tsvetkov, Yulia, and Anastasopoulos, Antonios
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Identifying linguistic differences between dialects of a language often requires expert knowledge and meticulous human analysis. This is largely due to the complexity and nuance involved in studying various dialects. We present a novel approach to extract distinguishing lexical features of dialects by utilizing interpretable dialect classifiers, even in the absence of human experts. We explore both post-hoc and intrinsic approaches to interpretability, conduct experiments on Mandarin, Italian, and Low Saxon, and experimentally demonstrate that our method successfully identifies key language-specific lexical features that contribute to dialectal variations., Comment: Code is available at https://github.com/ruoyuxie/interpretable_dialect_classifier
Published: 2024

26. Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Author: Wang, Yichen, Feng, Shangbin, Hou, Abe Bohan, Pu, Xiao, Shen, Chao, Liu, Xiaoming, Tsvetkov, Yulia, and He, Tianxing
Subjects: Computer Science - Computation and Language
Abstract: The widespread use of large language models (LLMs) is increasing the demand for methods that detect machine-generated text to prevent misuse. The goal of our study is to stress test the detectors' robustness to malicious attacks under realistic scenarios. We comprehensively study the robustness of popular machine-generated text detectors under attacks from diverse categories: editing, paraphrasing, prompting, and co-generating. Our attacks assume limited access to the generator LLMs, and we compare the performance of detectors on different attacks under different budget levels. Our experiments reveal that almost none of the existing detectors remain robust under all the attacks, and all detectors exhibit different loopholes. Averaging all detectors, the performance drops by 35% across all attacks. Further, we investigate the reasons behind these defects and propose initial out-of-the-box patches to improve robustness.
Published: 2024

27. DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

Author: Wan, Herun, Feng, Shangbin, Tan, Zhaoxuan, Wang, Heng, Tsvetkov, Yulia, and Luo, Minnan
Subjects: Computer Science - Computation and Language
Abstract: Large language models are limited by challenges in factuality and hallucinations to be directly employed off-the-shelf for judging the veracity of news articles, where factual accuracy is paramount. In this work, we propose DELL that identifies three key stages in misinformation detection where LLMs could be incorporated as part of the pipeline: 1) LLMs could \emph{generate news reactions} to represent diverse perspectives and simulate user-news interaction networks; 2) LLMs could \emph{generate explanations} for proxy tasks (e.g., sentiment, stance) to enrich the contexts of news articles and produce experts specializing in various aspects of news understanding; 3) LLMs could \emph{merge task-specific experts} and provide an overall prediction by incorporating the predictions and confidence scores of varying experts. Extensive experiments on seven datasets with three LLMs demonstrate that DELL outperforms state-of-the-art baselines by up to 16.8\% in macro f1-score. Further analysis reveals that the generated reactions and explanations are greatly helpful in misinformation detection, while our proposed LLM-guided expert merging helps produce better-calibrated predictions.
Published: 2024

28. Do Membership Inference Attacks Work on Large Language Models?

Author: Duan, Michael, Suri, Anshuman, Mireshghallah, Niloofar, Min, Sewon, Shi, Weijia, Zettlemoyer, Luke, Tsvetkov, Yulia, Choi, Yejin, Evans, David, and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language
Abstract: Membership inference attacks (MIAs) attempt to predict whether a particular datapoint is a member of a target model's training data. Despite extensive research on traditional machine learning models, there has been limited work studying MIA on the pre-training data of large language models (LLMs). We perform a large-scale evaluation of MIAs over a suite of language models (LMs) trained on the Pile, ranging from 160M to 12B parameters. We find that MIAs barely outperform random guessing for most settings across varying LLM sizes and domains. Our further analyses reveal that this poor performance can be attributed to (1) the combination of a large dataset and few training iterations, and (2) an inherently fuzzy boundary between members and non-members. We identify specific settings where LLMs have been shown to be vulnerable to membership inference and show that the apparent success in such settings can be attributed to a distribution shift, such as when members and non-members are drawn from the seemingly identical domain but with different temporal ranges. We release our code and data as a unified benchmark package that includes all existing MIAs, supporting future work., Comment: Accepted at Conference on Language Modeling (COLM), 2024
Published: 2024

29. What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Author: Feng, Shangbin, Wan, Herun, Wang, Ningnan, Tan, Zhaoxuan, Luo, Minnan, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: Social media bot detection has always been an arms race between advancements in machine learning bot detectors and adversarial bot strategies to evade detection. In this work, we bring the arms race to the next level by investigating the opportunities and risks of state-of-the-art large language models (LLMs) in social bot detection. To investigate the opportunities, we design novel LLM-based bot detectors by proposing a mixture-of-heterogeneous-experts framework to divide and conquer diverse user information modalities. To illuminate the risks, we explore the possibility of LLM-guided manipulation of user textual and structured information to evade detection. Extensive experiments with three LLMs on two datasets demonstrate that instruction tuning on merely 1,000 annotated examples produces specialized LLMs that outperform state-of-the-art baselines by up to 9.1% on both datasets, while LLM-guided manipulation strategies could significantly bring down the performance of existing bot detectors by up to 29.6% and harm the calibration and reliability of bot detection systems., Comment: ACL 2024
Published: 2024

30. Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration

Author: Feng, Shangbin, Shi, Weijia, Wang, Yike, Ding, Wenxuan, Balachandran, Vidhisha, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: Despite efforts to expand the knowledge of large language models (LLMs), knowledge gaps -- missing or outdated information in LLMs -- might always persist given the evolving nature of knowledge. In this work, we study approaches to identify LLM knowledge gaps and abstain from answering questions when knowledge gaps are present. We first adapt existing approaches to model calibration or adaptation through fine-tuning/prompting and analyze their ability to abstain from generating low-confidence outputs. Motivated by their failures in self-reflection and over-reliance on held-out sets, we propose two novel approaches that are based on model collaboration, i.e., LLMs probing other LLMs for knowledge gaps, either cooperatively or competitively. Extensive experiments with three LLMs on four QA tasks featuring diverse knowledge domains demonstrate that both cooperative and competitive approaches to unveiling LLM knowledge gaps achieve up to 19.3% improvements on abstain accuracy against the strongest baseline. Further analysis reveals that our proposed mechanisms could help identify failure cases in retrieval augmentation and pinpoint knowledge gaps in multi-hop reasoning., Comment: ACL 2024
Published: 2024

31. Knots of Darkness in Atmospheric Turbulence

Author: Pires, D. G., Tsvetkov, D., Chandra, N., and Litchinitser, N. M.
Subjects: Physics - Optics, 78A02
Abstract: Topology, which originated as a mathematical discipline, nowadays advances the understanding of many branches of science and technology from elementary particle physics and cosmology to condensed matter physics. In optics, the topology of light and darkness facilitates new degrees of freedom for sculpting optical beams beyond conventionally used amplitude, phase, and polarization. This fundamentally new, spatial dimension opens new opportunities for several optical applications, ranging from optical manipulation, trapping, data processing, optical sensing and metrology, enhanced imaging, and microscopy, to classical and quantum communications. While topological stability of mathematical knots implying robustness to perturbations suggests their potential as information carriers, the behavior of optical knots in perturbative environments such as atmospheric turbulence is largely unexplored. Here, we experimentally and theoretically investigate the effects of atmospheric turbulence of optical knot stability and demonstrate that the number of crossing (the topological invariant) is preserved in the weak-turbulence regime, but may not be conserved in the stronger turbulence conditions. The turbulent medium is simulated in the laboratory using phase screens, which carry the refractive index changes associated with the Kolmogorov power spectrum, encoded in a spatial light modulator. The optical knots are reconstructed by single-shot measurements of the complex field, and the resilience of the knot topology is analyzed for various realistic turbulence strengths. These studies may give rise to entirely new approaches to the three-dimensional (3D) spatially resolved probing of turbulence., Comment: 23 pages and 11 figures
Published: 2024

32. Thermonuclear explosions as Type II supernovae

Author: Kozyreva, Alexandra, Moran-Fraile, Javier, Holas, Alexander, Bronner, Vincent A., Roepke, Friedrich K., Pavlyuk, Nikolay, Mironov, Alexey, and Tsvetkov, Dmitriy
Subjects: Astrophysics - Solar and Stellar Astrophysics, Astrophysics - High Energy Astrophysical Phenomena
Abstract: We consider a binary stellar system, in which a low-mass, of 0.6 Msun, carbon-oxygen white dwarf (WD) mergers with a degenerate helium core of 0.4 Msun of a red giant. We analyse the outcome of a merger within a common envelope (CE). We predict the observational properties of the resulting transient. We find that the double detonation of the WD, being a pure thermonuclear explosion and embedded into the hydrogen-rich CE, has a light curve with the distinct plateau shape, i.e. looks like a supernova (SN) Type IIP, with a duration of about 40 days. We find five observed SNe IIP: SN 2004dy, SN 2005af, SN 2005hd, SN 2007aa, and SN 2008bu, that match the V-band light curve of our models. Hence, we show that a thermonuclear explosion within a CE might be mistakenly identified as a SN IIP, which are believed to be an outcome of a core-collapse neutrino-driven explosion of a massive star. We discuss a number of diagnostics, that may help to distinguish this kind of a thermonuclear explosion from a core-collapse SN., Comment: 12 pp., accepted for publication in A & A
Published: 2024

33. Tuning Language Models by Proxy

Author: Liu, Alisa, Han, Xiaochuang, Wang, Yizhong, Tsvetkov, Yulia, Choi, Yejin, and Smith, Noah A.
Subjects: Computer Science - Computation and Language
Abstract: Despite the general capabilities of large pretrained language models, they consistently benefit from further adaptation to better achieve desired behaviors. However, tuning these models has become increasingly resource-intensive, or impossible when model weights are private. We introduce proxy-tuning, a lightweight decoding-time algorithm that operates on top of black-box LMs to achieve the same end as direct tuning, but by accessing only its predictions over the output vocabulary, not its parameters. Our method tunes a smaller LM, then applies the difference between the predictions of the small tuned and untuned LMs to shift the original predictions of the larger untuned model in the direction of tuning, while retaining the benefits of larger-scale pretraining. In experiments, when we apply proxy-tuning to Llama2-70B using proxies of only 7B size, we can close 88% of the gap between Llama2-70B and its truly-tuned chat version, when evaluated across knowledge, reasoning, and safety benchmarks. We then demonstrate the generality of proxy-tuning by applying it to domain adaptation on code, and task-specific finetuning on question-answering and math problems. Finally, we show how to proxy-tune a truly black-box LM, GPT-3.5, for temporal adaptation, increasing its knowledge about recent events. Our work demonstrates the promise of using small tuned LMs to efficiently customize large, potentially proprietary LMs through decoding-time guidance., Comment: COLM 2024 camera-ready, code available at https://github.com/alisawuffles/proxy-tuning
Published: 2024

34. Fine-grained Hallucination Detection and Editing for Language Models

Author: Mishra, Abhika, Asai, Akari, Balachandran, Vidhisha, Wang, Yizhong, Neubig, Graham, Tsvetkov, Yulia, and Hajishirzi, Hannaneh
Subjects: Computer Science - Computation and Language
Abstract: Large language models (LMs) are prone to generate factual errors, which are often called hallucinations. In this paper, we introduce a comprehensive taxonomy of hallucinations and argue that hallucinations manifest in diverse forms, each requiring varying degrees of careful assessments to verify factuality. We propose a novel task of automatic fine-grained hallucination detection and construct a new evaluation benchmark, FavaBench, that includes about one thousand fine-grained human judgments on three LM outputs across various domains. Our analysis reveals that ChatGPT and Llama2-Chat (70B, 7B) exhibit diverse types of hallucinations in the majority of their outputs in information-seeking scenarios. We train FAVA, a retrieval-augmented LM by carefully creating synthetic data to detect and correct fine-grained hallucinations. On our benchmark, our automatic and human evaluations show that FAVA significantly outperforms ChatGPT and GPT-4 on fine-grained hallucination detection, and edits suggested by FAVA improve the factuality of LM-generated text., Comment: Our code, data, and demo are available at https://fine-grained-hallucination.github.io. Published as a conference paper at COLM 2024
Published: 2024

35. Analysis of Emerging Particle Tracks in the BM@N Experiment of the NICA Accelerator Complex in the Mathematical Model of Fractal Thermodynamics

Author: Voytishin, N. N., Mikheev, S. A., Tsvetkov, V. P., and Tsvetkov, I. V.
Published: 2024
Full Text: View/download PDF

36. A Review of Empirical Studies Examining the Effects of E-Learning on University Students' Academic Achievement

Author: Lyudmila S. Chikileva, Alexey A. Chistyakov, Maryana V. Busygina, Alexey I. Prokopyev, Elena V. Grib, and Dmitry N. Tsvetkov
Abstract: The purpose of this research is to review peer-reviewed articles on the effects of e-learning on the academic performance of university students. The SCOPUS database was searched for peer-reviewed articles. The data obtained were analyzed using the content analysis method. Twenty-seven articles were found in journals indexed in the SCOPUS database and considered suitable for this study. Two researchers used the content analysis method to determine the effects of the articles reviewed. The results showed that studies in this area have increased in intensity in recent years. These studies were generally conducted over five years. It was found that quantitative methods were predominantly chosen. Researchers published most articles in 2021 and 2022. Most of the studies reviewed used a quantitative design, and only seven articles chose an experimental research design. Most studies were conducted in Pakistan, Saudi Arabia, Spain, India, Iran, and Turkey. The results show that different measurement instruments or tools were used to measure students' academic achievement. The impact of the peer-reviewed articles on the impact of e-learning on college students' academic achievement was examined in four categories. These categories are detailed in the results. Finally, pedagogical conclusions are drawn in light of the results obtained.
Published: 2023

37. Magnetic Properties of (Mg1 –xCox)3Si2O5(OH)4 Nanoscrolls and Nanocomposites Based on Them

Author: Belskaya, N. A., Khrapova, E. K., Ivanova, A. A., Eremina, R. M., Batulin, R. G., Chikurov, D. S., Degtyarenko, P. N., Tsvetkov, A. Yu., Kirilenko, D. A., and Krasilin, A. A.
Published: 2024
Full Text: View/download PDF

38. Superconducting Properties of 2G HTS Wires Irradiated by Bi Ions with Energy 670 MeV

Author: Degtyarenko, P. N., Skuratov, V. A., Semina, V. K., Ovcharov, A. V., Vasiliev, A. L., Malyavina, A. Yu., Gavrilkin, S. Yu., Tsvetkov, A. Yu., Petrzhik, A. M., and Novikov, M. S.
Published: 2024
Full Text: View/download PDF

39. Effect of Global Warming on Fish Population Parameters in Upper Volga Reservoirs

Author: Gerasimov, Yu. V., Solomatin, Yu. I., Bazarov, M. I., Lapshin, O. M., and Tsvetkov, A. I.
Published: 2024
Full Text: View/download PDF

40. Catalytic Ignition of Deuterium–Air Mixtures Over a Metallic Rhodium Surface at Pressures of 1–2 ATM

Author: Troshin, K. Ya., Rubtsov, N. M., Chernysh, V. I., Tsvetkov, G. I., Shamshin, I. O., Izmaylova, Yu. A., Kalinin, A. P., Leont’ev, A. A., and Rodionov, A. I.
Published: 2024
Full Text: View/download PDF

41. Neutralization of Sulfur-Containing Gases During Coal Filtration Combustion

Author: Tsvetkova, Yu. Yu., Kislov, V. M., Pilipenko, E. N., Salganskaya, M. V., and Tsvetkov, M. V.
Published: 2024
Full Text: View/download PDF

42. Macrozoobenthos of the Mekong River Delta under Conditions of Extremely Low Water

Author: Prokin, A. A., Seleznev, D. G., Dinh, Cu Nguyen, Van Tien, Tran, and Tsvetkov, A. I.
Published: 2024
Full Text: View/download PDF

43. CT-guided percutaneous marking of small pulmonary nodules with [99mTc]Tc-Macrosalb is very accurate and allows minimally invasive lung-sparing resection: a single-centre quality control

Author: Doncic, Nikola, Zech, Christoph J., Wild, Damian, Bachmann, Helga, Mallaev, Makhmudbek, Tsvetkov, Nikolay, Hojski, Aljaz, Takes, Martin T. L., and Lardinois, Didier
Published: 2024
Full Text: View/download PDF

44. Sorption of Nickel Ions from Aqueous Solutions by Lignite

Author: Zykov, I. Yu., Tsvetkov, V. E., and Fedorova, N. I.
Published: 2024
Full Text: View/download PDF

45. Localized Superconductivity in LaB6 Hexaboride with Dynamic Charge Stripes

Author: Azarevich, A. N., Bogach, A. V., Khrykina, O. N., Bolotina, N. B., Gridchina, V. M., Shitsevalova, N. Yu., Gavrilkin, S. Yu., Tsvetkov, A. Yu., Gabáni, S., Flachbart, K., Kuznetsov, A. V., and Sluchanko, N. E.
Published: 2024
Full Text: View/download PDF

46. Features of the Interaction of the Combustion Front of Diluted Methane–Oxygen Mixtures with Hollow Cylindrical and Conical Obstacles at Low Pressures

Author: Troshin, K. Ya., Rubtsov, N. M., Chernysh, V. I., and Tsvetkov, G. I.
Published: 2024
Full Text: View/download PDF

47. The First Record of Golden Jackal Canis aureus in Pskov Oblast—Causes and Consequences

Author: Korablev, N. P., Tsvetkov, I. N., Korablev, P. N., and Korablev, M. P.
Published: 2024
Full Text: View/download PDF

48. P^3SUM: Preserving Author's Perspective in News Summarization with Diffusion Language Models

Author: Liu, Yuhan, Feng, Shangbin, Han, Xiaochuang, Balachandran, Vidhisha, Park, Chan Young, Kumar, Sachin, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language, Computer Science - Machine Learning
Abstract: In this work, we take a first step towards designing summarization systems that are faithful to the author's intent, not only the semantic content of the article. Focusing on a case study of preserving political perspectives in news summarization, we find that existing approaches alter the political opinions and stances of news articles in more than 50% of summaries, misrepresenting the intent and perspectives of the news authors. We thus propose P^3SUM, a diffusion model-based summarization approach controlled by political perspective classifiers. In P^3SUM, the political leaning of a generated summary is iteratively evaluated at each decoding step, and any drift from the article's original stance incurs a loss back-propagated to the embedding layers, steering the political stance of the summary at inference time. Extensive experiments on three news summarization datasets demonstrate that P^3SUM outperforms state-of-the-art summarization systems and large language models by up to 13.7% in terms of the success rate of stance preservation, with competitive performance on standard metrics of summarization quality. Our findings present a first analysis of preservation of pragmatic features in summarization, highlight the lacunae in existing summarization models -- that even state-of-the-art models often struggle to preserve author's intents -- and develop new summarization systems that are more faithful to author's perspectives.
Published: 2023

49. Gen-Z: Generative Zero-Shot Text Classification with Contextualized Label Descriptions

Author: Kumar, Sachin, Park, Chan Young, and Tsvetkov, Yulia
Subjects: Computer Science - Computation and Language
Abstract: Language model (LM) prompting--a popular paradigm for solving NLP tasks--has been shown to be susceptible to miscalibration and brittleness to slight prompt variations, caused by its discriminative prompting approach, i.e., predicting the label given the input. To address these issues, we propose Gen-Z--a generative prompting framework for zero-shot text classification. GEN-Z is generative, as it measures the LM likelihood of input text, conditioned on natural language descriptions of labels. The framework is multivariate, as label descriptions allow us to seamlessly integrate additional contextual information about the labels to improve task performance. On various standard classification benchmarks, with six open-source LM families, we show that zero-shot classification with simple contextualization of the data source of the evaluation set consistently outperforms both zero-shot and few-shot baselines while improving robustness to prompt variations. Further, our approach enables personalizing classification in a zero-shot manner by incorporating author, subject, or reader information in the label descriptions.
Published: 2023

50. Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Contextual Integrity Theory

Author: Mireshghallah, Niloofar, Kim, Hyunwoo, Zhou, Xuhui, Tsvetkov, Yulia, Sap, Maarten, Shokri, Reza, and Choi, Yejin
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computation and Language, Computer Science - Cryptography and Security
Abstract: The interactive use of large language models (LLMs) in AI assistants (at work, home, etc.) introduces a new set of inference-time privacy risks: LLMs are fed different types of information from multiple sources in their inputs and are expected to reason about what to share in their outputs, for what purpose and with whom, within a given context. In this work, we draw attention to the highly critical yet overlooked notion of contextual privacy by proposing ConfAIde, a benchmark designed to identify critical weaknesses in the privacy reasoning capabilities of instruction-tuned LLMs. Our experiments show that even the most capable models such as GPT-4 and ChatGPT reveal private information in contexts that humans would not, 39% and 57% of the time, respectively. This leakage persists even when we employ privacy-inducing prompts or chain-of-thought reasoning. Our work underscores the immediate need to explore novel inference-time privacy-preserving approaches, based on reasoning and theory of mind., Comment: 2024 ICLR Spotlight. The dataset and code can be found at https://confaide.github.io
Published: 2023

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Region

Database

Publisher

15,728 results on '"A A, Tsvetkov"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources