Author: "Gligorić, Kristina" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Gligorić, Kristina"' showing total 35 results

Start Over Author "Gligorić, Kristina"

35 results on '"Gligorić, Kristina"'

1. Can Unconfident LLM Annotations Be Used for Confident Conclusions?

Author: Gligorić, Kristina, Zrnic, Tijana, Lee, Cinoo, Candès, Emmanuel J., and Jurafsky, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Human-Computer Interaction
Abstract: Large language models (LLMs) have shown high agreement with human raters across a variety of tasks, demonstrating potential to ease the challenges of human data collection. In computational social science (CSS), researchers are increasingly leveraging LLM annotations to complement slow and expensive human annotations. Still, guidelines for collecting and using LLM annotations, without compromising the validity of downstream conclusions, remain limited. We introduce Confidence-Driven Inference: a method that combines LLM annotations and LLM confidence indicators to strategically select which human annotations should be collected, with the goal of producing accurate statistical estimates and provably valid confidence intervals while reducing the number of human annotations needed. Our approach comes with safeguards against LLM annotations of poor quality, guaranteeing that the conclusions will be both valid and no less accurate than if we only relied on human annotations. We demonstrate the effectiveness of Confidence-Driven Inference over baselines in statistical estimation tasks across three CSS settings--text politeness, stance, and bias--reducing the needed number of human annotations by over 25% in each. Although we use CSS settings for demonstration, Confidence-Driven Inference can be used to estimate most standard quantities across a broad range of NLP problems.
Published: 2024

2. NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps

Author: Gligoric, Kristina, Cheng, Myra, Zheng, Lucia, Durmus, Esin, and Jurafsky, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Computers and Society, Computer Science - Human-Computer Interaction, Computer Science - Social and Information Networks
Abstract: The use of words to convey speaker's intent is traditionally distinguished from the `mention' of words for quoting what someone said, or pointing out properties of a word. Here we show that computationally modeling this use-mention distinction is crucial for dealing with counterspeech online. Counterspeech that refutes problematic content often mentions harmful language but is not harmful itself (e.g., calling a vaccine dangerous is not the same as expressing disapproval of someone for calling vaccines dangerous). We show that even recent language models fail at distinguishing use from mention, and that this failure propagates to two key downstream tasks: misinformation and hate speech detection, resulting in censorship of counterspeech. We introduce prompting mitigations that teach the use-mention distinction, and show they reduce these errors. Our work highlights the importance of the use-mention distinction for NLP and CSS and offers ways to address it., Comment: NAACL 2024 (Main conference)
Published: 2024

3. AnthroScore: A Computational Linguistic Measure of Anthropomorphism

Author: Cheng, Myra, Gligoric, Kristina, Piccardi, Tiziano, and Jurafsky, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Computers and Society
Abstract: Anthropomorphism, or the attribution of human-like characteristics to non-human entities, has shaped conversations about the impacts and possibilities of technology. We present AnthroScore, an automatic metric of implicit anthropomorphism in language. We use a masked language model to quantify how non-human entities are implicitly framed as human by the surrounding context. We show that AnthroScore corresponds with human judgments of anthropomorphism and dimensions of anthropomorphism described in social science literature. Motivated by concerns of misleading anthropomorphism in computer science discourse, we use AnthroScore to analyze 15 years of research papers and downstream news articles. In research papers, we find that anthropomorphism has steadily increased over time, and that papers related to language models have the most anthropomorphism. Within ACL papers, temporal increases in anthropomorphism are correlated with key neural advancements. Building upon concerns of scientific misinformation in mass media, we identify higher levels of anthropomorphism in news headlines compared to the research papers they cite. Since AnthroScore is lexicon-free, it can be directly applied to a wide range of text sources., Comment: EACL 2024 Main Conference
Published: 2024

4. Grounding Gaps in Language Model Generations

Author: Shaikh, Omar, Gligorić, Kristina, Khetan, Ashna, Gerstgrasser, Matthias, Yang, Diyi, and Jurafsky, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Human-Computer Interaction
Abstract: Effective conversation requires common ground: a shared understanding between the participants. Common ground, however, does not emerge spontaneously in conversation. Speakers and listeners work together to both identify and construct a shared basis while avoiding misunderstanding. To accomplish grounding, humans rely on a range of dialogue acts, like clarification (What do you mean?) and acknowledgment (I understand.). However, it is unclear whether large language models (LLMs) generate text that reflects human grounding. To this end, we curate a set of grounding acts and propose corresponding metrics that quantify attempted grounding. We study whether LLM generations contain grounding acts, simulating turn-taking from several dialogue datasets and comparing results to humans. We find that -- compared to humans -- LLMs generate language with less conversational grounding, instead generating text that appears to simply presume common ground. To understand the roots of the identified grounding gap, we examine the role of instruction tuning and preference optimization, finding that training on contemporary preference data leads to a reduction in generated grounding acts. Altogether, we highlight the need for more research investigating conversational grounding in human-AI interaction., Comment: NAACL 2024; 18 pages, 2 figures
Published: 2023

5. In-class Data Analysis Replications: Teaching Students while Testing Science

Author: Gligoric, Kristina, Piccardi, Tiziano, Hofman, Jake, and West, Robert
Subjects: Computer Science - Computers and Society, Computer Science - Artificial Intelligence, Computer Science - Machine Learning, Computer Science - Social and Information Networks
Abstract: Science is facing a reproducibility crisis. Previous work has proposed incorporating data analysis replications into classrooms as a potential solution. However, despite the potential benefits, it is unclear whether this approach is feasible, and if so, what the involved stakeholders-students, educators, and scientists-should expect from it. Can students perform a data analysis replication over the course of a class? What are the costs and benefits for educators? And how can this solution help benchmark and improve the state of science? In the present study, we incorporated data analysis replications in the project component of the Applied Data Analysis course (CS-401) taught at EPFL (N=354 students). Here we report pre-registered findings based on surveys administered throughout the course. First, we demonstrate that students can replicate previously published scientific papers, most of them qualitatively and some exactly. We find discrepancies between what students expect of data analysis replications and what they experience by doing them along with changes in expectations about reproducibility, which together serve as evidence of attitude shifts to foster students' critical thinking. Second, we provide information for educators about how much overhead is needed to incorporate replications into the classroom and identify concerns that replications bring as compared to more traditional assignments. Third, we identify tangible benefits of the in-class data analysis replications for scientific communities, such as a collection of replication reports and insights about replication barriers in scientific work that should be avoided going forward. Overall, we demonstrate that incorporating replication tasks into a large data science class can increase the reproducibility of scientific work as a by-product of data science instruction, thus benefiting both science and students.
Published: 2023

6. Food Choice Mimicry on a Large University Campus

Author: Gligoric, Kristina, Chiolero, Arnaud, Kıcıman, Emre, White, Ryen W., Horvitz, Eric, and West, Robert
Subjects: Computer Science - Computers and Society, Computer Science - Social and Information Networks
Abstract: Social influence is a strong determinant of food consumption, which in turn influences health. Although consistent observations have been made on the role of social factors in driving similarities in food consumption, much less is known about the precise governing mechanisms. We study social influence on food choice through carefully designed causal analyses, leveraging the sequential nature of shop queues on a major university campus. In particular, we consider a large number of adjacent purchases where a focal user immediately follows another user ("partner") in the checkout queue and both make a purchase. Identifying the partner's impact on the focal user, we find strong evidence of a specific behavioral mechanism for how dietary similarities between individuals arise: purchasing mimicry, a phenomenon where the focal user copies the partner's purchases. For instance, across food additions purchased during lunchtime together with a meal, we find that the focal user is significantly more likely to purchase the food item when the partner buys the item, v.s. when the partner does not, increasing the purchasing probability by 14% in absolute terms, or by 83% in relative terms. The effect is observed across all food types, but largest for condiments, and smallest for soft drinks. We find that no such effect is observed when a focal user is compared to a random (rather than directly preceding) partner. Furthermore, purchasing mimicry is present across age, gender, and status subpopulations, but strongest for students and the youngest persons. Finally, we find a dose-response relationship whereby mimicry decreases as proximity in the purchasing queue decreases. The results of this study elucidate the behavioral mechanism of purchasing mimicry and have further implications for understanding and improving dietary behaviors on campus.
Published: 2023

7. Othering and low status framing of immigrant cuisines in US restaurant reviews and large language models

Author: Luo, Yiwei, Gligorić, Kristina, and Jurafsky, Dan
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence
Abstract: Identifying implicit attitudes toward food can mitigate social prejudice due to food's salience as a marker of ethnic identity. Stereotypes about food are representational harms that may contribute to racialized discourse and negatively impact economic outcomes for restaurants. Understanding the presence of representational harms in online corpora in particular is important, given the increasing use of large language models (LLMs) for text generation and their tendency to reproduce attitudes in their training data. Through careful linguistic analyses, we evaluate social theories about attitudes toward immigrant cuisine in a large-scale study of framing differences in 2.1M English language Yelp reviews. Controlling for factors such as restaurant price and neighborhood racial diversity, we find that immigrant cuisines are more likely to be othered using socially constructed frames of authenticity (e.g., "authentic," "traditional"), and that non-European cuisines (e.g., Indian, Mexican) in particular are described as more exotic compared to European ones (e.g., French). We also find that non-European cuisines are more likely to be described as cheap and dirty, even after controlling for price, and even among the most expensive restaurants. Finally, we show that reviews generated by LLMs reproduce similar framing tendencies, pointing to the downstream retention of these representational harms. Our results corroborate social theories of gastronomic stereotyping, revealing racialized evaluative processes and linguistic strategies through which they manifest., Comment: ICWSM '24
Published: 2023

8. Biased Bytes: On the Validity of Estimating Food Consumption from Digital Traces

Author: Gligorić, Kristina, Đorđević, Irena, and West, Robert
Subjects: Computer Science - Social and Information Networks, Computer Science - Computers and Society, Computer Science - Human-Computer Interaction
Abstract: Given that measuring food consumption at a population scale is a challenging task, researchers have begun to explore digital traces (e.g., from social media or from food-tracking applications) as potential proxies. However, it remains unclear to what extent digital traces reflect real food consumption. The present study aims to bridge this gap by quantifying the link between dietary behaviors as captured via social media (Twitter) v.s. a food-tracking application (MyFoodRepo). We focus on the case of Switzerland and contrast images of foods collected through the two platforms, by designing and deploying a novel crowdsourcing framework for estimating biases with respect to nutritional properties and appearance. We find that the food type distributions in social media v.s. food tracking diverge; e.g., bread is 2.5 times more frequent among consumed and tracked foods than on Twitter, whereas cake is 12 times more frequent on Twitter. Controlling for the different food type distributions, we contrast consumed and tracked foods of a given type with foods shared on Twitter. Across food types, food posted on Twitter is perceived as tastier, more caloric, less healthy, less likely to have been consumed at home, more complex, and larger-portioned, compared to consumed and tracked foods. The fact that there is a divergence between food consumption as measured via the two platforms implies that at least one of the two is not a faithful representation of the true food consumption in the general Swiss population. Thus, researchers should be attentive and aim to establish evidence of validity before using digital traces as a proxy for the true food consumption of a general population. We conclude by discussing the potential sources of these biases and their implications, outlining pitfalls and threats to validity, and proposing actionable ways for overcoming them.
Published: 2022
Full Text: View/download PDF

9. Anticipated versus Actual Effects of Platform Design Change: A Case Study of Twitter's Character Limit

Author: Gligorić, Kristina, Częstochowska, Justyna, Anderson, Ashton, and West, Robert
Subjects: Computer Science - Social and Information Networks, Computer Science - Computers and Society, Computer Science - Human-Computer Interaction
Abstract: The design of online platforms is both critically important and challenging, as any changes may lead to unintended consequences, and it can be hard to predict how users will react. Here we conduct a case study of a particularly important real-world platform design change: Twitter's decision to double the character limit from 140 to 280 characters to soothe users' need to ''cram'' or ''squeeze'' their tweets, informed by modeling of historical user behavior. In our analysis, we contrast Twitter's anticipated pre-intervention predictions about user behavior with actual post-intervention user behavior: Did the platform design change lead to the intended user behavior shifts, or did a gap between anticipated and actual behavior emerge? Did different user groups react differently? We find that even though users do not ''cram'' as much under 280 characters as they used to under 140 characters, emergent ``cramming'' at the new limit seems to not have been taken into account when designing the platform change. Furthermore, investigating textual features, we find that, although post-intervention ''crammed'' tweets are longer, their syntactic and semantic characteristics remain similar and indicative of ''squeezing''. Applying the same approach as Twitter policy-makers, we create updated counterfactual estimates and find that the character limit would need to be increased further to reduce cramming that re-emerged at the new limit. We contribute to the rich literature studying online user behavior with an empirical study that reveals a dynamic interaction between platform design and user behavior, with immediate policy and practical implications for the design of socio-technical systems.
Published: 2022
Full Text: View/download PDF

10. On the Context-Free Ambiguity of Emoji

Author: Czestochowska, Justyna, Gligoric, Kristina, Peyrard, Maxime, Mentha, Yann, Bien, Michal, Grutter, Andrea, Auer, Anita, Xanthos, Aris, and West, Robert
Subjects: Computer Science - Computation and Language, Computer Science - Computers and Society
Abstract: Emojis come with prepacked semantics making them great candidates to create new forms of more accessible communications. Yet, little is known about how much of this emojis semantic is agreed upon by humans, outside of textual contexts. Thus, we collected a crowdsourced dataset of one-word emoji descriptions for 1,289 emojis presented to participants with no surrounding text. The emojis and their interpretations were then examined for ambiguity. We find that with 30 annotations per emoji, 16 emojis (1.2%) are completely unambiguous, whereas 55 emojis (4.3%) are so ambiguous that their descriptions are indistinguishable from randomly chosen descriptions. Most of studied emojis are spread out between the two extremes. Furthermore, investigating the ambiguity of different types of emojis, we find that an important factor is the extent to which an emoji has an embedded symbolical meaning drawn from an established code-book of symbols. We conclude by discussing design implications.
Published: 2022

11. Population-scale dietary interests during the COVID-19 pandemic

Author: Gligoric, Kristina, Chiolero, Arnaud, Kıcıman, Emre, White, Ryen W., and West, Robert
Subjects: Computer Science - Social and Information Networks
Abstract: The SARS-CoV-2 virus has altered people's lives around the world. Here we document population-wide shifts in dietary interests in 18 countries in 2020, as revealed through time series of Google search volumes. We find that during the first wave of the COVID-19 pandemic there was an overall surge in food interest, larger and longer-lasting than the surge during typical end-of-year holidays in Western countries. The shock of decreased mobility manifested as a drastic increase in interest in consuming food at home and a corresponding decrease in consuming food outside of home. The largest (up to threefold) increases occurred for calorie-dense carbohydrate-based foods such as pastries, bakery products, bread, and pies. The observed shifts in dietary interests have the potential to globally affect food consumption and health outcomes. These findings can inform governmental and organizational decisions regarding measures to mitigate the effects of the COVID-19 pandemic on diet and nutrition., Comment: Nature Communications (2022)
Published: 2021
Full Text: View/download PDF

12. Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

Author: Peyrard, Maxime, Borges, Beatriz, Gligorić, Kristina, and West, Robert
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: The automatic detection of humor poses a grand challenge for natural language processing. Transformer-based systems have recently achieved remarkable results on this task, but they usually (1)~were evaluated in setups where serious vs humorous texts came from entirely different sources, and (2)~focused on benchmarking performance without providing insights into how the models work. We make progress in both respects by training and analyzing transformer-based humor recognition models on a recently introduced dataset consisting of minimal pairs of aligned sentences, one serious, the other humorous. We find that, although our aligned dataset is much harder than previous datasets, transformer-based models recognize the humorous sentence in an aligned pair with high accuracy (78%). In a careful error analysis, we characterize easy vs hard instances. Finally, by analyzing attention weights, we obtain important insights into the mechanisms by which transformers recognize humor. Most remarkably, we find clear evidence that one single attention head learns to recognize the words that make a test sentence humorous, even without access to this information at training time., Comment: Published at IJCAI 2021
Published: 2021

13. Formation of Social Ties Influences Food Choice: A Campus-Wide Longitudinal Study

Author: Gligorić, Kristina, White, Ryen W., Kıcıman, Emre, Horvitz, Eric, Chiolero, Arnaud, and West, Robert
Subjects: Computer Science - Social and Information Networks, Computer Science - Artificial Intelligence
Abstract: Nutrition is a key determinant of long-term health, and social influence has long been theorized to be a key determinant of nutrition. It has been difficult to quantify the postulated role of social influence on nutrition using traditional methods such as surveys, due to the typically small scale and short duration of studies. To overcome these limitations, we leverage a novel source of data: logs of 38 million food purchases made over an 8-year period on the Ecole Polytechnique Federale de Lausanne (EPFL) university campus, linked to anonymized individuals via the smartcards used to make on-campus purchases. In a longitudinal observational study, we ask: How is a person's food choice affected by eating with someone else whose own food choice is healthy vs. unhealthy? To estimate causal effects from the passively observed log data, we control confounds in a matched quasi-experimental design: we identify focal users who at first do not have any regular eating partners but then start eating with a fixed partner regularly, and we match focal users into comparison pairs such that paired users are nearly identical with respect to covariates measured before acquiring the partner, where the two focal users' new eating partners diverge in the healthiness of their respective food choice. A difference-in-differences analysis of the paired data yields clear evidence of social influence: focal users acquiring a healthy-eating partner change their habits significantly more toward healthy foods than focal users acquiring an unhealthy-eating partner. We further identify foods whose purchase frequency is impacted significantly by the eating partner's healthiness of food choice. Beyond the main results, the work demonstrates the utility of passively sensed food purchase logs for deriving insights, with the potential of informing the design of public health interventions and food offerings.
Published: 2021
Full Text: View/download PDF

14. Adoption of Twitter's New Length Limit: Is 280 the New 140?

Author: Gligorić, Kristina, Anderson, Ashton, and West, Robert
Subjects: Computer Science - Social and Information Networks, Computer Science - Computation and Language
Abstract: In November 2017, Twitter doubled the maximum allowed tweet length from 140 to 280 characters, a drastic switch on one of the world's most influential social media platforms. In the first long-term study of how the new length limit was adopted by Twitter users, we ask: Does the effect of the new length limit resemble that of the old one? Or did the doubling of the limit fundamentally change how Twitter is shaped by the limited length of posted content? By analyzing Twitter's publicly available 1% sample over a period of around 3 years, we find that, when the length limit was raised from 140 to 280 characters, the prevalence of tweets around 140 characters dropped immediately, while the prevalence of tweets around 280 characters rose steadily for about 6 months. Despite this rise, tweets approaching the length limit have been far less frequent after than before the switch. We find widely different adoption rates across languages and client-device types. The prevalence of tweets around 140 characters before the switch in a given language is strongly correlated with the prevalence of tweets around 280 characters after the switch in the same language, and very long tweets are vastly more popular on Web clients than on mobile clients. Moreover, tweets of around 280 characters after the switch are syntactically and semantically similar to tweets of around 140 characters before the switch, manifesting patterns of message squeezing in both cases. Taken together, these findings suggest that the new 280-character limit constitutes a new, less intrusive version of the old 140-character limit. The length limit remains an important factor that should be considered in all studies using Twitter data.
Published: 2020

15. Experts and authorities receive disproportionate attention on Twitter during the COVID-19 crisis

Author: Gligorić, Kristina, Ribeiro, Manoel Horta, Müller, Martin, Altunina, Olesia, Peyrard, Maxime, Salathé, Marcel, Colavizza, Giovanni, and West, Robert
Subjects: Computer Science - Social and Information Networks
Abstract: Timely access to accurate information is crucial during the COVID-19 pandemic. Prompted by key stakeholders' cautioning against an "infodemic", we study information sharing on Twitter from January through May 2020. We observe an overall surge in the volume of general as well as COVID-19-related tweets around peak lockdown in March/April 2020. With respect to engagement (retweets and likes), accounts related to healthcare, science, government and politics received by far the largest boosts, whereas accounts related to religion and sports saw a relative decrease in engagement. While the threat of an "infodemic" remains, our results show that social media also provide a platform for experts and public authorities to be widely heard during a global crisis., Comment: Kristina Gligori\'c, Manoel Horta Ribeiro and Martin M\"uller contributed equally to this work
Published: 2020

16. Sudden Attention Shifts on Wikipedia During the COVID-19 Crisis

Author: Ribeiro, Manoel Horta, Gligorić, Kristina, Peyrard, Maxime, Lemmerich, Florian, Strohmaier, Markus, and West, Robert
Subjects: Computer Science - Computers and Society, Computer Science - Social and Information Networks
Abstract: We study how the COVID-19 pandemic, alongside the severe mobility restrictions that ensued, has impacted information access on Wikipedia, the world's largest online encyclopedia. A longitudinal analysis that combines pageview statistics for 12 Wikipedia language editions with mobility reports published by Apple and Google reveals massive shifts in the volume and nature of information seeking patterns during the pandemic. Interestingly, while we observe a transient increase in Wikipedia's pageview volume following mobility restrictions, the nature of information sought was impacted more permanently. These changes are most pronounced for language editions associated with countries where the most severe mobility restrictions were implemented. We also find that articles belonging to different topics behaved differently; e.g., attention towards entertainment-related topics is lingering and even increasing, while the interest in health- and biology-related topics was either small or transient. Our results highlight the utility of Wikipedia for studying how the pandemic is affecting people's needs, interests, and concerns., Comment: Manoel Horta Ribeiro, Kristina Gligori\'c and Maxime Peyrard contributed equally to this work. Also, this paper has been accepted at the 15th International Conference on Web and Social Media (ICWSM), please cite accordingly
Published: 2020

17. Revealed versus potential spatial accessibility of healthcare and changing patterns during the COVID-19 pandemic

Author: Gligorić, Kristina, Kamath, Chaitanya, Weiss, Daniel J., Bavadekar, Shailesh, Liu, Yun, Shekel, Tomer, Schulman, Kevin, and Gabrilovich, Evgeniy
Published: 2023
Full Text: View/download PDF

18. Causal Effects of Brevity on Style and Success in Social Media

Author: Gligoric, Kristina, Anderson, Ashton, and West, Robert
Subjects: Computer Science - Social and Information Networks
Abstract: In online communities, where billions of people strive to propagate their messages, understanding how wording affects success is of primary importance. In this work, we are interested in one particularly salient aspect of wording: brevity. What is the causal effect of brevity on message success? What are the linguistic traits of brevity? When is brevity beneficial, and when is it not? Whereas most prior work has studied the effect of wording on style and success in observational setups, we conduct a controlled experiment, in which crowd workers shorten social media posts to prescribed target lengths and other crowd workers subsequently rate the original and shortened versions. This allows us to isolate the causal effect of brevity on the success of a message. We find that concise messages are on average more successful than the original messages up to a length reduction of 30-40%. The optimal reduction is on average between 10% and 20%. The observed effect is robust across different subpopulations of raters and is the strongest for raters who visit social media on a daily basis. Finally, we discover unique linguistic and content traits of brevity and correlate them with the measured probability of success in order to distinguish effective from ineffective shortening strategies. Overall, our findings are important for developing a better understanding of the effect of brevity on the success of messages in online social media.
Published: 2019

19. Message Distortion in Information Cascades

Author: Ribeiro, Manoel Horta, Gligorić, Kristina, and West, Robert
Subjects: Computer Science - Social and Information Networks
Abstract: Information diffusion is usually modeled as a process in which immutable pieces of information propagate over a network. In reality, however, messages are not immutable, but may be morphed with every step, potentially entailing large cumulative distortions. This process may lead to misinformation even in the absence of malevolent actors, and understanding it is crucial for modeling and improving online information systems. Here, we perform a controlled, crowdsourced experiment in which we simulate the propagation of information from medical research papers. Starting from the original abstracts, crowd workers iteratively shorten previously produced summaries to increasingly smaller lengths. We also collect control summaries where the original abstract is compressed directly to the final target length. Comparing cascades to controls allows us to separate the effect of the length constraint from that of accumulated distortion. Via careful manual coding, we annotate lexical and semantic units in the medical abstracts and track them along cascades. We find that iterative summarization has a negative impact due to the accumulation of error, but that high-quality intermediate summaries result in less distorted messages than in the control case. Different types of information behave differently; in particular, the conclusion of a medical abstract (i.e., its key message) is distorted most. Finally, we compare abstractive with extractive summaries, finding that the latter are less prone to semantic distortion. Overall, this work is a first step in studying information cascades without the assumption that disseminated content is immutable, with implications on our understanding of the role of word-of-mouth effects on the misreporting of science., Comment: Presented at TheWebConf 2019
Published: 2019
Full Text: View/download PDF

20. Visible Light Communications Based Indoor Positioning via Compressed Sensing

Author: Gligoric, Kristina, Ajmani, Manisha, Vukobratovic, Dejan, and Sinanovic, Sinan
Subjects: Computer Science - Information Theory
Abstract: This paper presents an approach for visible light communication-based indoor positioning using compressed sensing. We consider a large number of light emitting diodes (LEDs) simultaneously transmitting their positional information and a user device equipped with a photo-diode. By casting the LED signal separation problem into an equivalent compressed sensing framework, the user device is able to detect the set of nearby LEDs using sparse signal recovery algorithms. From this set, and using proximity method, position estimation is proposed based on the concept that if signal separation is possible, then overlapping light beam regions lead to decrease in positioning error due to increase in the number of reference points. The proposed method is evaluated in a LED-illuminated large-scale indoor open-plan office space scenario. The positioning accuracy is compared against the positioning error lower bound of the proximity method, for various system parameters., Comment: to appear in IEEE Communication Letters
Published: 2018

21. How Constraints Affect Content: The Case of Twitter's Switch from 140 to 280 Characters

Author: Gligorić, Kristina, Anderson, Ashton, and West, Robert
Subjects: Computer Science - Social and Information Networks
Abstract: It is often said that constraints affect creative production, both in terms of form and quality. Online social media platforms frequently impose constraints on the content that users can produce, limiting the range of possible contributions. Do these restrictions tend to push creators towards producing more or less successful content? How do creators adapt their contributions to fit the limits imposed by social media platforms? To answer these questions, we conduct an observational study of a recent event: on November 7, 2017, Twitter changed the maximum allowable length of a tweet from 140 to 280 characters, thereby significantly altering its signature constraint. In the first study of this switch, we compare tweets with nearly or exactly 140 characters before the change to tweets of the same length posted after the change. This setup enables us to characterize how users alter their tweets to fit the constraint and how this affects their tweets' success. We find that in response to a length constraint, users write more tersely, use more abbreviations and contracted forms, and use fewer definite articles. Also, although in general tweet success increases with length, we find initial evidence that tweets made to fit the 140-character constraint tend to be more successful than similar-length tweets written when the constraint was removed, suggesting that the length constraint improved tweet quality., Comment: To appear in the Proceedings of AAAI ICWSM 2018
Published: 2018

22. Population-scale dietary interests during the COVID-19 pandemic

Author: Gligorić, Kristina, Chiolero, Arnaud, Kıcıman, Emre, White, Ryen W., and West, Robert
Published: 2022
Full Text: View/download PDF

23. Measuring and shaping the nutritional environment via food sales logs: case studies of campus-wide food choice and a call to action

Author: Gligorić, Kristina, primary, Zbinden, Robin, additional, Chiolero, Arnaud, additional, Kıcıman, Emre, additional, White, Ryen W., additional, Horvitz, Eric, additional, and West, Robert, additional
Published: 2024
Full Text: View/download PDF

24. People who share encounters with racism are silenced online by humans and machines, but a guideline-reframing intervention holds promise.

Author: Cinoo Lee, Gligorić, Kristina, Kalluri, Pratyusha Ria, Harrington, Maggie, Durmus, Esin, Sanchez, Kiara L., Nay San, Tse, Danny, Xuan Zhao, Hamedani, MarYam G., Markus, Hazel Rose, Jurafsky, Dan, and Eberhardt, Jennifer L.
Subjects: *LANGUAGE models, *SOCIAL media, *NATURAL language processing, *RACE discrimination, *INTERNET content moderation
Abstract: Are members of marginalized communities silenced on social media when they share personal experiences of racism? Here, we investigate the role of algorithms, humans, and platform guidelines in suppressing disclosures of racial discrimination. In a field study of actual posts from a neighborhood-based social media platform, we find that when users talk about their experiences as targets of racism, their posts are disproportionately flagged for removal as toxic by five widely used moderation algorithms from major online platforms, including the most recent large language models. We show that human users disproportionately flag these disclosures for removal as well. Next, in a follow-up experiment, we demonstrate that merely witnessing such suppression negatively influences how Black Americans view the community and their place in it. Finally, to address these challenges to equity and inclusion in online spaces, we introduce a mitigation strategy: a guideline-reframing intervention that is effective at reducing silencing behavior across the political spectrum. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

25. Othering and low prestige framing of immigrant cuisines in US restaurant reviews and large language models

Author: Luo, Yiwei, Gligorić, Kristina, and Jurafsky, Dan
Subjects: FOS: Computer and information sciences, Artificial Intelligence (cs.AI), Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computation and Language (cs.CL)
Abstract: Identifying and understanding implicit attitudes toward food can help efforts to mitigate social prejudice due to food's pervasive role as a marker of cultural and ethnic identity. Stereotypes about food are a form of microaggression that contribute to harmful public discourse that may in turn perpetuate prejudice toward ethnic groups and negatively impact economic outcomes for restaurants. Through careful linguistic analyses, we evaluate social theories about attitudes toward immigrant cuisine in a large-scale study of framing differences in 2.1M English language Yelp reviews of restaurants in 14 US states. Controlling for factors such as restaurant price and neighborhood racial diversity, we find that immigrant cuisines are more likely to be framed in objectifying and othering terms of authenticity (e.g., authentic, traditional), exoticism (e.g., exotic, different), and prototypicality (e.g., typical, usual), but that non-Western immigrant cuisines (e.g., Indian, Mexican) receive more othering than European cuisines (e.g., French, Italian). We further find that non-Western immigrant cuisines are framed less positively and as lower status, being evaluated in terms of affordability and hygiene. Finally, we show that reviews generated by large language models (LLMs) reproduce many of the same framing tendencies. Our results empirically corroborate social theories of taste and gastronomic stereotyping, and reveal linguistic processes by which such attitudes are reified.
Published: 2023

26. Linguistic effects on news headline success: Evidence from thousands of online field experiments (Registered Report)

Author: Gligorić, Kristina, primary, Lifchits, George, additional, West, Robert, additional, and Anderson, Ashton, additional
Published: 2023
Full Text: View/download PDF

27. On the Context-Free Ambiguity of Emoji

Author: Częstochowska, Justyna, primary, Gligorić, Kristina, additional, Peyrard, Maxime, additional, Mentha, Yann, additional, Bień, Michał, additional, Grütter, Andrea, additional, Auer, Anita, additional, Xanthos, Aris, additional, and West, Robert, additional
Published: 2022
Full Text: View/download PDF

28. Linguistic effects on news headline success: Evidence from thousands of online field experiments (Registered Report Protocol)

Author: Gligorić, Kristina, primary, Lifchits, George, additional, West, Robert, additional, and Anderson, Ashton, additional
Published: 2021
Full Text: View/download PDF

29. Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

Author: Peyrard, Maxime, primary, Borges, Beatriz, additional, Gligorić, Kristina, additional, and West, Robert, additional
Published: 2021
Full Text: View/download PDF

30. Sudden Attention Shifts on Wikipedia During the COVID-19 Crisis

Author: Horta Ribeiro, Manoel, primary, Gligorić, Kristina, additional, Peyrard, Maxime, additional, Lemmerich, Florian, additional, Strohmaier, Markus, additional, and West, Robert, additional
Published: 2021
Full Text: View/download PDF

31. Formation of Social Ties Influences Food Choice

Author: Gligorić, Kristina, primary, White, Ryen W., additional, Kiciman, Emre, additional, Horvitz, Eric, additional, Chiolero, Arnaud, additional, and West, Robert, additional
Published: 2021
Full Text: View/download PDF

32. Causal Effects of Brevity on Style and Success in Social Media

Author: Gligorić, Kristina, primary, Anderson, Ashton, additional, and West, Robert, additional
Published: 2019
Full Text: View/download PDF

33. Comparing and Developing Tools to Measure the Readability of Domain-Specific Texts

Author: Redmiles, Elissa, primary, Maszkiewicz, Lisa, additional, Hwang, Emily, additional, Kuchhal, Dhruv, additional, Liu, Everest, additional, Morales, Miraida, additional, Peskov, Denis, additional, Rao, Sudha, additional, Stevens, Rock, additional, Gligorić, Kristina, additional, Kross, Sean, additional, Mazurek, Michelle, additional, and Daumé III, Hal, additional
Published: 2019
Full Text: View/download PDF

34. How Constraints Affect Content: The Case of Twitter’s Switch from 140 to 280 Characters

Author: Gligorić, Kristina, primary, Anderson, Ashton, additional, and West, Robert, additional
Published: 2018
Full Text: View/download PDF

35. People who share encounters with racism are silenced online by humans and machines, but a guideline-reframing intervention holds promise.

Author: Lee C, Gligorić K, Kalluri PR, Harrington M, Durmus E, Sanchez KL, San N, Tse D, Zhao X, Hamedani MG, Markus HR, Jurafsky D, and Eberhardt JL
Subjects: Humans, Black or African American, Algorithms, Racism, Social Media
Abstract: Are members of marginalized communities silenced on social media when they share personal experiences of racism? Here, we investigate the role of algorithms, humans, and platform guidelines in suppressing disclosures of racial discrimination. In a field study of actual posts from a neighborhood-based social media platform, we find that when users talk about their experiences as targets of racism, their posts are disproportionately flagged for removal as toxic by five widely used moderation algorithms from major online platforms, including the most recent large language models. We show that human users disproportionately flag these disclosures for removal as well. Next, in a follow-up experiment, we demonstrate that merely witnessing such suppression negatively influences how Black Americans view the community and their place in it. Finally, to address these challenges to equity and inclusion in online spaces, we introduce a mitigation strategy: a guideline-reframing intervention that is effective at reducing silencing behavior across the political spectrum., Competing Interests: Competing interests statement:J.L.E. serves on the advisory board for a social media company. The position is unpaid.
Published: 2024
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

35 results on '"Gligorić, Kristina"'

1. Can Unconfident LLM Annotations Be Used for Confident Conclusions?

2. NLP Systems That Can't Tell Use from Mention Censor Counterspeech, but Teaching the Distinction Helps

3. AnthroScore: A Computational Linguistic Measure of Anthropomorphism

4. Grounding Gaps in Language Model Generations

5. In-class Data Analysis Replications: Teaching Students while Testing Science

6. Food Choice Mimicry on a Large University Campus

7. Othering and low status framing of immigrant cuisines in US restaurant reviews and large language models

8. Biased Bytes: On the Validity of Estimating Food Consumption from Digital Traces

9. Anticipated versus Actual Effects of Platform Design Change: A Case Study of Twitter's Character Limit

10. On the Context-Free Ambiguity of Emoji

11. Population-scale dietary interests during the COVID-19 pandemic

12. Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

13. Formation of Social Ties Influences Food Choice: A Campus-Wide Longitudinal Study

14. Adoption of Twitter's New Length Limit: Is 280 the New 140?

15. Experts and authorities receive disproportionate attention on Twitter during the COVID-19 crisis

16. Sudden Attention Shifts on Wikipedia During the COVID-19 Crisis

17. Revealed versus potential spatial accessibility of healthcare and changing patterns during the COVID-19 pandemic

18. Causal Effects of Brevity on Style and Success in Social Media

19. Message Distortion in Information Cascades

20. Visible Light Communications Based Indoor Positioning via Compressed Sensing

21. How Constraints Affect Content: The Case of Twitter's Switch from 140 to 280 Characters

22. Population-scale dietary interests during the COVID-19 pandemic

23. Measuring and shaping the nutritional environment via food sales logs: case studies of campus-wide food choice and a call to action

24. People who share encounters with racism are silenced online by humans and machines, but a guideline-reframing intervention holds promise.

25. Othering and low prestige framing of immigrant cuisines in US restaurant reviews and large language models

26. Linguistic effects on news headline success: Evidence from thousands of online field experiments (Registered Report)

27. On the Context-Free Ambiguity of Emoji

28. Linguistic effects on news headline success: Evidence from thousands of online field experiments (Registered Report Protocol)

29. Laughing Heads: Can Transformers Detect What Makes a Sentence Funny?

30. Sudden Attention Shifts on Wikipedia During the COVID-19 Crisis

31. Formation of Social Ties Influences Food Choice

32. Causal Effects of Brevity on Style and Success in Social Media

33. Comparing and Developing Tools to Measure the Readability of Domain-Specific Texts

34. How Constraints Affect Content: The Case of Twitter’s Switch from 140 to 280 Characters

35. People who share encounters with racism are silenced online by humans and machines, but a guideline-reframing intervention holds promise.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

35 results on '"Gligorić, Kristina"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources