Author: "Leqi Liu" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Leqi Liu"' showing total 34 results

Start Over Author "Leqi Liu"

34 results on '"Leqi Liu"'

1. A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

Author: Yuan, Hui, Zeng, Yifan, Wu, Yue, Wang, Huazheng, Wang, Mengdi, and Leqi, Liu
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computation and Language
Abstract: Reinforcement Learning from Human Feedback (RLHF) has become the predominant approach for language model (LM) alignment. At its core, RLHF uses a margin-based loss for preference optimization, specifying ideal LM behavior only by the difference between preferred and dispreferred responses. In this paper, we identify a common pitfall of margin-based methods -- the under-specification of ideal LM behavior on preferred and dispreferred responses individually, which leads to two unintended consequences as the margin increases: (1) The probability of dispreferred (e.g., unsafe) responses may increase, resulting in potential safety alignment failures. (2) The probability of preferred responses may decrease, even when those responses are ideal. We demystify the reasons behind these problematic behaviors: margin-based losses couple the change in the preferred probability to the gradient of the dispreferred one, and vice versa, often preventing the preferred probability from increasing while the dispreferred one decreases, and thus causing a synchronized increase or decrease in both probabilities. We term this effect, inherent in margin-based objectives, gradient entanglement. Formally, we derive conditions for general margin-based alignment objectives under which gradient entanglement becomes concerning: the inner product of the gradients of preferred and dispreferred log-probabilities is large relative to the individual gradient norms. We theoretically investigate why such inner products can be large when aligning language models and empirically validate our findings. Empirical implications of our framework extend to explaining important differences in the training dynamics of various preference optimization algorithms, and suggesting potential algorithm designs to mitigate the under-specification issue of margin-based methods and thereby improving language model alignment.
Published: 2024

2. A Unified Causal Framework for Auditing Recommender Systems for Ethical Concerns

Author: Sharma, Vibhhu, Gupta, Shantanu, Akpinar, Nil-Jana, Lipton, Zachary C., and Leqi, Liu
Subjects: Computer Science - Machine Learning, Computer Science - Information Retrieval
Abstract: As recommender systems become widely deployed in different domains, they increasingly influence their users' beliefs and preferences. Auditing recommender systems is crucial as it not only ensures the continuous improvement of recommendation algorithms but also safeguards against potential issues like biases and ethical concerns. In this paper, we view recommender system auditing from a causal lens and provide a general recipe for defining auditing metrics. Under this general causal auditing framework, we categorize existing auditing metrics and identify gaps in them -- notably, the lack of metrics for auditing user agency while accounting for the multi-step dynamics of the recommendation process. We leverage our framework and propose two classes of such metrics:future- and past-reacheability and stability, that measure the ability of a user to influence their own and other users' recommendations, respectively. We provide both a gradient-based and a black-box approach for computing these metrics, allowing the auditor to compute them under different levels of access to the recommender system. In our experiments, we demonstrate the efficacy of methods for computing the proposed metrics and inspect the design of recommender systems through these proposed metrics., Comment: 28 pages
Published: 2024

3. Ambient Air Pollution and Hospitalization for Acute Myocardial Infarction in Chongqing, China: A Time-Stratified Case Crossover Analysis

Author: Mingming Zhao, Xing Liu, Ming Yuan, Ying Yang, Hao Chen, Mengmeng Li, Pan Luo, Yong Duan, Jie Fan, Leqi Liu, and Li Zhou
Subjects: acute myocardial infarction, air pollution, environment, hospitalization, risk factor, Physics, QC1-999
Abstract: Previous studies have demonstrated that short-term exposure to ambient air pollution was associated with hospital admissions for cardiovascular diseases, but the evidence of its effects on acute myocardial infarction (AMI) in East Asian countries is limited and inconsistent. We aimed to investigate the association between air pollution and AMI hospitalizations in Chongqing, China. This time-stratified case-crossover study included 872 patients with AMI from three hospitals in Chongqing from January 2015 to December 2016. Exposures were compared between days with AMI (case days) and days without AMI (control days). Spearman’s correlation coefficient was applied to explore the correlation between air pollutants and meteorological conditions. Conditional logistic regression was used to assess the associations between air pollution exposure with different lag periods and AMI hospitalizations. Stratification analysis was further implemented by sex, age, and season. Hospitalizations for AMI were signifficantly associated with air pollution. All analyzed air pollutants showed lag-specific at lag 0 day and lag 01 day, whereas a 10 μg/m3 increase of average concentrations in PM2.5, PM10, SO2, NO2, and CO was associated with 1.034% (95% CI: 1.003–1.067%), 1.035% (95% CI:1.015–1.056%), 1.231% (95% CI: 1.053–1.438%), 1.062% (95% CI: 1.018–1.107%), and 1.406% (95% CI: 1.059–1.866%) increase in hospitalizations for AMI, respectively. No effect modifications were detected for sex, age, and season. Our findings suggest that short-term exposure to PM2.5, PM10, SO2, NO2, and CO contributes to increase AMI hospitalizations, which have public health implications for primary prevention and emergency health services.
Published: 2022
Full Text: View/download PDF

4. Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

Author: Dean, Sarah, Dong, Evan, Jagadeesan, Meena, and Leqi, Liu
Subjects: Computer Science - Machine Learning, Computer Science - Computers and Society, Computer Science - Computer Science and Game Theory, Computer Science - Information Retrieval
Abstract: As AI systems enter into a growing number of societal domains, these systems increasingly shape and are shaped by user preferences, opinions, and behaviors. However, the design of AI systems rarely accounts for how AI and users shape one another. In this position paper, we argue for the development of formal interaction models which mathematically specify how AI and users shape one another. Formal interaction models can be leveraged to (1) specify interactions for implementation, (2) monitor interactions through empirical analysis, (3) anticipate societal impacts via counterfactual analysis, and (4) control societal impacts via interventions. The design space of formal interaction models is vast, and model design requires careful consideration of factors such as style, granularity, mathematical complexity, and measurability. Using content recommender systems as a case study, we critically examine the nascent literature of formal interaction models with respect to these use-cases and design axes. More broadly, we call for the community to leverage formal interaction models when designing, evaluating, or auditing any AI system which interacts with users.
Published: 2024

5. Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework

Author: Li, Jingling, Tang, Zeyu, Liu, Xiaoyu, Spirtes, Peter, Zhang, Kun, Leqi, Liu, and Liu, Yang
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Large language models (LLMs) can easily generate biased and discriminative responses. As LLMs tap into consequential decision-making (e.g., hiring and healthcare), it is of crucial importance to develop strategies to mitigate these biases. This paper focuses on social bias, tackling the association between demographic information and LLM outputs. We propose a causality-guided debiasing framework that utilizes causal understandings of (1) the data-generating process of the training corpus fed to LLMs, and (2) the internal reasoning process of LLM inference, to guide the design of prompts for debiasing LLM outputs through selection mechanisms. Our framework unifies existing de-biasing prompting approaches such as inhibitive instructions and in-context contrastive examples, and sheds light on new ways of debiasing by encouraging bias-free reasoning. Our strong empirical performance on real-world datasets demonstrates that our framework provides principled guidelines on debiasing LLM outputs even with only the black-box access., Comment: 18 pages, 11 figures
Published: 2024

6. Personalized Language Modeling from Personalized Human Feedback

Author: Li, Xinyu, Lipton, Zachary C., and Leqi, Liu
Subjects: Computer Science - Computation and Language, Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Reinforcement Learning from Human Feedback (RLHF) is commonly used to fine-tune large language models to better align with human preferences. However, the underlying premise of algorithms developed under this framework can be problematic when user preferences encoded in human feedback are diverse. In this work, we aim to address this problem by developing methods for building personalized language models. We first formally introduce the task of learning from personalized human feedback and explain why vanilla RLHF can be ineffective in this context. We then propose a general Personalized-RLHF (P-RLHF) framework, including a user model that maps user information to user representations and can flexibly encode our assumptions on user preferences. We develop new learning objectives to perform personalized Direct Preference Optimization that jointly learns a user model and a personalized language model. We demonstrate the efficacy of our proposed method through (1) a synthetic task where we fine-tune a GPT-J 6B model to align with users with conflicting preferences on generation length; and (2) an instruction following task where we fine-tune a Tulu-7B model to generate responses for users with diverse preferences on the style of responses. In both cases, our learned models can generate personalized responses that are better aligned with the preferences of individual users.
Published: 2024

7. A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits

Author: Leqi, Liu, Zhou, Giulio, Kılınç-Karzan, Fatma, Lipton, Zachary C., and Montgomery, Alan L.
Subjects: Computer Science - Information Retrieval, Computer Science - Human-Computer Interaction, Computer Science - Machine Learning
Abstract: Personalized recommender systems suffuse modern life, shaping what media we read and what products we consume. Algorithms powering such systems tend to consist of supervised learning-based heuristics, such as latent factor models with a variety of heuristically chosen prediction targets. Meanwhile, theoretical treatments of recommendation frequently address the decision-theoretic nature of the problem, including the need to balance exploration and exploitation, via the multi-armed bandits (MABs) framework. However, MAB-based approaches rely heavily on assumptions about human preferences. These preference assumptions are seldom tested using human subject studies, partly due to the lack of publicly available toolkits to conduct such studies. In this work, we conduct a study with crowdworkers in a comics recommendation MABs setting. Each arm represents a comic category, and users provide feedback after each recommendation. We check the validity of core MABs assumptions-that human preferences (reward distributions) are fixed over time-and find that they do not hold. This finding suggests that any MAB algorithm used for recommender systems should account for human preference dynamics. While answering these questions, we provide a flexible experimental framework for understanding human preference dynamics and testing MABs algorithms with human users. The code for our experimental framework and the collected data can be found at https://github.com/HumainLab/human-bandit-evaluation., Comment: Accepted to CHI. 16 pages, 6 figures
Published: 2023
Full Text: View/download PDF

8. Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

Author: Jiang, Yueyi, Jiang, Yunfan, Leqi, Liu, and Winkielman, Piotr
Subjects: Prevention, Mental health, Good Health and Well Being
Abstract: Loneliness has been associated with negative outcomes for physical and mental health. Understanding how people express and cope with various forms of loneliness is critical for early screening and targeted interventions to reduce loneliness, particularly among vulnerable groups such as young adults. To examine how different forms of loneliness and coping strategies manifest in loneliness self-disclosure, we built a dataset, FIG-Loneliness (FIne-Grained Loneliness) by using Reddit posts in two young adult-focused forums and two loneliness related forums consisting of a diverse age group. We provided annotations by trained human annotators for binary and fine-grained loneliness classifications of the posts. Trained on FIG-Loneliness, two BERT-based models were used to understand loneliness forms and authors’ coping strategies in these forums. Our binary loneliness classification achieved an accuracy above 97%, and fine-grained loneliness category classification reached an average accuracy of 77% across all labeled categories. With FIG-Loneliness and model predictions, we found that loneliness expressions in the young adult related forums were distinct from other forums. Those in young adult-focused forums were more likely to express concerns pertaining to peer relationship, and were potentially more sensitive to geographical isolation impacted by the COVID-19 pandemic lockdown. Also, we showed that different forms of loneliness have differential use in coping strategies.
Published: 2023

9. Off-Policy Risk Assessment in Markov Decision Processes

Author: Huang, Audrey, Leqi, Liu, Lipton, Zachary Chase, and Azizzadenesheli, Kamyar
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: Addressing such diverse ends as safety alignment with human preferences, and the efficiency of learning, a growing line of reinforcement learning research focuses on risk functionals that depend on the entire distribution of returns. Recent work on \emph{off-policy risk assessment} (OPRA) for contextual bandits introduced consistent estimators for the target policy's CDF of returns along with finite sample guarantees that extend to (and hold simultaneously over) all risk. In this paper, we lift OPRA to Markov decision processes (MDPs), where importance sampling (IS) CDF estimators suffer high variance on longer trajectories due to small effective sample size. To mitigate these problems, we incorporate model-based estimation to develop the first doubly robust (DR) estimator for the CDF of returns in MDPs. This estimator enjoys significantly less variance and, when the model is well specified, achieves the Cramer-Rao variance lower bound. Moreover, for many risk functionals, the downstream estimates enjoy both lower bias and lower variance. Additionally, we derive the first minimax lower bounds for off-policy CDF and risk estimation, which match our error bounds up to a constant factor. Finally, we demonstrate the precision of our DR CDF estimates experimentally on several different environments.
Published: 2022

10. Supervised Learning with General Risk Functionals

Author: Leqi, Liu, Huang, Audrey, Lipton, Zachary C., and Azizzadenesheli, Kamyar
Subjects: Statistics - Machine Learning, Computer Science - Machine Learning
Abstract: Standard uniform convergence results bound the generalization gap of the expected loss over a hypothesis class. The emergence of risk-sensitive learning requires generalization guarantees for functionals of the loss distribution beyond the expectation. While prior works specialize in uniform convergence of particular functionals, our work provides uniform convergence for a general class of H\"older risk functionals for which the closeness in the Cumulative Distribution Function (CDF) entails closeness in risk. We establish the first uniform convergence results for estimating the CDF of the loss distribution, yielding guarantees that hold simultaneously both over all H\"older risk functionals and over all hypotheses. Thus licensed to perform empirical risk minimization, we develop practical gradient-based methods for minimizing distortion risks (widely studied subset of H\"older risks that subsumes the spectral risks, including the mean, conditional value at risk, cumulative prospect theory risks, and others) and provide convergence guarantees. In experiments, we demonstrate the efficacy of our learning procedure, both in settings where uniform convergence results hold and in high-dimensional settings with deep networks.
Published: 2022

11. A Taxonomy of Human and ML Strengths in Decision-Making to Investigate Human-ML Complementarity

Author: Rastogi, Charvi, Leqi, Liu, Holstein, Kenneth, and Heidari, Hoda
Subjects: Computer Science - Human-Computer Interaction, Computer Science - Machine Learning
Abstract: Hybrid human-ML systems increasingly make consequential decisions in a wide range of domains. These systems are often introduced with the expectation that the combined human-ML system will achieve complementary performance, that is, the combined decision-making system will be an improvement compared with either decision-making agent in isolation. However, empirical results have been mixed, and existing research rarely articulates the sources and mechanisms by which complementary performance is expected to arise. Our goal in this work is to provide conceptual tools to advance the way researchers reason and communicate about human-ML complementarity. Drawing upon prior literature in human psychology, machine learning, and human-computer interaction, we propose a taxonomy characterizing distinct ways in which human and ML-based decision-making can differ. In doing so, we conceptually map potential mechanisms by which combining human and ML decision-making may yield complementary performance, developing a language for the research community to reason about design of hybrid systems in any decision-making domain. To illustrate how our taxonomy can be used to investigate complementarity, we provide a mathematical aggregation framework to examine enabling conditions for complementarity. Through synthetic simulations, we demonstrate how this framework can be used to explore specific aspects of our taxonomy and shed light on the optimal mechanisms for combining human-ML judgments, Comment: 19 pages, 5 figures, Proceedings of HCOMP
Published: 2022

12. Modeling Attrition in Recommender Systems with Departing Bandits

Author: Ben-Porat, Omer, Cohen, Lee, Leqi, Liu, Lipton, Zachary C., and Mansour, Yishay
Subjects: Computer Science - Machine Learning, Computer Science - Information Retrieval, Statistics - Machine Learning
Abstract: Traditionally, when recommender systems are formalized as multi-armed bandits, the policy of the recommender system influences the rewards accrued, but not the length of interaction. However, in real-world systems, dissatisfied users may depart (and never come back). In this work, we propose a novel multi-armed bandit setup that captures such policy-dependent horizons. Our setup consists of a finite set of user types, and multiple arms with Bernoulli payoffs. Each (user type, arm) tuple corresponds to an (unknown) reward probability. Each user's type is initially unknown and can only be inferred through their response to recommendations. Moreover, if a user is dissatisfied with their recommendation, they might depart the system. We first address the case where all users share the same type, demonstrating that a recent UCB-based algorithm is optimal. We then move forward to the more challenging case, where users are divided among two types. While naive approaches cannot handle this setting, we provide an efficient learning algorithm that achieves $\tilde{O}(\sqrt{T})$ regret, where $T$ is the number of users., Comment: Accepted at AAAI 2022
Published: 2022

13. Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

Author: Jiang, Yueyi, Jiang, Yunfan, Leqi, Liu, and Winkielman, Piotr
Subjects: Computer Science - Computation and Language, Computer Science - Social and Information Networks
Abstract: Loneliness has been associated with negative outcomes for physical and mental health. Understanding how people express and cope with various forms of loneliness is critical for early screening and targeted interventions to reduce loneliness, particularly among vulnerable groups such as young adults. To examine how different forms of loneliness and coping strategies manifest in loneliness self-disclosure, we built a dataset, FIG-Loneliness (FIne-Grained Loneliness) by using Reddit posts in two young adult-focused forums and two loneliness related forums consisting of a diverse age group. We provided annotations by trained human annotators for binary and fine-grained loneliness classifications of the posts. Trained on FIG-Loneliness, two BERT-based models were used to understand loneliness forms and authors' coping strategies in these forums. Our binary loneliness classification achieved an accuracy above 97%, and fine-grained loneliness category classification reached an average accuracy of 77% across all labeled categories. With FIG-Loneliness and model predictions, we found that loneliness expressions in the young adults related forums were distinct from other forums. Those in young adult-focused forums were more likely to express concerns pertaining to peer relationship, and were potentially more sensitive to geographical isolation impacted by the COVID-19 pandemic lockdown. Also, we showed that different forms of loneliness have differential use in coping strategies.
Published: 2022

14. Action-Sufficient State Representation Learning for Control with Structural Constraints

Author: Huang, Biwei, Lu, Chaochao, Leqi, Liu, Hernández-Lobato, José Miguel, Glymour, Clark, Schölkopf, Bernhard, and Zhang, Kun
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: Perceived signals in real-world scenarios are usually high-dimensional and noisy, and finding and using their representation that contains essential and sufficient information required by downstream decision-making tasks will help improve computational efficiency and generalization ability in the tasks. In this paper, we focus on partially observable environments and propose to learn a minimal set of state representations that capture sufficient information for decision-making, termed \textit{Action-Sufficient state Representations} (ASRs). We build a generative environment model for the structural relationships among variables in the system and present a principled way to characterize ASRs based on structural constraints and the goal of maximizing cumulative reward in policy learning. We then develop a structured sequential Variational Auto-Encoder to estimate the environment model and extract ASRs. Our empirical results on CarRacing and VizDoom demonstrate a clear advantage of learning and using ASRs for policy learning. Moreover, the estimated environment model and ASRs allow learning behaviors from imagined outcomes in the compact latent space to improve sample efficiency.
Published: 2021

15. When Curation Becomes Creation: Algorithms, Microcontent, and the Vanishing Distinction between Platforms and Creators

Author: Leqi, Liu, Hadfield-Menell, Dylan, and Lipton, Zachary C.
Subjects: Computer Science - Computers and Society
Abstract: Ever since social activity on the Internet began migrating from the wilds of the open web to the walled gardens erected by so-called platforms, debates have raged about the responsibilities that these platforms ought to bear. And yet, despite intense scrutiny from the news media and grassroots movements of outraged users, platforms continue to operate, from a legal standpoint, on the friendliest terms. Under the current regulatory framework, platforms simultaneously benefit from: (1) broad discretion to organize (and censor) content however they choose; (2) powerful algorithms for curating a practically limitless supply of user-posted microcontent according to whatever ends they wish; and (3) absolution from the sorts of liability born by creators of the underlying content. In this paper, we contest the very validity of the platform-creator distinction, arguing that it is ill-adapted to the modern social media landscape where, in a real sense, platforms are creating derivative media products. We argue that any coherent regulatory framework must adapt to this reality, recognizing the subtle continuum of activities that span the curation-creation spectrum, providing a finer system of categorization and clearer guidance for precisely when platforms assume the responsibilities associated with content creation.
Published: 2021

16. Off-Policy Risk Assessment in Contextual Bandits

Author: Huang, Audrey, Leqi, Liu, Lipton, Zachary C., and Azizzadenesheli, Kamyar
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Even when unable to run experiments, practitioners can evaluate prospective policies, using previously logged data. However, while the bandits literature has adopted a diverse set of objectives, most research on off-policy evaluation to date focuses on the expected reward. In this paper, we introduce Lipschitz risk functionals, a broad class of objectives that subsumes conditional value-at-risk (CVaR), variance, mean-variance, many distorted risks, and CPT risks, among others. We propose Off-Policy Risk Assessment (OPRA), a framework that first estimates a target policy's CDF and then generates plugin estimates for any collection of Lipschitz risks, providing finite sample guarantees that hold simultaneously over the entire class. We instantiate OPRA with both importance sampling and doubly robust estimators. Our primary theoretical contributions are (i) the first uniform concentration inequalities for both CDF estimators in contextual bandits and (ii) error bounds on our Lipschitz risk estimates, which all converge at a rate of $O(1/\sqrt{n})$.
Published: 2021

17. On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk

Author: Huang, Audrey, Leqi, Liu, Lipton, Zachary C., and Azizzadenesheli, Kamyar
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: In order to model risk aversion in reinforcement learning, an emerging line of research adapts familiar algorithms to optimize coherent risk functionals, a class that includes conditional value-at-risk (CVaR). Because optimizing the coherent risk is difficult in Markov decision processes, recent work tends to focus on the Markov coherent risk (MCR), a time-consistent surrogate. While, policy gradient (PG) updates have been derived for this objective, it remains unclear (i) whether PG finds a global optimum for MCR; (ii) how to estimate the gradient in a tractable manner. In this paper, we demonstrate that, in general, MCR objectives (unlike the expected return) are not gradient dominated and that stationary points are not, in general, guaranteed to be globally optimal. Moreover, we present a tight upper bound on the suboptimality of the learned policy, characterizing its dependence on the nonlinearity of the objective and the degree of risk aversion. Addressing (ii), we propose a practical implementation of PG that uses state distribution reweighting to overcome previous limitations. Through experiments, we demonstrate that when the optimality gap is small, PG can learn risk-sensitive policies. However, we find that instances with large suboptimality gaps are abundant and easy to construct, outlining an important challenge for future research.
Published: 2021

18. Median Optimal Treatment Regimes

Author: Leqi, Liu and Kennedy, Edward H.
Subjects: Statistics - Methodology, Computer Science - Machine Learning
Abstract: Optimal treatment regimes are personalized policies for making a treatment decision based on subject characteristics, with the policy chosen to maximize some value. It is common to aim to maximize the mean outcome in the population, via a regime assigning treatment only to those whose mean outcome is higher under treatment versus control. However, the mean can be an unstable measure of centrality, resulting in imprecise statistical procedures, as well as unrobust decisions that can be overly influenced by a small fraction of subjects. In this work, we propose a new median optimal treatment regime that instead treats individuals whose conditional median is higher under treatment. This ensures that optimal decisions for individuals from the same group are not overly influenced either by (i) a small fraction of the group (unlike the mean criterion), or (ii) unrelated subjects from different groups (unlike marginal median/quantile criteria). We introduce a new measure of value, the Average Conditional Median Effect (ACME), which summarizes across-group median treatment outcomes of a policy, and which the median optimal treatment regime maximizes. After developing key motivating examples that distinguish median optimal treatment regimes from mean and marginal median optimal treatment regimes, we give a nonparametric efficiency bound for estimating the ACME of a policy, and propose a new doubly robust-style estimator that achieves the efficiency bound under weak conditions. To construct the median optimal treatment regime, we introduce a new doubly robust-style estimator for the conditional median treatment effect. Finite-sample properties are explored via numerical simulations and the proposed algorithm is illustrated using data from a randomized clinical trial in patients with HIV.
Published: 2021

19. Rebounding Bandits for Modeling Satiation Effects

Author: Leqi, Liu, Kilinc-Karzan, Fatma, Lipton, Zachary C., and Montgomery, Alan L.
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: Psychological research shows that enjoyment of many goods is subject to satiation, with short-term satisfaction declining after repeated exposures to the same item. Nevertheless, proposed algorithms for powering recommender systems seldom model these dynamics, instead proceeding as though user preferences were fixed in time. In this work, we introduce rebounding bandits, a multi-armed bandit setup, where satiation dynamics are modeled as time-invariant linear dynamical systems. Expected rewards for each arm decline monotonically with consecutive exposures to it and rebound towards the initial reward whenever that arm is not pulled. Unlike classical bandit settings, methods for tackling rebounding bandits must plan ahead and model-based methods rely on estimating the parameters of the satiation dynamics. We characterize the planning problem, showing that the greedy policy is optimal when the arms exhibit identical deterministic dynamics. To address stochastic satiation dynamics with unknown parameters, we propose Explore-Estimate-Plan (EEP), an algorithm that pulls arms methodically, estimates the system dynamics, and then plans accordingly.
Published: 2020

20. Game Design for Eliciting Distinguishable Behavior

Author: Yang, Fan, Leqi, Liu, Wu, Yifan, Lipton, Zachary C., Ravikumar, Pradeep, Cohen, William W., and Mitchell, Tom
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Statistics - Machine Learning
Abstract: The ability to inferring latent psychological traits from human behavior is key to developing personalized human-interacting machine learning systems. Approaches to infer such traits range from surveys to manually-constructed experiments and games. However, these traditional games are limited because they are typically designed based on heuristics. In this paper, we formulate the task of designing \emph{behavior diagnostic games} that elicit distinguishable behavior as a mutual information maximization problem, which can be solved by optimizing a variational lower bound. Our framework is instantiated by using prospect theory to model varying player traits, and Markov Decision Processes to parameterize the games. We validate our approach empirically, showing that our designed games can successfully distinguish among players with different traits, outperforming manually-designed ones by a large margin., Comment: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)
Published: 2019

21. Automated Dependence Plots

Author: Inouye, David I., Leqi, Liu, Kim, Joon Sik, Aragam, Bryon, and Ravikumar, Pradeep
Subjects: Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: In practical applications of machine learning, it is necessary to look beyond standard metrics such as test accuracy in order to validate various qualitative properties of a model. Partial dependence plots (PDP), including instance-specific PDPs (i.e., ICE plots), have been widely used as a visual tool to understand or validate a model. Yet, current PDPs suffer from two main drawbacks: (1) a user must manually sort or select interesting plots, and (2) PDPs are usually limited to plots along a single feature. To address these drawbacks, we formalize a method for automating the selection of interesting PDPs and extend PDPs beyond showing single features to show the model response along arbitrary directions, for example in raw feature space or a latent space arising from some generative model. We demonstrate the usefulness of our automated dependence plots (ADP) across multiple use-cases and datasets including model selection, bias detection, understanding out-of-sample behavior, and exploring the latent space of a generative model., Comment: In Uncertainty in Artificial Intelligence (UAI 2020). Camera-ready version. Code is available at https://github.com/davidinouye/adp
Published: 2019

22. Sample Complexity of Nonparametric Semi-Supervised Learning

Author: Dan, Chen, Leqi, Liu, Aragam, Bryon, Ravikumar, Pradeep, and Xing, Eric P.
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Mathematics - Statistics Theory, Statistics - Machine Learning
Abstract: We study the sample complexity of semi-supervised learning (SSL) and introduce new assumptions based on the mismatch between a mixture model learned from unlabeled data and the true mixture model induced by the (unknown) class conditional distributions. Under these assumptions, we establish an $\Omega(K\log K)$ labeled sample complexity bound without imposing parametric assumptions, where $K$ is the number of classes. Our results suggest that even in nonparametric settings it is possible to learn a near-optimal classifier using only a few labeled samples. Unlike previous theoretical work which focuses on binary classification, we consider general multiclass classification ($K>2$), which requires solving a difficult permutation learning problem. This permutation defines a classifier whose classification error is controlled by the Wasserstein distance between mixing measures, and we provide finite-sample results characterizing the behaviour of the excess risk of this classifier. Finally, we describe three algorithms for computing these estimators based on a connection to bipartite graph matching, and perform experiments to illustrate the superiority of the MLE over the majority vote estimator., Comment: 18 pages, 3 figures
Published: 2018

23. A Taxonomy of Human and ML Strengths in Decision-Making to Investigate Human-ML Complementarity

Author: Rastogi, Charvi, primary, Leqi, Liu, additional, Holstein, Kenneth, additional, and Heidari, Hoda, additional
Published: 2023
Full Text: View/download PDF

24. Analyzing Personality through Social Media Profile Picture Choice.

Author: Leqi Liu, Daniel Preotiuc-Pietro, Zahra Riahi Samani, Mohsen Ebrahimi Moghaddam, and Lyle H. Ungar
Published: 2016

25. Shared genetic architecture in autoimmune disease - preliminary analysis.

Author: Leqi Liu, Jia Tao 0001, Ziyan Yang, and Fadi Towfic
Published: 2015
Full Text: View/download PDF

26. A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits

Author: Leqi, Liu, primary, Zhou, Giulio, additional, Kilinc-Karzan, Fatma, additional, Lipton, Zachary, additional, and Montgomery, Alan, additional
Published: 2023
Full Text: View/download PDF

27. When Curation Becomes Creation.

Author: LEQI, LIU, HADFIELD-MENELL, DYLAN, and LIPTON, ZACHARY C.
Subjects: *INTERNET content providers, *ARTISTIC creation, *COMPUTING platforms, *SOCIAL media laws
Abstract: The authors discuss what constitutes creation on internet platforms. They mention the work of internet content providers, the issue of regulating such platforms and content, and how the two aspects of creating content and presenting it on the internet are starting to merge.
Published: 2021
Full Text: View/download PDF

28. Analyzing Personality through Social Media Profile Picture Choice

Author: Leqi Liu, Daniel Preotiuc-Pietro, Zahra Riahi Samani, Mohsen E. Moghaddam, and Lyle Ungar
Abstract: The content of images users post to their social media is driven in part by personality. In this study, we analyze how Twitter profile images vary with the personality of the users posting them. In our main analysis, we use profile images from over 66,000 users whose personality we estimate based on their tweets. To facilitate interpretability, we focus our analysis on aesthetic and facial features and control for demographic variation in image features and personality. Our results show significant differences in profile picture choice between personality traits, and that these can be harnessed to predict personality traits with robust accuracy. For example, agreeable and conscientious users display more positive emotions in their profile pictures, while users high in openness prefer more aesthetic photos.
Published: 2021

29. Modeling Attrition in Recommender Systems with Departing Bandits

Author: Ben-Porat, Omer, primary, Cohen, Lee, additional, Leqi, Liu, additional, Lipton, Zachary C., additional, and Mansour, Yishay, additional
Published: 2022
Full Text: View/download PDF

30. Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

Author: Jiang, Yueyi, primary, Jiang, Yunfan, additional, Leqi, Liu, additional, and Winkielman, Piotr, additional
Published: 2022
Full Text: View/download PDF

31. A Unifying Framework for Combining Complementary Strengths of Humans and ML toward Better Predictive Decision-Making

Author: Rastogi, Charvi, Leqi, Liu, Holstein, Kenneth, and Heidari, Hoda
Subjects: FOS: Computer and information sciences, Computer Science - Machine Learning, Computer Science - Human-Computer Interaction, Human-Computer Interaction (cs.HC), Machine Learning (cs.LG)
Abstract: Hybrid human-ML systems are increasingly in charge of consequential decisions in a wide range of domains. A growing body of empirical and theoretical work has advanced our understanding of these systems. However, existing empirical results are mixed, and theoretical proposals are often mutually incompatible. In this work, we propose a unifying framework for understanding conditions under which combining the complementary strengths of humans and ML leads to higher quality decisions than those produced by each of them individually -- a state which we refer to as human-ML complementarity. We focus specifically on the context of human-ML predictive decision-making and investigate optimal ways of combining human and ML predictive decisions, accounting for the underlying sources of variation in their judgments. Within this scope, we present two crucial contributions. First, taking a computational perspective of decision-making and drawing upon prior literature in psychology, machine learning, and human-computer interaction, we introduce a taxonomy characterizing a wide range of criteria across which human and machine decision-making differ. Second, formalizing our taxonomy allows us to study how human and ML predictive decisions should be aggregated optimally. We show that our proposed framework encompasses several existing models of human-ML complementarity as special cases. Last but not least, an initial exploratory analysis of our framework presents a critical insight for future work in human-ML complementarity: the mechanism by which we combine human and ML judgments should be informed by the underlying causes of divergence in their decisions., Comment: 21 pages, 1 figure
Published: 2022
Full Text: View/download PDF

32. Densification Mechanism of Os: Master Sintering Curve Fitting and Frist Principle Calculation

Author: Yunfei Yang, Junhao Sun, Wei Liu, Zhikai Hu, Shilei Li, Leqi Liu, and Jinshu Wang
Subjects: History, Polymers and Plastics, Business and International Management, Industrial and Manufacturing Engineering
Published: 2022

33. When Curation Becomes Creation

Author: Leqi, Liu, primary, Hadfield-Menell, Dylan, additional, and Lipton, Zachary C., additional
Published: 2021
Full Text: View/download PDF

34. Shared genetic architecture in autoimmune disease - preliminary analysis

Author: Jia Tao, Ziyan Yang, Fadi Towfic, and Leqi Liu
Subjects: Autoimmune disease, Insulin resistance, Disease Ontology, Diabetes mellitus, Human Phenotype Ontology, medicine, Genetic predisposition, Disease, Biology, medicine.disease, Bioinformatics, Genetic architecture
Abstract: Diseases that have different underlying genetic risk component(s) may share similar phenotypes. Traditionally, disease classifications have focused on characterizing diseases based on sets of related phenotypes. As an example, type I diabetes and type II diabetes are both classified as a type of diabetes based on patients having high blood sugar over long periods of time. However, as our understanding of genetic contributions to disease susceptibility and progression evolves, we start noticing that diseases with similar symptoms may have completely different causes. For example, type II diabetes is due to insulin resistance while type I diabetes is caused by immune cells attacking insulin producing cells. As genetic data becomes more highly available, it becomes possible to classify diseases based on their genetic causative drivers instead of their phenotypes. In this study, we have (1) explored the relationship between 10 autoimmune diseases along with type II diabetes based on their genetic susceptibility information and compared such classifications to existing disease categorizations based on disease symptoms/phenotypes from Human Phenotype Ontology, NCI-thesaurus and the Disease Ontology, and (2) developed automated scripts to compute similarities and cluster diseases based on the specified criteria. Categorization based on genetic susceptibility can help identify diseases that share similar drug targets and benefit from similar diagnosis technologies. We hope to further develop our system to apply it to more disease categories.
Published: 2015

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

34 results on '"Leqi Liu"'

1. A Common Pitfall of Margin-based Language Model Alignment: Gradient Entanglement

2. A Unified Causal Framework for Auditing Recommender Systems for Ethical Concerns

3. Ambient Air Pollution and Hospitalization for Acute Myocardial Infarction in Chongqing, China: A Time-Stratified Case Crossover Analysis

4. Accounting for AI and Users Shaping One Another: The Role of Mathematical Models

5. Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework

6. Personalized Language Modeling from Personalized Human Feedback

7. A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits

8. Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

9. Off-Policy Risk Assessment in Markov Decision Processes

10. Supervised Learning with General Risk Functionals

11. A Taxonomy of Human and ML Strengths in Decision-Making to Investigate Human-ML Complementarity

12. Modeling Attrition in Recommender Systems with Departing Bandits

13. Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

14. Action-Sufficient State Representation Learning for Control with Structural Constraints

15. When Curation Becomes Creation: Algorithms, Microcontent, and the Vanishing Distinction between Platforms and Creators

16. Off-Policy Risk Assessment in Contextual Bandits

17. On the Convergence and Optimality of Policy Gradient for Markov Coherent Risk

18. Median Optimal Treatment Regimes

19. Rebounding Bandits for Modeling Satiation Effects

20. Game Design for Eliciting Distinguishable Behavior

21. Automated Dependence Plots

22. Sample Complexity of Nonparametric Semi-Supervised Learning

23. A Taxonomy of Human and ML Strengths in Decision-Making to Investigate Human-ML Complementarity

24. Analyzing Personality through Social Media Profile Picture Choice.

25. Shared genetic architecture in autoimmune disease - preliminary analysis.

26. A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits

27. When Curation Becomes Creation.

28. Analyzing Personality through Social Media Profile Picture Choice

29. Modeling Attrition in Recommender Systems with Departing Bandits

30. Many Ways to Be Lonely: Fine-Grained Characterization of Loneliness and Its Potential Changes in COVID-19

31. A Unifying Framework for Combining Complementary Strengths of Humans and ML toward Better Predictive Decision-Making

32. Densification Mechanism of Os: Master Sintering Curve Fitting and Frist Principle Calculation

33. When Curation Becomes Creation

34. Shared genetic architecture in autoimmune disease - preliminary analysis

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

34 results on '"Leqi Liu"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources