Author: "Gan, Jiarui" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Gan, Jiarui"' showing total 95 results

Start Over Author "Gan, Jiarui"

95 results on '"Gan, Jiarui"'

1. Markov Decision Processes with Time-Varying Geometric Discounting

Author: Gan, Jiarui, Hennes, Annika, Majumdar, Rupak, Mandal, Debmalya, and Radanovic, Goran
Subjects: Computer Science - Artificial Intelligence, Computer Science - Computer Science and Game Theory
Abstract: Canonical models of Markov decision processes (MDPs) usually consider geometric discounting based on a constant discount factor. While this standard modeling approach has led to many elegant results, some recent studies indicate the necessity of modeling time-varying discounting in certain applications. This paper studies a model of infinite-horizon MDPs with time-varying discount factors. We take a game-theoretic perspective -- whereby each time step is treated as an independent decision maker with their own (fixed) discount factor -- and we study the subgame perfect equilibrium (SPE) of the resulting game as well as the related algorithmic problems. We present a constructive proof of the existence of an SPE and demonstrate the EXPTIME-hardness of computing an SPE. We also turn to the approximate notion of $\epsilon$-SPE and show that an $\epsilon$-SPE exists under milder assumptions. An algorithm is presented to compute an $\epsilon$-SPE, of which an upper bound of the time complexity, as a function of the convergence property of the time-varying discount factor, is provided., Comment: 24 pages, 3 figures
Published: 2023
Full Text: View/download PDF

2. Stochastic Principal-Agent Problems: Efficient Computation and Learning

Author: Gan, Jiarui, Majumdar, Rupak, Mandal, Debmalya, and Radanovic, Goran
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Machine Learning, Computer Science - Multiagent Systems
Abstract: We introduce a stochastic principal-agent model. A principal and an agent interact in a stochastic environment, each privy to observations about the state not available to the other. The principal has the power of commitment, both to elicit information from the agent and to provide signals about her own information. The players communicate with each other and then select actions independently. Each of them receives a payoff based on the state and their joint action, and the environment transitions to a new state. The interaction continues over a finite time horizon. Both players are far-sighted, aiming to maximize their total payoffs over the time horizon. The model encompasses as special cases extensive-form games (EFGs) and stochastic games of incomplete information, partially observable Markov decision processes (POMDPs), as well as other forms of sequential principal-agent interactions, including Bayesian persuasion and automated mechanism design problems. We consider both the computation and learning of the principal's optimal policy. Since the general problem, which subsumes POMDPs, is intractable, we explore algorithmic solutions under hindsight observability, where the state and the interaction history are revealed at the end of each step. Though the problem becomes more amenable under this condition, the number of possible histories remains exponential in the length of the time horizon, making approaches for EFG-based models infeasible. We present an efficient algorithm based on the inducible value sets. The algorithm computes an $\epsilon$-approximate optimal policy in time polynomial in $1/\epsilon$. Additionally, we show an efficient learning algorithm for an episodic reinforcement learning setting where the transition probabilities are unknown. The algorithm guarantees sublinear regret $\tilde{O}(T^{2/3})$ for both players over $T$ episodes.
Published: 2023

3. Robust Stackelberg Equilibria

Author: Gan, Jiarui, Han, Minbiao, Wu, Jibang, and Xu, Haifeng
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Computational Complexity, Economics - Theoretical Economics
Abstract: This paper provides a systematic study of the robust Stackelberg equilibrium (RSE), which naturally generalizes the widely adopted solution concept of the strong Stackelberg equilibrium (SSE). The RSE accounts for any possible up-to-$\delta$ suboptimal follower responses in Stackelberg games and is adopted to improve the robustness of the leader's strategy. While a few variants of robust Stackelberg equilibrium have been considered in previous literature, the RSE solution concept we consider is importantly different -- in some sense, it relaxes previously studied robust Stackelberg strategies and is applicable to much broader sources of uncertainties. We provide a thorough investigation of several fundamental properties of RSE, including its utility guarantees, algorithmics, and learnability. We first show that the RSE we defined always exists and thus is well-defined. Then we characterize how the leader's utility in RSE changes with the robustness level considered. On the algorithmic side, we show that, in sharp contrast to the tractability of computing an SSE, it is NP-hard to obtain a fully polynomial approximation scheme (FPTAS) for any constant robustness level. Nevertheless, we develop a quasi-polynomial approximation scheme (QPTAS) for RSE. Finally, we examine the learnability of the RSE in a natural learning scenario, where both players' utilities are not known in advance, and provide almost tight sample complexity results on learning the RSE. As a corollary of this result, we also obtain an algorithm for learning SSE, which strictly improves a key result of Bai et al. in terms of both utility guarantee and computational efficiency.
Published: 2023

4. k-Prize Weighted Voting Games

Author: Lee, Wei-Chen, Hyland, David, Abate, Alessandro, Elkind, Edith, Gan, Jiarui, Gutierrez, Julian, Harrenstein, Paul, and Wooldridge, Michael
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Computational Complexity
Abstract: We introduce a natural variant of weighted voting games, which we refer to as k-Prize Weighted Voting Games. Such games consist of n players with weights, and k prizes, of possibly differing values. The players form coalitions, and the i-th largest coalition (by the sum of weights of its members) wins the i-th largest prize, which is then shared among its members. We present four solution concepts to analyse the games in this class, and characterise the existence of stable outcomes in games with three players and two prizes, and in games with uniform prizes. We then explore the efficiency of stable outcomes in terms of Pareto optimality and utilitarian social welfare. Finally, we study the computational complexity of finding stable outcomes., Comment: Accepted to AAMAS 2023
Published: 2023

5. Learning to Manipulate a Commitment Optimizer

Author: Chen, Yurong, Deng, Xiaotie, Gan, Jiarui, and Li, Yuhao
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Artificial Intelligence, Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning, Economics - Theoretical Economics
Abstract: It is shown in recent studies that in a Stackelberg game the follower can manipulate the leader by deviating from their true best-response behavior. Such manipulations are computationally tractable and can be highly beneficial for the follower. Meanwhile, they may result in significant payoff losses for the leader, sometimes completely defeating their first-mover advantage. A warning to commitment optimizers, the risk these findings indicate appears to be alleviated to some extent by a strict information advantage the manipulations rely on. That is, the follower knows the full information about both players' payoffs whereas the leader only knows their own payoffs. In this paper, we study the manipulation problem with this information advantage relaxed. We consider the scenario where the follower is not given any information about the leader's payoffs to begin with but has to learn to manipulate by interacting with the leader. The follower can gather necessary information by querying the leader's optimal commitments against contrived best-response behaviors. Our results indicate that the information advantage is not entirely indispensable to the follower's manipulations: the follower can learn the optimal way to manipulate in polynomial time with polynomially many queries of the leader's optimal commitment.
Published: 2023

6. Online Reinforcement Learning with Uncertain Episode Lengths

Author: Mandal, Debmalya, Radanovic, Goran, Gan, Jiarui, Singla, Adish, and Majumdar, Rupak
Subjects: Computer Science - Machine Learning
Abstract: Existing episodic reinforcement algorithms assume that the length of an episode is fixed across time and known a priori. In this paper, we consider a general framework of episodic reinforcement learning when the length of each episode is drawn from a distribution. We first establish that this problem is equivalent to online reinforcement learning with general discounting where the learner is trying to optimize the expected discounted sum of rewards over an infinite horizon, but where the discounting function is not necessarily geometric. We show that minimizing regret with this new general discounting is equivalent to minimizing regret with uncertain episode lengths. We then design a reinforcement learning algorithm that minimizes regret with general discounting but acts for the setting with uncertain episode lengths. We instantiate our general bound for different types of discounting, including geometric and polynomial discounting. We also show that we can obtain similar regret bounds even when the uncertainty over the episode lengths is unknown, by estimating the unknown distribution over time. Finally, we compare our learning algorithms with existing value-iteration based episodic RL algorithms in a grid-world environment., Comment: To appear at AAAI-2023
Published: 2023

7. Generalized Principal-Agency: Contracts, Information, Games and Beyond

Author: Gan, Jiarui, Han, Minbiao, Wu, Jibang, and Xu, Haifeng
Subjects: Computer Science - Computer Science and Game Theory, Economics - Theoretical Economics
Abstract: In the principal-agent problem formulated by Myerson'82, agents have private information (type) and make private decisions (action), both of which are unobservable to the principal. Myerson pointed out an elegant linear programming solution that relies on the revelation principle. This paper extends Myerson's results to a more general setting where the principal's action space can be infinite and subject to additional design constraints. Our generalized principal-agent model unifies several important design problems including contract design, information design, and Bayesian Stackelberg games, and encompasses them as special cases. We first extend the revelation principle to this general model, based on which a polynomial-time algorithm is then derived for computing the optimal mechanism for the principal. This algorithm not only implies new efficient solutions simultaneously for all the aforementioned special cases but also significantly simplifies previously known algorithms designed for special cases. Inspired by the recent interest in the algorithmic design of a single contract and menu of contracts, we study such constrained design problems to our general principal-agent model. In contrast to the above unification, our results here illustrate the other facet of diversity among different principal-agent design problems and demonstrate how their different structures can lead to different complexities: some are tractable whereas others are APX-hard. Finally, we reveal an interesting connection of our model to the problem of information acquisition for decision making and study its algorithmic properties in general.
Published: 2022

8. Socially Fair Reinforcement Learning

Author: Mandal, Debmalya and Gan, Jiarui
Subjects: Computer Science - Machine Learning, Computer Science - Computers and Society, Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems
Abstract: We consider the problem of episodic reinforcement learning where there are multiple stakeholders with different reward functions. Our goal is to output a policy that is socially fair with respect to different reward functions. Prior works have proposed different objectives that a fair policy must optimize including minimum welfare, and generalized Gini welfare. We first take an axiomatic view of the problem, and propose four axioms that any such fair objective must satisfy. We show that the Nash social welfare is the unique objective that uniquely satisfies all four objectives, whereas prior objectives fail to satisfy all four axioms. We then consider the learning version of the problem where the underlying model i.e. Markov decision process is unknown. We consider the problem of minimizing regret with respect to the fair policies maximizing three different fair objectives -- minimum welfare, generalized Gini welfare, and Nash social welfare. Based on optimistic planning, we propose a generic learning algorithm and derive its regret bound with respect to the three different policies. For the objective of Nash social welfare, we also derive a lower bound in regret that grows exponentially with $n$, the number of agents. Finally, we show that for the objective of minimum welfare, one can improve regret by a factor of $O(H)$ for a weaker notion of regret.
Published: 2022

9. Admissible Policy Teaching through Reward Design

Author: Banihashem, Kiarash, Singla, Adish, Gan, Jiarui, and Radanovic, Goran
Subjects: Computer Science - Machine Learning, Computer Science - Artificial Intelligence
Abstract: We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy from a set of admissible policies. The goal of the reward designer is to modify the underlying reward function cost-efficiently while ensuring that any approximately optimal deterministic policy under the new reward function is admissible and performs well under the original reward function. This problem can be viewed as a dual to the problem of optimal reward poisoning attacks: instead of forcing an agent to adopt a specific policy, the reward designer incentivizes an agent to avoid taking actions that are inadmissible in certain states. Perhaps surprisingly, and in contrast to the problem of optimal reward poisoning attacks, we first show that the reward design problem for admissible policy teaching is computationally challenging, and it is NP-hard to find an approximately optimal reward modification. We then proceed by formulating a surrogate problem whose optimal solution approximates the optimal solution to the reward design problem in our setting, but is more amenable to optimization techniques and analysis. For this surrogate problem, we present characterization results that provide bounds on the value of the optimal solution. Finally, we design a local search algorithm to solve the surrogate problem and showcase its utility using simulation-based experiments.
Published: 2022

10. Experimental investigation on the deterioration of the physical and mechanical properties of autoclaved aerated concrete at elevated temperatures

Author: Tang Lingxiao, Yao Huayan, Zhang Mingyuan, Gan Jiarui, Xie Mingyu, and Xie Wansheng
Subjects: autoclaved aerated concrete, temperature, compressive strength, microstructure, deterioration mechanism, Technology, Chemical technology, TP1-1185, Chemicals: Manufacture, use, etc., TP200-248
Abstract: Autoclaved aerated concrete (AAC) has been extensively studied and applied in the past decades because of its excellent thermal insulation and acoustic performance, energy efficiency, and outstanding structural performance. To investigate the deterioration characteristics of AAC under high temperatures, the physico-mechanical properties of AAC at different temperatures were tested by mass loss, wave velocity, and compressive tests, and the deterioration mechanism was discussed by X-ray diffraction (XRD), thermogravimetric analysis (TGA), and scanning electron microscopy (SEM) tests. The results showed that after exposure to elevated temperatures, the parameters of apparent form, mass loss, wave velocity, and compressive strength underwent remarkable conversions. It was observed that the ultrasonic behavior of AAC was affected at high temperatures as it increased at 100°C and decreased beyond 100°C. In addition, the compressive strength exhibited a two-stage transformation, slightly increased from ambient temperature to 300°C, and exhibited a rapid reduction beyond 300°C. At 900°C, the specimen lost its strength. By XRD, TGA, and SEM, it was confirmed that a series of physicochemical changes in AAC, such as the water escape and evaporation, decomposition of calcium silicate hydrate and calcium carbonate, and structural damage, were the primary reasons for the deterioration of the physical and mechanical properties at elevated temperatures.
Published: 2024
Full Text: View/download PDF

11. Approximately Envy-Free Budget-Feasible Allocation

Author: Gan, Jiarui, Li, Bo, and Wu, Xiaowei
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems
Abstract: In the budget-feasible allocation problem, a set of items with varied sizes and values are to be allocated to a group of agents. Each agent has a budget constraint on the total size of items she can receive. The goal is to compute a feasible allocation that is envy-free (EF), in which the agents do not envy each other for the items they receive, nor do they envy a charity, who is endowed with all the unallocated items. Since EF allocations barely exist even without budget constraints, we are interested in the relaxed notion of envy-freeness up to one item (EF1). The computation of both exact and approximate EF1 allocations remains largely open, despite a recent effort by Wu et al. (IJCAI 2021) in showing that any budget-feasible allocation that maximizes the Nash Social Welfare (NSW) is 1/4-approximate EF1. In this paper, we move one step forward by showing that for agents with identical additive valuations, a 1/2-approximate EF1 allocation can be computed in polynomial time. For the uniform-budget and two-agent cases, we propose efficient algorithms for computing an exact EF1 allocation. We also consider the large budget setting, i.e., when the item sizes are infinitesimal compared with the agents' budgets, and show that both the NSW maximizing allocation and the allocation our polynomial-time algorithm computes have an approximation close to 1 regarding EF1.
Published: 2021

12. Bayesian Persuasion in Sequential Decision-Making

Author: Gan, Jiarui, Majumdar, Rupak, Radanovic, Goran, and Singla, Adish
Subjects: Computer Science - Computer Science and Game Theory
Abstract: We study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes actions in each time step based on the current state, the principal's advice/signal, and beliefs about the external parameter. The action of the agent updates the state according to a stochastic process. The model arises naturally in many applications, e.g., an app (the principal) can advice the user (the agent) on possible choices between actions based on additional real-time information the app has. We study the problem of designing a signaling strategy from the principal's point of view. We show that the principal has an optimal strategy against a myopic agent, who only optimizes their rewards locally, and the optimal strategy can be computed in polynomial time. In contrast, it is NP-hard to approximate an optimal policy against a far-sighted agent. Further, if the principal has the power to threaten the agent by not providing future signals, then we can efficiently compute a threat-based strategy. This strategy guarantees the principal's payoff as if playing against an agent who is far-sighted but myopic to future signals.
Published: 2021

13. Your College Dorm and Dormmates: Fair Resource Sharing with Externalities

Author: Gan, Jiarui, Li, Bo, and Li, Yingkai
Subjects: Computer Science - Computer Science and Game Theory
Abstract: We study a fair resource sharing problem, where a set of resources are to be shared among a group of agents. Each agent demands one resource and each resource can serve a limited number of agents. An agent cares about what resource they get as well as the externalities imposed by their mates, who share the same resource with them. Clearly, the strong notion of envy-freeness, where no agent envies another for their resource or mates, cannot always be achieved and we show that even deciding the existence of such a strongly envy-free assignment is an intractable problem. Hence, a more interesting question is whether (and in what situations) a relaxed notion of envy-freeness, the Pareto envy-freeness, can be achieved. Under this relaxed notion, an agent envies another only when they envy both the resource and the mates of the other agent. In particular, we are interested in a dorm assignment problem, where students are to be assigned to dorms with the same capacity and they have dichotomous preference over their dormmates. We show that when the capacity of each dorm is 2, a Pareto envy-free assignment always exists and we present a polynomial-time algorithm to compute such an assignment. Nevertheless, the result breaks immediately when the capacity increases to 3, in which case even Pareto envy-freeness cannot be guaranteed. In addition to the existential results, we also investigate the utility guarantees of (Pareto) envy-free assignments in our model., Comment: accepted in JAIR 2023
Published: 2020
Full Text: View/download PDF

14. Budget-feasible Maximum Nash Social Welfare Allocation is Almost Envy-free

Author: Wu, Xiaowei, Li, Bo, and Gan, Jiarui
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems
Abstract: The Nash social welfare (NSW) is a well-known social welfare measurement that balances individual utilities and the overall efficiency. In the context of fair allocation of indivisible goods, it has been shown by Caragiannis et al. (EC 2016 and TEAC 2019) that an allocation maximizing the NSW is envy-free up to one good (EF1). In this paper, we are interested in the fairness of the NSW in a budget-feasible allocation problem, in which each item has a cost that will be incurred to the agent it is allocated to, and each agent has a budget constraint on the total cost of items she receives. We show that a budget-feasible allocation that maximizes the NSW achieves a 1/4-approximation of EF1 and the approximation ratio is tight. The approximation ratio improves gracefully when the items have small costs compared with the agents' budgets; it converges to 1/2 when the budget-cost ratio approaches infinity., Comment: 16 pages, 2 figures
Published: 2020

15. Optimally Deceiving a Learning Leader in Stackelberg Games

Author: Birmpas, Georgios, Gan, Jiarui, Hollender, Alexandros, Marmolejo-Cossío, Francisco J., Rajgopal, Ninad, and Voudouris, Alexandros A.
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Data Structures and Algorithms, Computer Science - Machine Learning
Abstract: Recent results in the ML community have revealed that learning algorithms used to compute the optimal strategy for the leader to commit to in a Stackelberg game, are susceptible to manipulation by the follower. Such a learning algorithm operates by querying the best responses or the payoffs of the follower, who consequently can deceive the algorithm by responding as if his payoffs were much different than what they actually are. For this strategic behavior to be successful, the main challenge faced by the follower is to pinpoint the payoffs that would make the learning algorithm compute a commitment so that best responding to it maximizes the follower's utility, according to his true payoffs. While this problem has been considered before, the related literature only focused on the simplified scenario in which the payoff space is finite, thus leaving the general version of the problem unanswered. In this paper, we fill in this gap, by showing that it is always possible for the follower to compute (near-)optimal payoffs for various scenarios about the learning interaction between leader and follower.
Published: 2020
Full Text: View/download PDF

16. Decentralized leadership and follower deception in Stackelberg games

Author: Gan, Jiarui, Elkind, Edith, and Wooldridge, Michael
Subjects: 519.3, Algorithmic game theory
Abstract: This thesis focuses on two aspects of Stackelberg games - decentralized leadership and follower deception - that stem from reasoning about strategic interactions at a higher level in Stackelberg games. In the first part of the thesis, we study decentralized leadership in Stackelberg games. We focus on a variant of Stackelberg security games that involves multiple leaders, in which the leaders allocate their security resources to protect a set of targets against an attacker; the attacker acts as the follower in the game, surveying the leaders' strategies and responding rationally afterwards. We aim to understand the game under decentralized leadership and explore ways to coordinate the leaders. To this end, firstly, we propose a novel equilibrium concept to describe the outcome of the game, and we analyze the existence of an equilibrium and the complexity of computing it. Then, we take a mechanism design approach to coordinate the leaders, aiming to design a coordination mechanism that satisfies several natural properties concerning efficiency, incentives, strategyproofness, and stability of strategies it generates. We obtain impossibility results showing that certain combinations of these properties cannot be achieved by any mechanism. On top of that we also present mechanisms for property combinations that are not blocked by the impossibility results. The second part of the thesis is motivated by the finding that, in a Stackelberg game, the follower can benefit from changing the leader's belief about the game. This can typically happen when the follower is able to tamper with the leader's attempt of information gathering: for instance, when the leader learns the follower's type by interacting with the follower, the follower can imitate the behavior of a different follower type, pretending that their payoffs are different than the actual ones. A leader who ignores such strategic manipulation will then be misled into playing optimally against the imitated follower type, which may correspond to a highly suboptimal strategy in the original game. We study the strategic interactions resulting from this deceptive behavior, both from the leader's and the follower's perspectives. From the leader's perspective, we propose a policy-based approach to mitigate potential losses due to follower deception and study the associated problem of computing the optimal policy. From the follower's perspective, we study the problem of computing the optimal strategy to deceive the leader. Our results provide an almost complete picture of the complexity landscape of these problems in various settings. It is shown that the problems facing the leader are hard in general, whereby we derive inapproximability bounds and also design algorithms that achieve matching approximation ratios; whereas the problems facing the follower are tractable, whereby we design polynomial-time algorithms to compute optimal (or near-optimal) deception strategies for the follower. Our work sheds light on strategic interactions that may arise in Stackelberg games but are not captured by existing solution concepts. It provides frameworks to study these interactions and contributes to the understanding of them through the lens of computation.
Published: 2020

17. Swap Stability in Schelling Games on Graphs

Author: Agarwal, Aishwarya, Elkind, Edith, Gan, Jiarui, and Voudouris, Alexandros A.
Subjects: Computer Science - Computer Science and Game Theory
Abstract: We study a recently introduced class of strategic games that is motivated by and generalizes Schelling's well-known residential segregation model. These games are played on undirected graphs, with the set of agents partitioned into multiple types; each agent either occupies a node of the graph and never moves away or aims to maximize the fraction of her neighbors who are of her own type. We consider a variant of this model that we call swap Schelling games, where the number of agents is equal to the number of nodes of the graph, and agents may {\em swap} positions with other agents to increase their utility. We study the existence, computational complexity and quality of equilibrium assignments in these games, both from a social welfare perspective and from a diversity perspective., Comment: AAAI 2020
Published: 2019

18. Protecting Elections by Recounting Ballots

Author: Elkind, Edith, Gan, Jiarui, Obraztsova, Svetlana, Rabinovich, Zinovi, and Voudouris, Alexandros A.
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Multiagent Systems
Abstract: Complexity of voting manipulation is a prominent topic in computational social choice. In this work, we consider a two-stage voting manipulation scenario. First, a malicious party (an attacker) attempts to manipulate the election outcome in favor of a preferred candidate by changing the vote counts in some of the voting districts. Afterwards, another party (a defender), which cares about the voters' wishes, demands a recount in a subset of the manipulated districts, restoring their vote counts to their original values. We investigate the resulting Stackelberg game for the case where votes are aggregated using two variants of the Plurality rule, and obtain an almost complete picture of the complexity landscape, both from the attacker's and from the defender's perspective.
Published: 2019

19. Manipulating a Learning Defender and Ways to Counteract

Author: Gan, Jiarui, Guo, Qingyu, Tran-Thanh, Long, An, Bo, and Wooldridge, Michael
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Machine Learning
Abstract: In Stackelberg security games when information about the attacker's payoffs is uncertain, algorithms have been proposed to learn the optimal defender commitment by interacting with the attacker and observing their best responses. In this paper, we show that, however, these algorithms can be easily manipulated if the attacker responds untruthfully. As a key finding, attacker manipulation normally leads to the defender learning a maximin strategy, which effectively renders the learning attempt meaningless as to compute a maximin strategy requires no additional information about the other player at all. We then apply a game-theoretic framework at a higher level to counteract such manipulation, in which the defender commits to a policy that specifies her strategy commitment according to the learned information. We provide a polynomial-time algorithm to compute the optimal such policy, and in addition, a heuristic approach that applies even when the attacker's payoff space is infinite or completely unknown. Empirical evaluation shows that our approaches can improve the defender's utility significantly as compared to the situation when attacker manipulation is ignored., Comment: The paper appears in NeurIPS 2019
Published: 2019

20. Envy-Freeness in House Allocation Problems

Author: Gan, Jiarui, Suksompong, Warut, and Voudouris, Alexandros A.
Subjects: Computer Science - Computer Science and Game Theory
Abstract: We consider the house allocation problem, where $m$ houses are to be assigned to $n$ agents so that each agent gets exactly one house. We present a polynomial-time algorithm that determines whether an envy-free assignment exists, and if so, computes one such assignment. We also show that an envy-free assignment exists with high probability if the number of houses exceeds the number of agents by a logarithmic factor.
Published: 2019
Full Text: View/download PDF

21. Imitative Follower Deception in Stackelberg Games

Author: Gan, Jiarui, Xu, Haifeng, Guo, Qingyu, Tran-Thanh, Long, Rabinovich, Zinovi, and Wooldridge, Michael
Subjects: Computer Science - Computer Science and Game Theory
Abstract: Information uncertainty is one of the major challenges facing applications of game theory. In the context of Stackelberg games, various approaches have been proposed to deal with the leader's incomplete knowledge about the follower's payoffs, typically by gathering information from the leader's interaction with the follower. Unfortunately, these approaches rely crucially on the assumption that the follower will not strategically exploit this information asymmetry, i.e., the follower behaves truthfully during the interaction according to their actual payoffs. As we show in this paper, the follower may have strong incentives to deceitfully imitate the behavior of a different follower type and, in doing this, benefit significantly from inducing the leader into choosing a highly suboptimal strategy. This raises a fundamental question: how to design a leader strategy in the presence of a deceitful follower? To answer this question, we put forward a basic model of Stackelberg games with (imitative) follower deception and show that the leader is indeed able to reduce the loss due to follower deception with carefully designed policies. We then provide a systematic study of the problem of computing the optimal leader policy and draw a relatively complete picture of the complexity landscape; essentially matching positive and negative complexity results are provided for natural variants of the model. Our intractability results are in sharp contrast to the situation with no deception, where the leader's optimal strategy can be computed in polynomial time, and thus illustrate the intrinsic difficulty of handling follower deception. Through simulations we also examine the benefit of considering follower deception in randomly generated games.
Published: 2019

22. Schelling Games on Graphs

Author: Elkind, Edith, Gan, Jiarui, Igarashi, Ayumi, Suksompong, Warut, and Voudouris, Alexandros A.
Subjects: Computer Science - Computer Science and Game Theory
Abstract: We consider strategic games that are inspired by Schelling's model of residential segregation. In our model, the agents are partitioned into k types and need to select locations on an undirected graph. Agents can be either stubborn, in which case they will always choose their preferred location, or strategic, in which case they aim to maximize the fraction of agents of their own type in their neighborhood. We investigate the existence of equilibria in these games, study the complexity of finding an equilibrium outcome or an outcome with high social welfare, and also provide upper and lower bounds on the price of anarchy and stability. Some of our results extend to the setting where the preferences of the agents over their neighbors are defined by a social network rather than a partition into types.
Published: 2019
Full Text: View/download PDF

23. On the Inducibility of Stackelberg Equilibrium for Security Games

Author: Guo, Qingyu, Gan, Jiarui, Fang, Fei, Tran-Thanh, Long, Tambe, Milind, and An, Bo
Subjects: Computer Science - Computer Science and Game Theory
Abstract: Strong Stackelberg equilibrium (SSE) is the standard solution concept of Stackelberg security games. As opposed to the weak Stackelberg equilibrium (WSE), the SSE assumes that the follower breaks ties in favor of the leader and this is widely acknowledged and justified by the assertion that the defender can often induce the attacker to choose a preferred action by making an infinitesimal adjustment to her strategy. Unfortunately, in security games with resource assignment constraints, the assertion might not be valid; it is possible that the defender cannot induce the desired outcome. As a result, many results claimed in the literature may be overly optimistic. To remedy, we first formally define the utility guarantee of a defender strategy and provide examples to show that the utility of SSE can be higher than its utility guarantee. Second, inspired by the analysis of leader's payoff by Von Stengel and Zamir (2004), we provide the solution concept called the inducible Stackelberg equilibrium (ISE), which owns the highest utility guarantee and always exists. Third, we show the conditions when ISE coincides with SSE and the fact that in general case, SSE can be extremely worse with respect to utility guarantee. Moreover, introducing the ISE does not invalidate existing algorithmic results as the problem of computing an ISE polynomially reduces to that of computing an SSE. We also provide an algorithmic implementation for computing ISE, with which our experiments unveil the empirical advantage of the ISE over the SSE., Comment: The Thirty-Third AAAI Conference on Artificial Intelligence
Published: 2018

24. Defense coordination in security games: Equilibrium analysis and mechanism design

Author: Gan, Jiarui, Elkind, Edith, Kraus, Sarit, and Wooldridge, Michael
Published: 2022
Full Text: View/download PDF

25. Schelling games on graphs

Author: Agarwal, Aishwarya, Elkind, Edith, Gan, Jiarui, Igarashi, Ayumi, Suksompong, Warut, and Voudouris, Alexandros A.
Published: 2021
Full Text: View/download PDF

26. Protecting elections by recounting ballots

Author: Elkind, Edith, Gan, Jiarui, Obraztsova, Svetlana, Rabinovich, Zinovi, and Voudouris, Alexandros A.
Published: 2021
Full Text: View/download PDF

27. Pore characteristics and mechanical behaviors of autoclaved aerated concrete produced from coal bottom ash and tailings.

Author: Zhang, Mingyuan, Yao, Huayan, Liu, Yuting, Gan, Jiarui, and Tang, Lingxiao
Subjects: AIR-entrained concrete, COAL ash, PORE size distribution, SOLID waste, COAL mine waste, SURFACE cracks, COMPRESSIVE strength
Abstract: The solid wastes such as coal bottom ash and tailings are currently widely used in various building materials, but further research on their application in autoclaved aerated concrete (AAC) is needed. To study the physical and mechanical properties of AAC produced from coal bottom ash and tailings, the pore characteristics of two different density grades of AAC were extracted and analyzed, and wave velocity and uniaxial compressive tests under water‐saturated and dry conditions were carried out. The results show that the AAC produced from solid waste has excellent performance. The specimen is anisotropic, and the pore size distribution of its cross‐section is mainly concentrated in the ranges of <0.1, 0.1–0.5, and 0.5–1.0 mm. The pore size perpendicular to the direction of rise is smaller and the pores number is greater. The wave velocity and uniaxial compressive strength of the test specimens in the water‐saturated state are significantly lower than those in the dry state. The strengths of specimens loaded perpendicular to the direction of rise are higher than those loaded parallel to the direction of rise whether water‐saturated or dry. The failure mode of the water‐saturated specimens are different from those of dry specimens, which is manifested in the act that there are fewer surface cracks when the specimens fail in the water‐saturated state, while there are more surface cracks in the dry specimens, and their morphology is irregular. The physicochemical effect between water and components and the mechanical effect of pore water are the reasons for the deterioration of AAC's mechanical properties. The results help to reveal the performance of AAC produced from solid waste and have good practical significance. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

28. Robust Stackelberg Equilibria

Author: Gan, Jiarui, primary, Han, Minbiao, additional, Wu, Jibang, additional, and Xu, Haifeng, additional
Published: 2023
Full Text: View/download PDF

29. Online Reinforcement Learning with Uncertain Episode Lengths

Author: Mandal, Debmalya, primary, Radanovic, Goran, additional, Gan, Jiarui, additional, Singla, Adish, additional, and Majumdar, Rupak, additional
Published: 2023
Full Text: View/download PDF

30. Markov Decision Processes with Time-Varying Geometric Discounting

Author: Gan, Jiarui, primary, Hennes, Annika, additional, Majumdar, Rupak, additional, Mandal, Debmalya, additional, and Radanovic, Goran, additional
Published: 2023
Full Text: View/download PDF

31. Sequential Principal-Agent Problems with Communication: Efficient Computation and Learning

Author: Gan, Jiarui, Majumdar, Rupak, Mandal, Debmalya, Radanovic, Goran, Gan, Jiarui, Majumdar, Rupak, Mandal, Debmalya, and Radanovic, Goran
Abstract: We study a sequential decision making problem between a principal and an agent with incomplete information on both sides. In this model, the principal and the agent interact in a stochastic environment, and each is privy to observations about the state not available to the other. The principal has the power of commitment, both to elicit information from the agent and to provide signals about her own information. The principal and the agent communicate their signals to each other, and select their actions independently based on this communication. Each player receives a payoff based on the state and their joint actions, and the environment moves to a new state. The interaction continues over a finite time horizon, and both players act to optimize their own total payoffs over the horizon. Our model encompasses as special cases stochastic games of incomplete information and POMDPs, as well as sequential Bayesian persuasion and mechanism design problems. We study both computation of optimal policies and learning in our setting. While the general problems are computationally intractable, we study algorithmic solutions under a conditional independence assumption on the underlying state-observation distributions. We present a polynomial-time algorithm to compute the principal's optimal policy up to an additive approximation. Additionally, we show an efficient learning algorithm in the case where the transition probabilities are not known beforehand. The algorithm guarantees sublinear regret for both players.
Published: 2023

32. Bayesian Persuasion in Sequential Decision-Making

Author: Gan, Jiarui, primary, Majumdar, Rupak, additional, Radanovic, Goran, additional, and Singla, Adish, additional
Published: 2022
Full Text: View/download PDF

33. Admissible Policy Teaching through Reward Design

Author: Banihashem, Kiarash, primary, Singla, Adish, additional, Gan, Jiarui, additional, and Radanovic, Goran, additional
Published: 2022
Full Text: View/download PDF

34. Sequential Decision Making With Information Asymmetry (Invited Talk)

Author: Jiarui Gan and Rupak Majumdar and Goran Radanovic and Adish Singla, Gan, Jiarui, Majumdar, Rupak, Radanovic, Goran, Singla, Adish, Jiarui Gan and Rupak Majumdar and Goran Radanovic and Adish Singla, Gan, Jiarui, Majumdar, Rupak, Radanovic, Goran, and Singla, Adish
Abstract: We survey some recent results in sequential decision making under uncertainty, where there is an information asymmetry among the decision-makers. We consider two versions of the problem: persuasion and mechanism design. In persuasion, a more-informed principal influences the actions of a less-informed agent by signaling information. In mechanism design, a less-informed principal incentivizes a more-informed agent to reveal information by committing to a mechanism, so that the principal can make more informed decisions. We define Markov persuasion processes and Markov mechanism processes that model persuasion and mechanism design into dynamic models. Then we survey results on optimal persuasion and optimal mechanism design on myopic and far-sighted agents. These problems are solvable in polynomial time for myopic agents but hard for far-sighted agents.
Published: 2022
Full Text: View/download PDF

35. Sequential Decision Making With Information Asymmetry (Invited Talk)

Author: Gan, Jiarui, Majumdar, Rupak, Radanovic, Goran, and Singla, Adish
Subjects: Theory of computation → Models of computation, Myopic agents, Automated mechanism design, Bayesian persuasion, Markov mechanism processes, Markov persuasion processes
Abstract: We survey some recent results in sequential decision making under uncertainty, where there is an information asymmetry among the decision-makers. We consider two versions of the problem: persuasion and mechanism design. In persuasion, a more-informed principal influences the actions of a less-informed agent by signaling information. In mechanism design, a less-informed principal incentivizes a more-informed agent to reveal information by committing to a mechanism, so that the principal can make more informed decisions. We define Markov persuasion processes and Markov mechanism processes that model persuasion and mechanism design into dynamic models. Then we survey results on optimal persuasion and optimal mechanism design on myopic and far-sighted agents. These problems are solvable in polynomial time for myopic agents but hard for far-sighted agents., LIPIcs, Vol. 243, 33rd International Conference on Concurrency Theory (CONCUR 2022), pages 4:1-4:18
Published: 2022
Full Text: View/download PDF

36. Optimal Coordination in Generalized Principal-Agent Problems: A Revisit and Extensions

Author: Gan, Jiarui, Han, Minbiao, Wu, Jibang, and Xu, Haifeng
Subjects: FOS: Computer and information sciences, FOS: Economics and business, Computer Science - Computer Science and Game Theory, Economics - Theoretical Economics, Theoretical Economics (econ.TH), Computer Science and Game Theory (cs.GT)
Abstract: In the principal-agent problem formulated in [Myerson 1982], agents have private information (type) and make private decisions (action), both of which are unobservable to the principal. Myerson pointed out an elegant solution that relies on the revelation principle, which states that without loss of generality optimal coordination mechanisms of this problem can be assumed to be truthful and direct. Consequently, the problem can be solved by a linear program when the support sets of the action and type spaces are finite. In this paper, we extend Myerson's results to the setting where the principal's action space might be infinite and subject to additional design constraints. This generalized principal-agent model unifies several important design problems -- including contract design, information design, and Bayesian Stackelberg games -- and encompasses them as special cases. We present a revelation principle for this general model, based on which a polynomial-time algorithm is derived for computing the optimal coordination mechanism. This algorithm not only implies new efficient algorithms simultaneously for all the aforementioned special cases but also significantly simplifies previous approaches in the literature.
Published: 2022
Full Text: View/download PDF

37. Optimally Deceiving a Learning Leader in Stackelberg Games

Author: Birmpas, Georgios, primary, Gan, Jiarui, additional, Hollender, Alexandros, additional, Marmolejo-Cossío, Francisco J., additional, Rajgopal, Ninad, additional, and Voudouris, Alexandros A., additional
Published: 2021
Full Text: View/download PDF

38. Budget-feasible Maximum Nash Social Welfare is Almost Envy-free

Author: Wu, Xiaowei, primary, Li, Bo, additional, and Gan, Jiarui, additional
Published: 2021
Full Text: View/download PDF

39. Coalition Formation in Multi-defender Security Games

Author: Mutzari, Dolev, primary, Gan, Jiarui, additional, and Kraus, Sarit, additional
Published: 2021
Full Text: View/download PDF

40. Swap Stability in Schelling Games on Graphs

Author: Agarwal, Aishwarya, primary, Elkind, Edith, additional, Gan, Jiarui, additional, and Voudouris, Alexandros, additional
Published: 2020
Full Text: View/download PDF

41. Envy-freeness in house allocation problems

Author: Gan, Jiarui, primary, Suksompong, Warut, additional, and Voudouris, Alexandros A., additional
Published: 2019
Full Text: View/download PDF

42. Schelling Games on Graphs

Author: Elkind, Edith, primary, Gan, Jiarui, additional, Igarashi, Ayumi, additional, Suksompong, Warut, additional, and Voudouris, Alexandros A., additional
Published: 2019
Full Text: View/download PDF

43. Protecting Elections by Recounting Ballots

Author: Elkind, Edith, primary, Gan, Jiarui, additional, Obraztsova, Svetlana, additional, Rabinovich, Zinovi, additional, and Voudouris, Alexandros A., additional
Published: 2019
Full Text: View/download PDF

44. On the Inducibility of Stackelberg Equilibrium for Security Games

Author: Guo, Qingyu, primary, Gan, Jiarui, additional, Fang, Fei, additional, Tran-Thanh, Long, additional, Tambe, Milind, additional, and An, Bo, additional
Published: 2019
Full Text: View/download PDF

45. Imitative Follower Deception in Stackelberg Games

Author: Gan, Jiarui, primary, Xu, Haifeng, additional, Guo, Qingyu, additional, Tran-Thanh, Long, additional, Rabinovich, Zinovi, additional, and Wooldridge, Michael, additional
Published: 2019
Full Text: View/download PDF

46. Optimal Electric Vehicle Fast Charging Station Placement Based on Game Theoretical Framework

Author: Xiong, Yanhai, primary, Gan, Jiarui, additional, An, Bo, additional, Miao, Chunyan, additional, and Bazzan, Ana L. C., additional
Published: 2018
Full Text: View/download PDF

47. Optimal Escape Interdiction on Transportation Networks

Author: Zhang, Youzhi, primary, An, Bo, additional, Tran-Thanh, Long, additional, Wang, Zhen, additional, Gan, Jiarui, additional, and Jennings, Nicholas R., additional
Published: 2017
Full Text: View/download PDF

48. Game-Theoretic Considerations for Optimizing Taxi System Efficiency

Author: Gan, Jiarui, primary and An, Bo, additional
Published: 2017
Full Text: View/download PDF

49. Security Games on a Plane

Author: Gan, Jiarui, primary, An, Bo, additional, Vorobeychik, Yevgeniy, additional, and Gauch, Brian, additional
Published: 2017
Full Text: View/download PDF

50. Game Theoretic Considerations for Optimizing Efficiency of Taxi Systems

Author: Gan, Jiarui, primary and An, Bo, additional
Published: 2017
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

95 results on '"Gan, Jiarui"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources