Author: "Wojtczak, Dominik" / Topic: computer science - logic in computer science - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Wojtczak, Dominik"' showing total 17 results

1. Omega-Regular Decision Processes

Author: Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh, and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science, Computer Science - Machine Learning
Abstract: Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback). While RDPs enable intuitive and succinct representation of non-Markovian decision processes, their expressive power coincides with finite-state Markov decision processes (MDPs). We introduce omega-regular decision processes (ODPs) where the non-Markovian aspect of the transition and reward functions is extended to an omega-regular lookahead over the system evolution. Semantically, these lookaheads can be considered as promises made by the decision maker or the learning agent about her future behavior. In particular, we assume that, if the promised lookaheads are not met, then the payoff to the decision maker is $\bot$ (least desirable payoff), overriding any rewards collected by the decision maker. We enable optimization and learning for ODPs under the discounted-reward objective by reducing them to lexicographic optimization and learning over finite MDPs. We present experimental results demonstrating the effectiveness of the proposed reduction.
Published: 2023

2. Alternating Good-for-MDP Automata

Author: Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh, and Wojtczak, Dominik
Subjects: Computer Science - Formal Languages and Automata Theory, Computer Science - Artificial Intelligence, Computer Science - Logic in Computer Science
Abstract: When omega-regular objectives were first proposed in model-free reinforcement learning (RL) for controlling MDPs, deterministic Rabin automata were used in an attempt to provide a direct translation from their transitions to scalar values. While these translations failed, it has turned out that it is possible to repair them by using good-for-MDPs (GFM) B\"uchi automata instead. These are nondeterministic B\"uchi automata with a restricted type of nondeterminism, albeit not as restricted as in good-for-games automata. Indeed, deterministic Rabin automata have a pretty straightforward translation to such GFM automata, which is bi-linear in the number of states and pairs. Interestingly, the same cannot be said for deterministic Streett automata: a translation to nondeterministic Rabin or B\"uchi automata comes at an exponential cost, even without requiring the target automaton to be good-for-MDPs. Do we have to pay more than that to obtain a good-for-MDP automaton? The surprising answer is that we have to pay significantly less when we instead expand the good-for-MDP property to alternating automata: like the nondeterministic GFM automata obtained from deterministic Rabin automata, the alternating good-for-MDP automata we produce from deterministic Streett automata are bi-linear in the size of the deterministic automaton and its index, and can therefore be exponentially more succinct than minimal nondeterministic B\"uchi automata.
Published: 2022

3. Mungojerrie: Reinforcement Learning of Linear-Time Objectives

Author: Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh, and Wojtczak, Dominik
Subjects: Computer Science - Machine Learning, Computer Science - Logic in Computer Science, Electrical Engineering and Systems Science - Systems and Control
Abstract: Reinforcement learning synthesizes controllers without prior knowledge of the system. At each timestep, a reward is given. The controllers optimize the discounted sum of these rewards. Applying this class of algorithms requires designing a reward scheme, which is typically done manually. The designer must ensure that their intent is accurately captured. This may not be trivial, and is prone to error. An alternative to this manual programming, akin to programming directly in assembly, is to specify the objective in a formal language and have it "compiled" to a reward scheme. Mungojerrie (https://plv.colorado.edu/mungojerrie/) is a tool for testing reward schemes for $\omega$-regular objectives on finite models. The tool contains reinforcement learning algorithms and a probabilistic model checker. Mungojerrie supports models specified in PRISM and $\omega$-automata specified in HOA., Comment: Mungojerrie is available at https://plv.colorado.edu/mungojerrie/
Published: 2021
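
The abstract of record 3 refers to the discounted sum of per-timestep rewards. As a minimal illustration of that quantity only (not code from Mungojerrie; the function name and the finite reward list are assumptions made for this sketch), the discounted sum of a finite reward sequence can be computed as follows:

```python
def discounted_return(rewards, gamma):
    """Return sum over t of gamma**t * rewards[t] for a finite sequence.

    Hypothetical helper for illustration; `rewards` is a list of numbers
    and `gamma` is a discount factor, typically in [0, 1).
    """
    total = 0.0
    # Work backwards so each step applies one more factor of gamma
    # to everything that comes later in the sequence.
    for reward in reversed(rewards):
        total = reward + gamma * total
    return total


if __name__ == "__main__":
    print(discounted_return([1, 1, 1], 0.5))
```

With a discount factor below 1, later rewards count for less, which is why a reward scheme must be designed carefully if it is to capture a long-run objective.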

4. Model-free Reinforcement Learning for Branching Markov Decision Processes

Author: Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh, and Wojtczak, Dominik
Subjects: Computer Science - Machine Learning, Computer Science - Logic in Computer Science, Electrical Engineering and Systems Science - Systems and Control
Abstract: We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs). The state of a (discrete-time) BMC is a collection of entities of various types that, while spawning other entities, generate a payoff. In comparison with BMCs, where the evolution of each entity of the same type follows the same probabilistic pattern, BMDPs allow an external controller to pick from a range of options. This permits us to study the best/worst behaviour of the system. We generalise model-free reinforcement learning techniques to compute an optimal control strategy of an unknown BMDP in the limit. We present results of an implementation that demonstrate the practicality of the approach., Comment: to appear in CAV 2021
Published: 2021

5. Simple Stochastic Games with Almost-Sure Energy-Parity Objectives are in NP and coNP

Author: Mayr, Richard, Schewe, Sven, Totzke, Patrick, and Wojtczak, Dominik
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Logic in Computer Science
Abstract: We study stochastic games with energy-parity objectives, which combine quantitative rewards with a qualitative $\omega$-regular condition: The maximizer aims to avoid running out of energy while simultaneously satisfying a parity condition. We show that the corresponding almost-sure problem, i.e., checking whether there exists a maximizer strategy that achieves the energy-parity objective with probability $1$ when starting at a given energy level $k$, is decidable and in $NP \cap coNP$. The same holds for checking if such a $k$ exists and if a given $k$ is minimal.
Published: 2021

6. Open Problems in a Logic of Gossips

Author: Apt, Krzysztof R. and Wojtczak, Dominik
Subjects: Computer Science - Artificial Intelligence, Computer Science - Logic in Computer Science
Abstract: Gossip protocols are programs used in a setting in which each agent holds a secret and the aim is to reach a situation in which all agents know all secrets. Such protocols rely on point-to-point or group communication. Distributed epistemic gossip protocols use epistemic formulas in the component programs for the agents. The advantage of the use of epistemic logic is that the resulting protocols are very concise and amenable to simple verification. Recently, we introduced a natural modal logic that allows one to express distributed epistemic gossip protocols and to reason about their correctness. We proved that the resulting protocols are implementable and that all aspects of their correctness, including termination, are decidable. To establish these results we showed that both the definition of semantics and of truth of the underlying logic are decidable. We also showed that the analogous results hold for an extension of this logic with the 'common knowledge' operator. However, several, often deceptively simple, questions about this logic and the corresponding gossip protocols remain open. The purpose of this paper is to list and elucidate these questions and provide appropriate background information for them in the form of partial or related results., Comment: In Proceedings TARK 2019, arXiv:1907.08335
Published: 2019
Full Text: View/download PDF
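
The gossip setting described in record 6 (each agent starts with one secret, and the goal is that every agent knows every secret) can be sketched with a plain point-to-point simulation. This is only an illustration of the setting, not of the epistemic protocols or the modal logic studied in the paper; the function names and the assumption that a call lets both participants exchange everything they know are choices made for this sketch.

```python
def run_calls(n_agents, calls):
    """Simulate point-to-point calls; return the set of secrets each agent knows.

    Assumption for this sketch: agent i initially knows only secret i, and
    during a call (a, b) both agents learn everything the other knows.
    """
    known = [{i} for i in range(n_agents)]
    for a, b in calls:
        merged = known[a] | known[b]
        known[a] = merged
        known[b] = set(merged)  # separate copy so later updates stay independent
    return known


def all_know_all(known):
    """Check the goal condition: every agent knows every secret."""
    everything = set(range(len(known)))
    return all(k == everything for k in known)


if __name__ == "__main__":
    schedule = [(0, 1), (2, 3), (0, 2), (1, 3)]
    print(all_know_all(run_calls(4, schedule)))
```

Here the call schedule is fixed in advance. In a distributed epistemic protocol, by contrast, each agent decides whether to call based on a guard over what it knows.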

7. Omega-Regular Objectives in Model-Free Reinforcement Learning

Author: Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh, and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science, Computer Science - Machine Learning, Statistics - Machine Learning
Abstract: We provide the first solution for model-free reinforcement learning of $\omega$-regular objectives for Markov decision processes (MDPs). We present a constructive reduction from the almost-sure satisfaction of $\omega$-regular objectives to an almost-sure reachability problem and extend this technique to learning how to control an unknown model so that the chance of satisfying the objective is maximized. A key feature of our technique is the compilation of $\omega$-regular properties into limit-deterministic Büchi automata instead of the traditional Rabin automata; this choice sidesteps difficulties that have marred previous proposals. Our approach allows us to apply model-free, off-the-shelf reinforcement learning algorithms to compute optimal strategies from the observations of the MDP. We present an experimental evaluation of our technique on benchmark learning problems., Comment: 16 pages, 3 figures
Published: 2018

8. Common Knowledge in a Logic of Gossips

Author: Apt, Krzysztof R. and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science, Computer Science - Artificial Intelligence
Abstract: Gossip protocols aim at arriving, by means of point-to-point or group communications, at a situation in which all the agents know each other's secrets. Recently a number of authors studied distributed epistemic gossip protocols. These protocols use as guards formulas from a simple epistemic logic, which makes their analysis and verification substantially easier. We study here common knowledge in the context of such a logic. First, we analyze when it can be reduced to iterated knowledge. Then we show that the semantics and truth for formulas without nested common knowledge operator are decidable. This implies that implementability, partial correctness and termination of distributed epistemic gossip protocols that use non-nested common knowledge operator are decidable as well. Given that common knowledge is equivalent to an infinite conjunction of nested knowledge, these results are non-trivial generalizations of the corresponding decidability results for the original epistemic logic, established in (Apt & Wojtczak, 2016). K. R. Apt & D. Wojtczak (2016): On Decidability of a Logic of Gossips. In Proc. of JELIA 2016, pp. 18-33, doi:10.1007/978-3-319-48758-8_2., Comment: In Proceedings TARK 2017, arXiv:1707.08250
Published: 2017
Full Text: View/download PDF

9. Optimal Control for Multi-Mode Systems with Discrete Costs

Author: Mousa, Mahmoud A. A., Schewe, Sven, and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science, Computer Science - Systems and Control
Abstract: This paper studies optimal time-bounded control in multi-mode systems with discrete costs. Multi-mode systems are an important subclass of linear hybrid systems, in which there are no guards on transitions and all invariants are global. Each state has a continuous cost attached to it, which is linear in the sojourn time, while a discrete cost is attached to each transition taken. We show that an optimal control for this model can be computed in NEXPTIME and approximated in PSPACE. We also show that the one-dimensional case is simpler: although the problem is NP-complete (and in LOGSPACE for an infinite time horizon), we develop an FPTAS for finding an approximate solution., Comment: extended version of a FORMATS 2017 paper
Published: 2017

10. On Strong Determinacy of Countable Stochastic Games

Author: Kiefer, Stefan, Mayr, Richard, Shirmohammadi, Mahsa, and Wojtczak, Dominik
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Logic in Computer Science, 91A15, G.3
Abstract: We study 2-player turn-based perfect-information stochastic games with countably infinite state space. The players aim at maximizing/minimizing the probability of a given event (i.e., measurable set of infinite plays), such as reachability, B\"uchi, omega-regular or more general objectives. These games are known to be weakly determined, i.e., they have value. However, strong determinacy of threshold objectives (given by an event and a threshold $c \in [0,1]$) was open in many cases: is it always the case that the maximizer or the minimizer has a winning strategy, i.e., one that enforces, against all strategies of the other player, that the objective is satisfied with probability $\ge c$ (resp. $< c$)? We show that almost-sure objectives (where $c=1$) are strongly determined. This vastly generalizes a previous result on finite games with almost-sure tail objectives. On the other hand we show that $\ge 1/2$ (co-)B\"uchi objectives are not strongly determined, not even if the game is finitely branching. Moreover, for almost-sure reachability and almost-sure B\"uchi objectives in finitely branching games, we strengthen strong determinacy by showing that one of the players must have a memoryless deterministic (MD) winning strategy., Comment: 13 pages
Published: 2017

11. Parity Objectives in Countable MDPs

Author: Kiefer, Stefan, Mayr, Richard, Shirmohammadi, Mahsa, and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science
Abstract: We study countably infinite MDPs with parity objectives, and special cases with a bounded number of colors in the Mostowski hierarchy (including reachability, safety, Büchi and co-Büchi). In finite MDPs there always exist optimal memoryless deterministic (MD) strategies for parity objectives, but this does not generally hold for countably infinite MDPs. In particular, optimal strategies need not exist. For countably infinite MDPs, we provide a complete picture of the memory requirements of optimal (resp., $\epsilon$-optimal) strategies for all objectives in the Mostowski hierarchy. In particular, there is a strong dichotomy between two different types of objectives. For the first type, optimal strategies, if they exist, can be chosen MD, while for the second type optimal strategies require infinite memory. (I.e., for all objectives in the Mostowski hierarchy, if finite-memory randomized strategies suffice then also MD strategies suffice.) Similarly, some objectives admit $\epsilon$-optimal MD-strategies, while for others $\epsilon$-optimal strategies require infinite memory. Such a dichotomy also holds for the subclass of countably infinite MDPs that are finitely branching, though more objectives admit MD-strategies here.
Published: 2017

12. An Ordered Approach to Solving Parity Games in Quasi Polynomial Time and Quasi Linear Space

Author: Fearnley, John, Jain, Sanjay, Schewe, Sven, Stephan, Frank, and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science
Abstract: Parity games play an important role in model checking and synthesis. In their paper, Calude et al. have shown that these games can be solved in quasi-polynomial time. We show that their algorithm can be implemented efficiently: we use their data structure as a progress measure, allowing for a backward implementation instead of a complete unravelling of the game. To achieve this, a number of changes have to be made to their techniques, where the main one is to add power to the antagonistic player that allows for determining her rational move without changing the outcome of the game. We provide a first implementation for a quasi-polynomial algorithm, test it on small examples, and provide a number of side results, including minor algorithmic improvements, a quasi bi-linear complexity in the number of states and edges for a fixed number of colours, and matching lower bounds for the algorithm of Calude et al.
Published: 2017

13. MDPs with Energy-Parity Objectives

Author: Mayr, Richard, Schewe, Sven, Totzke, Patrick, and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science
Abstract: Energy-parity objectives combine $\omega$-regular with quantitative objectives of reward MDPs. The controller needs to avoid running out of energy while satisfying a parity objective. We refute the common belief that, if an energy-parity objective holds almost-surely, then this can be realised by some finite memory strategy. We provide a surprisingly simple counterexample that only uses coB\"uchi conditions. We introduce the new class of bounded (energy) storage objectives that, when combined with parity objectives, preserve the finite memory property. Based on these, we show that almost-sure and limit-sure energy-parity objectives, as well as almost-sure and limit-sure storage parity objectives, are in $\mathit{NP}\cap \mathit{coNP}$ and can be solved in pseudo-polynomial time for energy-parity MDPs.
Published: 2017

14. On Probabilistic Parallel Programs with Process Creation and Synchronisation

Author: Kiefer, Stefan and Wojtczak, Dominik
Subjects: Computer Science - Logic in Computer Science, Computer Science - Formal Languages and Automata Theory
Abstract: We initiate the study of probabilistic parallel programs with dynamic process creation and synchronisation. To this end, we introduce probabilistic split-join systems (pSJSs), a model for parallel programs, generalising both probabilistic pushdown systems (a model for sequential probabilistic procedural programs which is equivalent to recursive Markov chains) and stochastic branching processes (a classical mathematical model with applications in various areas such as biology, physics, and language processing). Our pSJS model allows for a possibly recursive spawning of parallel processes; the spawned processes can synchronise and return values. We study the basic performance measures of pSJSs, especially the distribution and expectation of space, work and time. Our results extend and improve previously known results on the subsumed models. We also show how to do performance analysis in practice, and present two case studies illustrating the modelling power of pSJSs., Comment: This is a technical report accompanying a TACAS'11 paper
Published: 2010

15. Decision Problems for Nash Equilibria in Stochastic Games

Author: Ummels, Michael and Wojtczak, Dominik
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Logic in Computer Science
Abstract: We analyse the computational complexity of finding Nash equilibria in stochastic multiplayer games with $\omega$-regular objectives. While the existence of an equilibrium whose payoff falls into a certain interval may be undecidable, we single out several decidable restrictions of the problem. First, restricting the search space to stationary, or pure stationary, equilibria results in problems that are typically contained in PSPACE and NP, respectively. Second, we show that the existence of an equilibrium with a binary payoff (i.e. an equilibrium where each player either wins or loses with probability 1) is decidable. We also establish that the existence of a Nash equilibrium with a certain binary payoff entails the existence of an equilibrium with the same payoff in pure, finite-state strategies., Comment: 22 pages, revised version
Published: 2009
Full Text: View/download PDF

16. The Complexity of Nash Equilibria in Simple Stochastic Multiplayer Games

Author: Ummels, Michael and Wojtczak, Dominik
Subjects: Computer Science - Computer Science and Game Theory, Computer Science - Computational Complexity, Computer Science - Logic in Computer Science
Abstract: We analyse the computational complexity of finding Nash equilibria in simple stochastic multiplayer games. We show that restricting the search space to equilibria whose payoffs fall into a certain interval may lead to undecidability. In particular, we prove that the following problem is undecidable: Given a game G, does there exist a pure-strategy Nash equilibrium of G where player 0 wins with probability 1? Moreover, this problem remains undecidable if it is restricted to strategies with (unbounded) finite memory. However, if mixed strategies are allowed, decidability remains an open problem. One way to obtain a provably decidable variant of the problem is restricting the strategies to be positional or stationary. For the complexity of these two problems, we obtain a common lower bound of NP and upper bounds of NP and PSPACE respectively., Comment: 23 pages; revised version
Published: 2009
Full Text: View/download PDF

17. Omega-Regular Objectives in Model-Free Reinforcement Learning

Author: Hahn, Ernst Moritz, Perez, Mateo, Schewe, Sven, Somenzi, Fabio, Trivedi, Ashutosh, Wojtczak, Dominik, Vojnar, Tomáš, Zhang, Lijun, and Formal Methods and Tools
Subjects: FOS: Computer and information sciences, Computer Science::Machine Learning, Computer Science - Logic in Computer Science, Computer Science - Machine Learning, 0209 industrial biotechnology, Theoretical computer science, Reduction (recursion theory), Reachability problem, Computer science, Büchi automaton, Machine Learning (stat.ML), 02 engineering and technology, Constructive, Logic in Computer Science (cs.LO), Machine Learning (cs.LG), Automaton, 020901 industrial engineering & automation, Statistics - Machine Learning, 0202 electrical engineering, electronic engineering, information engineering, Benchmark (computing), Reinforcement learning, 020201 artificial intelligence & image processing, Markov decision process
Abstract: We provide the first solution for model-free reinforcement learning of $\omega$-regular objectives for Markov decision processes (MDPs). We present a constructive reduction from the almost-sure satisfaction of $\omega$-regular objectives to an almost-sure reachability problem and extend this technique to learning how to control an unknown model so that the chance of satisfying the objective is maximized. A key feature of our technique is the compilation of $\omega$-regular properties into limit-deterministic Büchi automata instead of the traditional Rabin automata; this choice sidesteps difficulties that have marred previous proposals. Our approach allows us to apply model-free, off-the-shelf reinforcement learning algorithms to compute optimal strategies from the observations of the MDP. We present an experimental evaluation of our technique on benchmark learning problems., Comment: 16 pages, 3 figures
Published: 2019
Full Text: View/download PDF
