Author: "Ryan, Murray" / Publisher: ieee - Searchworks@Jio Institute Digital Library Search Results

Your search for "Ryan, Murray" returned 3 results.



1. Modelling uncertainty in reinforcement learning

Author: Michele Palladino and Ryan Murray
Subjects: Computer Science::Machine Learning, Computer science, Control (management), Reinforcement learning, State (computer science), Artificial intelligence, Markov decision process, Measure (mathematics), Thompson sampling
Abstract: This paper discusses the model problem presented in "A model for system uncertainty in reinforcement learning", Systems and Control Letters, 2018, for certain tasks in reinforcement learning. The model provides a framework to deal with situations in which the system dynamics is not known and encodes the available information about the state dynamics as a measure on the space of functions. Such a measure is updated in time, taking into account all the previous measurements of the state variable and extracting new information from them. Here we will mainly focus on the differences between the present model and central algorithms used in reinforcement learning (i.e. value iteration and Thompson sampling).
Published: 2019

2. Distributed Gradient Descent: Nonconvergence to Saddle Points and the Stable-Manifold Theorem

Author: Ryan Murray, H. Vincent Poor, Brian Swenson, and Soummya Kar
Subjects: FOS: Computer and information sciences, 0209 industrial biotechnology, Dynamical systems theory, 020206 networking & telecommunications, Stable manifold theorem, 02 engineering and technology, Stable manifold, Maxima and minima, 020901 industrial engineering & automation, Optimization and Control (math.OC), Saddle point, Convergence (routing), FOS: Mathematics, 0202 electrical engineering, electronic engineering, information engineering, Applied mathematics, Computer Science - Multiagent Systems, Almost surely, Gradient descent, Mathematics - Optimization and Control, Multiagent Systems (cs.MA), Mathematics
Abstract: The paper studies a distributed gradient descent (DGD) process and considers the problem of showing that in nonconvex optimization problems, DGD typically converges to local minima rather than saddle points. The paper considers unconstrained minimization of a smooth objective function. In centralized settings, the problem of demonstrating nonconvergence to saddle points of gradient descent (and variants) is typically handled by way of the stable-manifold theorem from classical dynamical systems theory. However, the classical stable-manifold theorem is not applicable in distributed settings. The paper develops an appropriate stable-manifold theorem for DGD showing that convergence to saddle points may only occur from a low-dimensional stable manifold. Under appropriate assumptions (e.g., coercivity), this result implies that DGD typically converges to local minima and not to saddle points.
Published: 2019

3. Best-Response Dynamics in Continuous Potential Games: Non-Convergence to Saddle Points

Author: Ryan Murray, Soummya Kar, H. Vincent Poor, and Brian Swenson
Subjects: Computer Science::Computer Science and Game Theory, 0209 industrial biotechnology, 020208 electrical & electronic engineering, Stable manifold theorem, Context (language use), 02 engineering and technology, Function (mathematics), 020901 industrial engineering & automation, Nash equilibrium, Best response, Saddle point, Convergence (routing), 0202 electrical engineering, electronic engineering, information engineering, Applied mathematics, Saddle, Mathematics
Abstract: The paper studies properties of best-response (BR) dynamics in potential games with continuous action sets. It is known that BR dynamics converge to the set of Nash equilibria (NE) in potential games. The set of NE in potential games is composed of local maximizers and saddle points of the potential function. The paper studies non-convergence of BR dynamics to saddle points of the potential function. Under relatively mild assumptions it is shown that BR dynamics may only converge to an interior saddle-point from a measure-zero set of initial conditions. This provides a weak stable manifold theorem in this context.
Published: 2018
