Author: "Akrour, Riad" / Publisher: ieee - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Akrour, Riad"' showing total 6 results

Start Over Author "Akrour, Riad" Publisher ieee

6 results on '"Akrour, Riad"'

1. Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts.

Author: Akrour, Riad, Tateo, Davide, and Peters, Jan
Subjects: *ACTIVE learning, *NONLINEAR functions, *MACHINE learning, *REINFORCEMENT learning, *APPROXIMATION algorithms
Abstract: Reinforcement learning (RL) has demonstrated its ability to solve high dimensional tasks by leveraging non-linear function approximators. However, these successes are mostly achieved by ’black-box’ policies in simulated domains. When deploying RL to the real world, several concerns regarding the use of a ’black-box’ policy might be raised. In order to make the learned policies more transparent, we propose in this paper a policy iteration scheme that retains a complex function approximator for its internal value predictions but constrains the policy to have a concise, hierarchical, and human-readable structure, based on a mixture of interpretable experts. Each expert selects a primitive action according to a distance to a prototypical state. A key design decision to keep such experts interpretable is to select the prototypical states from trajectory data. The main technical contribution of the paper is to address the challenges introduced by this non-differentiable prototypical state selection procedure. Experimentally, we show that our proposed algorithm can learn compelling policies on continuous action deep RL benchmarks, matching the performance of neural network based policies, but returning policies that are more amenable to human inspection than neural network or linear-in-feature policies. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

2. Reinforcement Learning Based Underwater Wireless Optical Communication Alignment for Autonomous Underwater Vehicles.

Author: Weng, Yang, Pajarinen, Joni, Akrour, Riad, Matsuda, Takumi, Peters, Jan, and Maki, Toshihiro
Subjects: AUTONOMOUS underwater vehicles, PARTIALLY observable Markov decision processes, REINFORCEMENT learning, SUBMERSIBLES, OPTICAL communications, WIRELESS communications, UNDERWATER exploration
Abstract: With the developments in underwater wireless optical communication (UWOC) technology, UWOC can be used in conjunction with autonomous underwater vehicles (AUVs) for high-speed data sharing among the vehicle formation during underwater exploration. A beam alignment problem arises during communication due to the transmission range, external disturbances and noise, and uncertainties in the AUV dynamic model. In this article, we propose an acoustic navigation method to guide the alignment process without requiring beam directors, light intensity sensors, and/or scanning algorithms as used in previous research. The AUVs need stably maintain a specific relative position and orientation for establishing an optical link. We model the alignment problem as a partially observable Markov decision process (POMDP) that takes manipulation, navigation, and energy consumption of underwater vehicles into account. However, finding an efficient policy for the POMDP under high partial observability and environmental variability is challenging. Therefore, for successful policy optimization, we utilize the soft actor–critic reinforcement learning algorithm together with AUV-specific belief updates and reward shaping based curriculum learning. Our approach outperformed baseline approaches in a simulation environment and successfully performed the beam alignment process from one AUV to another on the real AUV Tri-TON 2. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

3. Regularizing Reinforcement Learning with State Abstraction

Author: Akrour, Riad, primary, Veiga, Filipe, additional, Peters, Jan, additional, and Neumann, Gerhard, additional
Published: 2018
Full Text: View/download PDF

4. Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences

Author: Pinsler, Robert, primary, Akrour, Riad, additional, Osa, Takayuki, additional, Peters, Jan, additional, and Neumann, Gerhard, additional
Published: 2018
Full Text: View/download PDF

5. Layered direct policy search for learning hierarchical skills

Author: End, Felix, primary, Akrour, Riad, additional, Peters, Jan, additional, and Neumann, Gerhard, additional
Published: 2017
Full Text: View/download PDF

6. Empowered skills

Author: Gabriel, Alexander, primary, Akrour, Riad, additional, Peters, Jan, additional, and Neumann, Gerhard, additional
Published: 2017
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"Akrour, Riad"'

1. Continuous Action Reinforcement Learning From a Mixture of Interpretable Experts.

2. Reinforcement Learning Based Underwater Wireless Optical Communication Alignment for Autonomous Underwater Vehicles.

3. Regularizing Reinforcement Learning with State Abstraction

4. Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences

5. Layered direct policy search for learning hierarchical skills

6. Empowered skills

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

6 results on '"Akrour, Riad"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources