Author: "Ayton, Benjamin" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Ayton, Benjamin"' showing total 15 results

Start Over Author "Ayton, Benjamin"

15 results on '"Ayton, Benjamin"'

1. Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari

Author: Ayton, Benjamin and Asai, Masataro
Subjects: Computer Science - Artificial Intelligence, Computer Science - Machine Learning
Abstract: Width-based planning has shown promising results on Atari 2600 games using pixel input, while using substantially fewer environment interactions than reinforcement learning. Recent width-based approaches have computed feature vectors for each screen using a hand designed feature set or a variational autoencoder trained on game screens (VAE-IW), and prune screens that do not have novel features during the search. We propose Olive (Online-VAE-IW), which updates the VAE features online using active learning to maximize the utility of screens observed during planning. Experimental results in 55 Atari games demonstrate that it outperforms Rollout-IW by 42-to-11 and VAE-IW by 32-to-20. Moreover, Olive outperforms existing work based on policy-learning ($\pi$-IW, DQN) trained with 100x training budget by 30-to-22 and 31-to-17, and a state of the art data-efficient reinforcement learning (EfficientZero) trained with the same training budget and ran with 1.8x planning budget by 18-to-7 in Atari 100k benchmark, with no policy learning at all. The source code is available at github.com/ibm/atari-active-learning ., Comment: Accepted in ICAPS 2022, Planning & Learning track. Edits: Title change. Fixed the variance update formula. Updated the scores of $\pi$-IW as per authors' request. Included the results of DQN and EfficientZero
Published: 2021

2. RADMPC: A Fast Decentralized Approach for Chance-Constrained Multi-Vehicle Path-Planning

Author: Huang, Aaron, Ayton, Benjamin J., and Williams, Brian C.
Subjects: Computer Science - Systems and Control, Computer Science - Artificial Intelligence, Computer Science - Multiagent Systems, Computer Science - Robotics
Abstract: Robust multi-vehicle path-planning is important for ensuring the safety of multi-vehicle systems in applications like transportation, search and rescue, and robotic exploration. Chance-constrained methods like Iterative Risk Allocation (IRA)\cite{IRA} have been developed for situations where environmental disturbances are unbounded. However, chance-constrained methods for the multi-vehicle case generally use centralized strategies where the vehicle set is planned with couplings between all vehicle pairs. This approach is intractable as fleet size increases because computation time is exponential with respect to the number of vehicles being planned over due to a polynomial increase in coupling constraints between vehicle pairs. We present a faster approach for chance-constrained multi-vehicle path-planning that relies upon a decentralized path-planning method called Risk-Aware Decentralized Model Predictive Control (RADMPC) to rapidly approximate a centralized IRA approach. The RADMPC approximation is evaluated for vehicle interactions to determine the vehicle sets that should be planned in a coupled manner. Applying IRA to the smaller vehicle sets determined from the RADMPC approximation rapidly plans safe paths for the entire fleet. A Monte Carlo simulation analysis demonstrates the correctness of our approach and a significant improvement in computation time compared to a centralized IRA approach.
Published: 2018

3. Vulcan: A Monte Carlo Algorithm for Large Chance Constrained MDPs with Risk Bounding Functions

Author: Ayton, Benjamin J and Williams, Brian C
Subjects: Computer Science - Artificial Intelligence
Abstract: Chance Constrained Markov Decision Processes maximize reward subject to a bounded probability of failure, and have been frequently applied for planning with potentially dangerous outcomes or unknown environments. Solution algorithms have required strong heuristics or have been limited to relatively small problems with up to millions of states, because the optimal action to take from a given state depends on the probability of failure in the rest of the policy, leading to a coupled problem that is difficult to solve. In this paper we examine a generalization of a CCMDP that trades off probability of failure against reward through a functional relationship. We derive a constraint that can be applied to each state history in a policy individually, and which guarantees that the chance constraint will be satisfied. The approach decouples states in the CCMDP, so that large problems can be solved efficiently. We then introduce Vulcan, which uses our constraint in order to apply Monte Carlo Tree Search to CCMDPs. Vulcan can be applied to problems where it is unfeasible to generate the entire state space, and policies must be returned in an anytime manner. We show that Vulcan and its variants run tens to hundreds of times faster than linear programming methods, and over ten times faster than heuristic based methods, all without the need for a heuristic, and returning solutions with a mean suboptimality on the order of a few percent. Finally, we use Vulcan to solve for a chance constrained policy in a CCMDP with over $10^{13}$ states in 3 minutes., Comment: 33 pages, 12 figures. In review
Published: 2018

4. Query-driven adaptive sampling

Author: Williams, Brian C., Camilli, Richard, Ayton, Benjamin, Williams, Brian C., Camilli, Richard, and Ayton, Benjamin
Abstract: Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy at the Massachusetts Institute of Technology and the Woods Hole Oceanographic Institution September 2022., Automated information gathering allows exploration of environments where data is limited and gathering observations introduces risk, such as underwater and planetary exploration. Typically, exploration has been performed in service of a query, with a unique algorithm developed for each mission. Yet this approach does not allow scientists to respond to novel questions as they are raised. In this thesis, we develop a single approach for a broad range of adaptive sampling missions with risk and limited prior knowledge. To achieve this, we present contributions in planning adaptive missions in service of queries, and modeling multi-attribute environments. First, we define a query language suitable for specifying diverse goals in adaptive sampling. The language fully encompasses objectives from previous adaptive sampling approaches, and significantly extends the possible range of objectives. We prove that queries expressible in this language are not biased in a way that avoids information. We then describe a Monte Carlo tree search approach to plan for all queries in our language, using sample based objective estimators embedded within tree search. This approach outperforms methods that maximize information about all variables in hydrocarbon seep search and fire escape scenarios. Next, we show how to plan when the policy must bound risk as a function of reward. By solving approximating problems, we guarantee risk bounds on policies with large numbers of actions and continuous observations, ensuring that risks are only taken when justified by reward. Exploration is limited by the quality of the environment model, so we introduce Gaussian process models with directed acyclic structure to improve model accuracy under limited data. The addition of interpretable structure allows qualitative expert knowledge of the environment to be encoded through structure and parameter constraints. Since expert knowledge may be incomplete, we introduce efficient structure learning over stru, The work in this thesis was supported by the Exxon Mobil Corporation as part of the MIT Energy Initiative under the project ‘Autonomous System for Deep Sea Hydrocarbon Detection and Monitoring’, NASA’s PSTAR program under the project ‘Cooperative Exploration with Under-actuated Autonomous Vehicles in Hazardous Environments’, and the Vulcan Machine Learning Center for Impact under the project ‘Machine Learning Based Persistent Autonomous Underwater Scientific Studies’.
Published: 2022

5. Information-Driven and Risk-Bounded Autonomy for Scientist Avatars

Author: Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Department of Aeronautics and Astronautics, Timmons, Eric M, Reeves, Marlyse, Ayton, Benjamin J, Williams, Brian, Ingham, Michel D, Castillo-Rogez, Julie, Seto, William, Havelund, Klaus, Jasour, Ashkan, Donitz, Benjamin, Mages, Declan, Rahmani, Amir, Tavallali, Peyman, Chung, Seung, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology. Department of Aeronautics and Astronautics, Timmons, Eric M, Reeves, Marlyse, Ayton, Benjamin J, Williams, Brian, Ingham, Michel D, Castillo-Rogez, Julie, Seto, William, Havelund, Klaus, Jasour, Ashkan, Donitz, Benjamin, Mages, Declan, Rahmani, Amir, Tavallali, Peyman, and Chung, Seung
Published: 2022

6. Toward Information-Driven and Risk-Bounded Autonomy for Adaptive Science and Exploration

Author: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ayton, Benjamin J, Reeves, Marlyse, Timmons, Eric, Williams, Brian C, Ingham, Michel D, Massachusetts Institute of Technology. Department of Aeronautics and Astronautics, Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science, Ayton, Benjamin J, Reeves, Marlyse, Timmons, Eric, Williams, Brian C, and Ingham, Michel D
Abstract: © 2020 The MITRE Corporation. All Rights Reserved. While the primary purpose of robotic space exploration systems is to gather scientific data, it is equally important that engineering operations are performed and engineering constraints are respected in order to prolong the mission life and ensure the integrity of the observations taken. However, science and engineering operations are often at odds with each other as attempting to obtain the “best” data may violate engineering operations constraints and place the mission at risk. Historically, mission systems engineering has separated the process of planning for science from engineering operations, with the engineering operations constrained to support the science measurement plan with acceptable risk. This task division leads to multiple design iterations between the science and engineering operations which results in compromised, conservative operations that reduce science return and are more brittle than desired. To overcome these limitations, we present an approach for autonomous mission planning that explicitly models and reasons about the coupling between science and engineering operations, resulting in higher science return, while maintaining acceptable levels of risk. Our approach is to develop an information-driven, risk-bounded plan executive that is capable of producing missions satisfying the goals and constraints expressed in these programs. In this paper, we describe in detail the risk-bounded, information-driven execution problem and lay out the architecture used in our information-directed plan executive ‘Enterprise’. We then show the performance of the current version of Enterprise on two space exploration scenarios. Finally, we conclude with thoughts on future work, including on the design of a proposed information-theoretic language that will allow operators and scientists to specify their objectives in terms of questions about scientific phenomena or the configuration of the space system.
Published: 2022

7. Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari

Author: Ayton, Benjamin, primary and Asai, Masataro, additional
Published: 2022
Full Text: View/download PDF

8. Information-Driven and Risk-Bounded Autonomy for Scientist Avatars

Author: Timmons, Eric M., primary, Reeves, Marlyse, additional, Ayton, Benjamin J., additional, Williams, Brian, additional, Ingham, Michel D., additional, Castillo-Rogez, Julie, additional, Seto, William, additional, Havelund, Klaus, additional, Jasour, Ashkan, additional, Donitz, Benjamin, additional, Mages, Declan, additional, Rahmani, Amir, additional, Tavallali, Peyman, additional, and Chung, Seung, additional
Published: 2021
Full Text: View/download PDF

9. Using a Ladder of Seeps with computer decision processes to explore for and evaluate cold seeps on the Costa Rica active margin

Author: Vrolijk, Peter, Summa, Lori, Ayton, Benjamin, Nomikou, Paraskevi, Hüpers, Andre, Kinnaman, Frank, Sylva, Sean, Valentine, David L., Camilli, Richard, Vrolijk, Peter, Summa, Lori, Ayton, Benjamin, Nomikou, Paraskevi, Hüpers, Andre, Kinnaman, Frank, Sylva, Sean, Valentine, David L., and Camilli, Richard
Abstract: © The Author(s), 2021. This article is distributed under the terms of the Creative Commons Attribution License. The definitive version was published in Vrolijk, P., Summa, L., Ayton, B., Nomikou, P., Huepers, A., Kinnaman, F., Sylva, S., Valentine, D., & Camilli, R. Using a Ladder of Seeps with computer decision processes to explore for and evaluate cold seeps on the Costa Rica active margin. Frontiers in Earth Science, 9, (2021): 601019, https://doi.org/10.3389/feart.2021.601019., Natural seeps occur at the seafloor as loci of fluid flow where the flux of chemical compounds into the ocean supports unique biologic communities and provides access to proxy samples of deep subsurface processes. Cold seeps accomplish this with minimal heat flux. While individual expertize is applied to locate seeps, such knowledge is nowhere consolidated in the literature, nor are there explicit approaches for identifying specific seep types to address discrete scientific questions. Moreover, autonomous exploration for seeps lacks any clear framework for efficient seep identification and classification. To address these shortcomings, we developed a Ladder of Seeps applied within new decision-assistance algorithms (Spock) to assist in seep exploration on the Costa Rica margin during the R/V Falkor 181210 cruise in December, 2018. This Ladder of Seeps [derived from analogous astrobiology criteria proposed by Neveu et al. (2018)] was used to help guide human and computer decision processes for ROV mission planning. The Ladder of Seeps provides a methodical query structure to identify what information is required to confirm a seep either: 1) supports seafloor life under extreme conditions, 2) supports that community with active seepage (possible fluid sample), or 3) taps fluids that reflect deep, subsurface geologic processes, but the top rung may be modified to address other scientific questions. Moreover, this framework allows us to identify higher likelihood seep targets based on existing incomplete or easily acquired data, including MBES (Multi-beam echo sounder) water column data. The Ladder of Seeps framework is based on information about the instruments used to collect seep information (e.g., are seeps detectable by the instrument with little chance of false positives?) and contextual criteria about the environment in which the data are collected (e.g., temporal variability of seep flux). Finally, the assembled data are considered in light of a Last-Resort inte, Support for this research was provided through NASA PSTAR Grant #NNX16AL08G and National Science Foundation Navigating the New Arctic grant #1839063. Use of the R/V Falkor and ROV SuBastian were provided through a grant from the Schmidt Ocean Institute. The AUG Nemesis and the Aurora in-situ mass spectrometer was provided through in-kind support from Teledyne Webb Research and Navistry Corp, respectively.
Published: 2021

10. Measurement Maximizing Adaptive Sampling with Risk Bounding Functions

Author: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Woods Hole Oceanographic Institution, Ayton, Benjamin James, Williams, Brian C, Camilli, Richard, Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory, Woods Hole Oceanographic Institution, Ayton, Benjamin James, Williams, Brian C, and Camilli, Richard
Abstract: © 2019, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. In autonomous exploration a mobile agent must adapt to new measurements to seek high reward, but disturbances cause a probability of collision that must be traded off against expected reward. This paper considers an autonomous agent tasked with maximizing measurements from a Gaussian Process while subject to unbounded disturbances. We seek an adaptive policy in which the maximum allowed probability of failure is constrained as a function of the expected reward. The policy is found using an extension to Monte Carlo Tree Search (MCTS) which bounds probability of failure. We apply MCTS to a sequence of approximating problems, which allows constraint satisfying actions to be found in an anytime manner. Our innovation lies in defining the approximating problems and replanning strategy such that the probability of failure constraint is guaranteed to be satisfied over the true policy. The approach does not need to plan for all measurements explicitly or constrain planning based only on the measurements that were observed. To the best of our knowledge, our approach is the first to enforce probability of failure constraints in adaptive sampling. Through experiments on real bathymetric data and simulated measurements, we show our algorithm allows an agent to take dangerous actions only when the reward justifies the risk. We then verify through Monte Carlo simulations that failure bounds are satisfied.
Published: 2021

11. Using a Ladder of Seeps With Computer Decision Processes to Explore for and Evaluate Cold Seeps on the Costa Rica Active Margin

Author: Vrolijk, Peter, primary, Summa, Lori, additional, Ayton, Benjamin, additional, Nomikou, Paraskevi, additional, Hüpers, Andre, additional, Kinnaman, Frank, additional, Sylva, Sean, additional, Valentine, David, additional, and Camilli, Richard, additional
Published: 2021
Full Text: View/download PDF

12. Toward Information-Driven and Risk-Bounded Autonomy for Adaptive Science and Exploration

Author: Ayton, Benjamin J., primary, Reeves, Marlyse, additional, Timmons, Eric, additional, Williams, Brian C., additional, and Ingham, Michel D., additional
Published: 2020
Full Text: View/download PDF

13. Measurement Maximizing Adaptive Sampling with Risk Bounding Functions

Author: Ayton, Benjamin, primary, Williams, Brian, additional, and Camilli, Richard, additional
Published: 2019
Full Text: View/download PDF

14. Risk-bounded autonomous information gathering for localization of phenomena in hazardous environments

Author: Brian C. Williams., Massachusetts Institute of Technology. Department of Aeronautics and Astronautics., Ayton, Benjamin James, Brian C. Williams., Massachusetts Institute of Technology. Department of Aeronautics and Astronautics., and Ayton, Benjamin James
Abstract: Thesis: S.M., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2017., Cataloged from PDF version of thesis., Includes bibliographical references (pages 147-150)., Exploration of new environments is often conducted in search of some phenomenon of interest. Examples include the search for extreme forms of life in the deep ocean or under the ice on Europa, or localizing resource deposits on the ocean floor. Exploration of all these environments is dangerous because of uncertainty in the environment and poorly characterized disturbances that can damage the exploration vehicle. Autonomous vehicles allows exploration in those environments where it is too dangerous or expensive to send a human-operated craft. Autonomous exploration has been well-studied from the perspective of information maximization, but information gathering has not been considered with the intention of localizing specific phenomena, nor has it been considered in environments where exploration can threaten the vehicle. This thesis addresses both challenges by introducing Risk-Bounded Adaptive Search, which maximizes the number of phenomena located while bounding the probability of mission failure by a user-defined threshold. The first innovation of this thesis is the development of a new information measure that focuses on locating instances of a specific phenomenon. Search for phenomena of interest is framed as a discrete space Markov Decision Process that is solved using forward search and receding horizon planning, with a reward function specified as the information gained about unobserved instances of the phenomenon of interest from measurements. Using this reward function, the number of phenomena located is increased compared to maximizing conventional information, as it steers the agent towards locations where phenomena are thought to exist so they are not bypassed when the belief state is high. The second innovation is a method of applying risk bounds as a function of the expected information gain of a policy over a planning horizon, in contrast to a static bound. This 'Performance-Guided Risk Bounding' system allows an MDP policy to be found that is sligh, by Benjamin James Ayton., S.M.
Published: 2018

15. Calculating Tidal Stresses on Satellites using SatStressGUI

Author: Patthoff, D. Alex, Pappalardo, Robert T., Harper, Chad, Ismailyan, Andre, Doan, Nhu, Sinclair, Peter, Ayton, Benjamin, Tang, Lee, Kay, Jonathan, Li, Jessica, Dubois, David, Kattenhorn, Simon, Planetary Science Institute [Tucson] (PSI), Jet Propulsion Laboratory (JPL), NASA-California Institute of Technology (CALTECH), Grinnell College, Columbia University [New York], Massachusetts Institute of Technology (MIT), California Institute of Technology (CALTECH), Department of Earth and Environmental Sciences [Chicago] (EAES), University of Illinois [Chicago] (UIC), University of Illinois System-University of Illinois System, PLANETO - LATMOS, Laboratoire Atmosphères, Milieux, Observations Spatiales (LATMOS), Université de Versailles Saint-Quentin-en-Yvelines (UVSQ)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut national des sciences de l'Univers (INSU - CNRS)-Centre National de la Recherche Scientifique (CNRS)-Université de Versailles Saint-Quentin-en-Yvelines (UVSQ)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut national des sciences de l'Univers (INSU - CNRS)-Centre National de la Recherche Scientifique (CNRS), ConocoPhillips Company [Houston], and Dubois, David
Subjects: [SDU.STU.PL]Sciences of the Universe [physics]/Earth Sciences/Planetology, [SDU.STU.PL] Sciences of the Universe [physics]/Earth Sciences/Planetology, ComputingMilieux_MISCELLANEOUS
Abstract: International audience
Published: 2017

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

15 results on '"Ayton, Benjamin"'

1. Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari

2. RADMPC: A Fast Decentralized Approach for Chance-Constrained Multi-Vehicle Path-Planning

3. Vulcan: A Monte Carlo Algorithm for Large Chance Constrained MDPs with Risk Bounding Functions

4. Query-driven adaptive sampling

5. Information-Driven and Risk-Bounded Autonomy for Scientist Avatars

6. Toward Information-Driven and Risk-Bounded Autonomy for Adaptive Science and Exploration

7. Is Policy Learning Overrated?: Width-Based Planning and Active Learning for Atari

8. Information-Driven and Risk-Bounded Autonomy for Scientist Avatars

9. Using a Ladder of Seeps with computer decision processes to explore for and evaluate cold seeps on the Costa Rica active margin

10. Measurement Maximizing Adaptive Sampling with Risk Bounding Functions

11. Using a Ladder of Seeps With Computer Decision Processes to Explore for and Evaluate Cold Seeps on the Costa Rica Active Margin

12. Toward Information-Driven and Risk-Bounded Autonomy for Adaptive Science and Exploration

13. Measurement Maximizing Adaptive Sampling with Risk Bounding Functions

14. Risk-bounded autonomous information gathering for localization of phenomena in hazardous environments

15. Calculating Tidal Stresses on Satellites using SatStressGUI

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

15 results on '"Ayton, Benjamin"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources