Back to Search
Start Over
Visuomotor Mechanical Search: Learning to Retrieve Target Objects in Clutter
- Source :
- IROS
- Publication Year :
- 2020
-
Abstract
- When searching for objects in cluttered environments, it is often necessary to perform complex interactions in order to move occluding objects out of the way and fully reveal the object of interest and make it graspable. Due to the complexity of the physics involved and the lack of accurate models of the clutter, planning and controlling precise predefined interactions with accurate outcome is extremely hard, when not impossible. In problems where accurate (forward) models are lacking, Deep Reinforcement Learning (RL) has shown to be a viable solution to map observations (e.g. images) to good interactions in the form of close-loop visuomotor policies. However, Deep RL is sample inefficient and fails when applied directly to the problem of unoccluding objects based on images. In this work we present a novel Deep RL procedure that combines i) teacher-aided exploration, ii) a critic with privileged information, and iii) mid-level representations, resulting in sample efficient and effective learning for the problem of uncovering a target object occluded by a heap of unknown objects. Our experiments show that our approach trains faster and converges to more efficient uncovering solutions than baselines and ablations, and that our uncovering policies lead to an average improvement in the graspability of the target object, facilitating downstream retrieval applications.
- Subjects :
- FOS: Computer and information sciences
0209 industrial biotechnology
Computer Science - Machine Learning
Extremely hard
business.industry
Computer Science - Artificial Intelligence
Sample (statistics)
02 engineering and technology
010501 environmental sciences
Object (computer science)
01 natural sciences
Outcome (probability)
Machine Learning (cs.LG)
Computer Science - Robotics
020901 industrial engineering & automation
Artificial Intelligence (cs.AI)
Clutter
Reinforcement learning
Computer vision
Artificial intelligence
business
Robotics (cs.RO)
0105 earth and related environmental sciences
Heap (data structure)
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- IROS
- Accession number :
- edsair.doi.dedup.....137d00b160ef15eb4a2946634dc86d2e