Yang, Xintong ![]() ![]() ![]() ![]() |
![]() |
PDF
- Accepted Post-Print Version
Download (1MB) |
![]() |
PDF
- Supplemental Material
Download (3MB) |
Abstract
This work re-implements the OpenAI Gym multi-goal robotic manipulation environment, originally based on the commercial Mujoco engine, onto the open-source Pybullet engine. By comparing the performances of the Hindsight Experience Replay-aided Deep Deterministic Policy Gradient agent on both environments, we demonstrate our successful re-implementation of the original environment. Besides, we provide users with new APIs to access a joint control mode, image observations and goals with customisable camera and a built-in on-hand camera. We further design a set of multi-step, multi-goal, long-horizon and sparse reward robotic manipulation tasks, aiming to inspire new goal-conditioned reinforcement learning algorithms for such challenges. We use a simple, human-prior-based curriculum learning method to benchmark the multi-step manipulation tasks. Discussions about future research opportunities regarding this kind of tasks are also provided.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Date Type: | Published Online |
Status: | In Press |
Schools: | Engineering |
Publisher: | Springer |
ISBN: | 9783030891763 |
Date of First Compliant Deposit: | 20 July 2021 |
Last Modified: | 26 Jan 2023 22:27 |
URI: | https://orca.cardiff.ac.uk/id/eprint/142729 |
Citation Data
Cited 1 time in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
![]() |
Edit Item |