Marc Peter Deisenroth , Imperial College London, London and TU Darmstadt, Darmstadt
Dieter Fox , University of Washington
Carl Edward Rasmussen , University of Cambridge, Cambridge
Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required. However, autonomous reinforcement learning (RL) approaches typically require many interactions with the system to learn controllers, which is a practical limitation in real systems, such as robots, where many interactions can be impractical and time consuming. To address this problem, current learning approaches typically require task-specific knowledge in form of expert demonstrations, realistic simulators, pre-shaped policies, or specific knowledge about the underlying dynamics. In this article, we follow a different approach and speed up learning by extracting more information from data. In particular, we learn a probabilistic, non-parametric Gaussian process transition model of the system. By explicitly incorporating model uncertainty into long-term planning and controller learning our approach reduces the effects of model errors, a key problem in model-based learning. Compared to state-of-the art RL our model-based policy search method achieves an unprecedented speed of learning. We demonstrate its applicability to autonomous learning in real robot and control tasks.
Computational modeling, Probabilistic logic, Approximation methods, Robots, Uncertainty, Data models, Predictive models, Robotics, Nonparametric statistics, Machine learning, Robotics
Marc Peter Deisenroth, Dieter Fox, Carl Edward Rasmussen, "Gaussian Processes for Data-Efficient Learning in Robotics and Control", IEEE Transactions on Pattern Analysis & Machine Intelligence, , no. 1, pp. 1, PrePrints PrePrints, doi:10.1109/TPAMI.2013.218