Learning Motor Primitives with Reinforcement Learning

Peters, J; Schaal, S

Learning Motor Primitives with Reinforcement Learning

Peters, J., & Schaal, S. (2004). Learning Motor Primitives with Reinforcement Learning. Poster presented at 11th Joint Symposium on Neural Computation (JSNC 2004), Los Angeles, CA, USA.

Item is 公開

表示: 全項目非表示: 全項目

基本情報

表示: 非表示:

アイテムのパーマリンク: https://hdl.handle.net/11858/00-001M-0000-0013-D95B-1 版のパーマリンク: https://hdl.handle.net/21.11116/0000-0005-6478-4

資料種別: ポスター

ファイル

表示: ファイル

作成者

表示:

非表示:

作成者:
Peters, J^{1, 2}, 著者
Schaal, S, 著者

所属:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795
2Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497794

内容説明

表示:

非表示:

キーワード: -

要旨: One of the major challenges in action generation for robotics and in the understanding of human motor control is to learn the "building blocks of move- ment generation," or more precisely, motor primitives. Recently, Ijspeert et al. [1, 2] suggested a novel framework how to use nonlinear dynamical systems as motor primitives. While a lot of progress has been made in teaching these mo- tor primitives using supervised or imitation learning, the self-improvement by interaction of the system with the environment remains a challenging problem. In this poster, we evaluate different reinforcement learning approaches can be used in order to improve the performance of motor primitives. For pursuing this goal, we highlight the difficulties with current reinforcement learning methods, and line out how these lead to a novel algorithm which is based on natural policy gradients [3]. We compare this algorithm to previous reinforcement learning algorithms in the context of dynamic motor primitive learning, and show that it outperforms these by at least an order of magnitude. We demonstrate the efficiency of the resulting reinforcement learning method for creating complex behaviors for automous robotics. The studied behaviors will include both discrete, finite tasks such as baseball swings, as well as complex rhythmic patterns as they occur in biped locomotion.

資料詳細

表示:

非表示:

言語:

日付: オンライン出版: 2004-05

出版の状態: オンラインで出版済み

ページ: -

出版情報: -

目次: -

査読: -

識別子（DOI, ISBNなど）: BibTex参照ID: 5066

学位: -

アイテム詳細

基本情報

ファイル

関連URL

作成者

内容説明

資料詳細

関連イベント

訴訟

Project information

出版物