日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

登録内容を編集ファイル形式で保存
 
 
ダウンロード電子メール
  Learning Motor Primitives with Reinforcement Learning

Peters, J., & Schaal, S. (2004). Learning Motor Primitives with Reinforcement Learning. Poster presented at 11th Joint Symposium on Neural Computation (JSNC 2004), Los Angeles, CA, USA.

Item is

基本情報

表示: 非表示:
資料種別: ポスター

ファイル

表示: ファイル

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Peters, J1, 2, 著者           
Schaal, S, 著者           
所属:
1Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795              
2Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497794              

内容説明

表示:
非表示:
キーワード: -
 要旨: One of the major challenges in action generation for robotics and in the understanding of human motor control is to learn the "building blocks of move- ment generation," or more precisely, motor primitives. Recently, Ijspeert et al. [1, 2] suggested a novel framework how to use nonlinear dynamical systems as motor primitives. While a lot of progress has been made in teaching these mo- tor primitives using supervised or imitation learning, the self-improvement by interaction of the system with the environment remains a challenging problem. In this poster, we evaluate different reinforcement learning approaches can be used in order to improve the performance of motor primitives. For pursuing this goal, we highlight the difficulties with current reinforcement learning methods, and line out how these lead to a novel algorithm which is based on natural policy gradients [3]. We compare this algorithm to previous reinforcement learning algorithms in the context of dynamic motor primitive learning, and show that it outperforms these by at least an order of magnitude. We demonstrate the efficiency of the resulting reinforcement learning method for creating complex behaviors for automous robotics. The studied behaviors will include both discrete, finite tasks such as baseball swings, as well as complex rhythmic patterns as they occur in biped locomotion.

資料詳細

表示:
非表示:
言語:
 日付: 2004-05
 出版の状態: オンラインで出版済み
 ページ: -
 出版情報: -
 目次: -
 査読: -
 識別子(DOI, ISBNなど): BibTex参照ID: 5066
 学位: -

関連イベント

表示:
非表示:
イベント名: 11th Joint Symposium on Neural Computation (JSNC 2004)
開催地: Los Angeles, CA, USA
開始日・終了日: 2004-05-15

訴訟

表示:

Project information

表示:

出版物

表示: