English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT
  Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark

Riedmiller, M., Peters, J., & Schaal, S. (2007). Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark. In 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (pp. 254-261). Los Alamitos, CA, USA: IEEE Computer Society.

Item is

Files

show Files

Locators

show
hide
Description:
-

Creators

show
hide
 Creators:
Riedmiller, M, Author
Peters, J1, Author              
Schaal, S, Author              
Affiliations:
1External Organizations, ou_persistent22              

Content

show
hide
Free keywords: -
 Abstract: In this paper, we evaluate different versions from the three main kinds of model-free policy gradient methods, i.e., finite difference gradients, ‘vanilla‘ policy gradients and natural policy gradients. Each of these methods is first presented in its simple form and subsequently refined and optimized. By carrying out numerous experiments on the cart pole regulator benchmark we aim to provide a useful baseline for future research on parameterized policy search algorithms. Portable C++ code is provided for both plant and algorithms; thus, the results in this paper can be reevaluated, reused and new algorithms can be inserted with ease.

Details

show
hide
Language(s):
 Dates: 2007-04
 Publication Status: Published in print
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: DOI: 10.1109/ADPRL.2007.368196
BibTex Citekey: 4727
 Degree: -

Event

show
hide
Title: IEEE Internatinal Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007)
Place of Event: Honolulu, HI, USA
Start-/End Date: 2007-04-01 - 2007-04-05

Legal Case

show

Project information

show

Source 1

show
hide
Title: 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning
Source Genre: Proceedings
 Creator(s):
Affiliations:
Publ. Info: Los Alamitos, CA, USA : IEEE Computer Society
Pages: - Volume / Issue: - Sequence Number: - Start / End Page: 254 - 261 Identifier: ISBN: 1-4244-0706-0