Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark

Riedmiller, M; Peters, J; Schaal, S

doi:10.1109/ADPRL.2007.368196

Please note that a newer version of this item is available:
https://pure.mpg.de/pubman/item/item_1790533_2

Riedmiller, M., Peters, J., & Schaal, S. (2007). Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark. Proceedings of the 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007), 254-261.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-CE1B-8
Version Permalink: https://hdl.handle.net/11858/00-001M-0000-0013-CE1C-6

Genre: Conference Paper

Creators

Creators:
Riedmiller, M, Author
Peters, J^{1, 2}, Author
Schaal, S, Author

Affiliations:
1 Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society, ou_1497795
2 Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society, ou_1497647

Content

Free keywords: -

Abstract: In this paper, we evaluate different versions from the three main kinds of model-free policy gradient methods, i.e., finite difference gradients, 'vanilla' policy gradients and natural policy gradients. Each of these methods is first presented in its simple form and subsequently refined and optimized. By carrying out numerous experiments on the cart pole regulator benchmark we aim to provide a useful baseline for future research on parameterized policy search algorithms. Portable C++ code is provided for both plant and algorithms; thus, the results in this paper can be reevaluated, reused and new algorithms can be inserted with ease.
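
Illustration (not part of the original record): the simplest of the three method families named in the abstract, finite difference gradients, can be sketched as follows. This is a minimal sketch in Python, not the paper's C++ code. The names finite_difference_gradient, toy_return and gradient_ascent are hypothetical, and toy_return is an assumed noise-free stand-in for the expected return of a parameterized policy; on the actual cart-pole benchmark the return would come from noisy rollouts of the plant.

```python
from typing import Callable, List


def finite_difference_gradient(
    expected_return: Callable[[List[float]], float],
    theta: List[float],
    epsilon: float = 1e-4,
) -> List[float]:
    """Central-difference estimate of the gradient of expected_return at theta."""
    gradient = []
    for i in range(len(theta)):
        plus = list(theta)
        minus = list(theta)
        plus[i] += epsilon   # perturb one policy parameter upward
        minus[i] -= epsilon  # and downward by the same amount
        slope = (expected_return(plus) - expected_return(minus)) / (2.0 * epsilon)
        gradient.append(slope)
    return gradient


def toy_return(theta: List[float]) -> float:
    # Hypothetical stand-in for a rollout-based return; maximal at theta = (1, -2).
    return -((theta[0] - 1.0) ** 2) - (theta[1] + 2.0) ** 2


def gradient_ascent(
    theta: List[float], steps: int = 200, learning_rate: float = 0.1
) -> List[float]:
    """Plain gradient ascent on toy_return using the finite-difference estimate."""
    for _ in range(steps):
        grad = finite_difference_gradient(toy_return, theta)
        theta = [t + learning_rate * g for t, g in zip(theta, grad)]
    return theta
```

Calling gradient_ascent([0.0, 0.0]) moves the parameters toward the maximizer of the toy return. With a stochastic return, each evaluation would typically be averaged over several rollouts before differencing.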

Details

Language(s):

Dates: Date issued: 2007-04

Publication Status: Issued

Pages: -

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: URI: http://liu.ece.uic.edu/ADPRL07/
DOI: 10.1109/ADPRL.2007.368196
BibTex Citekey: 4727

Degree: -

Event

Title: 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning

Place of Event: Honolulu, Hawaii

Start-/End Date: -

Source 1

Title: Proceedings of the 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007)

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: Los Alamitos, CA, USA : IEEE Computer Society

Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 254 - 261
Identifier: -