
  Hierarchical Relative Entropy Policy Search

Daniel, C., Neumann, G., & Peters, J. (2012). Hierarchical Relative Entropy Policy Search. In N. Lawrence, & M. Girolami (Eds.), Artificial Intelligence and Statistics, 21-23 April 2012, La Palma, Canary Islands (pp. 273-281). Madison, WI, USA: International Machine Learning Society.

Creators

 Creators:
Daniel, C, Author
Neumann, G, Author
Peters, J (1), Author
Affiliations:
(1) Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society, ou_1497647

Content

Free keywords: -
Abstract: Many real-world problems are inherently hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance. However, such hierarchical structures cannot be exploited by current policy search algorithms. We will concentrate on a basic, but highly relevant hierarchy - the 'mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. In this paper, we reformulate learning a hierarchical policy as a latent variable estimation problem and subsequently extend the Relative Entropy Policy Search (REPS) to the latent variable case. We show that our Hierarchical REPS can learn versatile solutions while also showing an increased performance in terms of learning speed and quality of the found policy in comparison to the non-hierarchical approach.
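
Illustration (not part of the original record): the two-stage 'mixed option' policy described in the abstract can be sketched as follows. This is a minimal sketch under stated assumptions, not the authors' implementation: the softmax gating over linear state features, the linear-Gaussian option policies, and all names (`gating_probs`, `sample_action`, `gating_weights`, `option_params`) are hypothetical choices made for illustration. The REPS-based learning update itself is not shown.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def gating_probs(state, gating_weights):
    """Gating network (assumed form): pi(o | s) proportional to exp(w_o . s)."""
    scores = [sum(w_i * s_i for w_i, s_i in zip(w, state))
              for w in gating_weights]
    return softmax(scores)


def sample_action(state, gating_weights, option_params, rng):
    """Sample an option from the gating network, then an action from that option.

    option_params[o] = (gain, bias, std) defines an assumed linear-Gaussian
    option policy: a ~ N(gain . s + bias, std^2).
    """
    probs = gating_probs(state, gating_weights)
    # Stage 1: the gating network decides which option to execute.
    option = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    # Stage 2: the selected option policy determines the action.
    gain, bias, std = option_params[option]
    mean = sum(g * s for g, s in zip(gain, state)) + bias
    action = rng.gauss(mean, std)
    return option, action


if __name__ == "__main__":
    rng = random.Random(0)
    state = [1.0, 0.5]
    gating_weights = [[2.0, 0.0], [-2.0, 0.0]]      # two options
    option_params = [([1.0, 0.0], 0.0, 0.1),        # option 0
                     ([0.0, 1.0], -1.0, 0.1)]       # option 1
    print(sample_action(state, gating_weights, option_params, rng))
```

In this sketch the option index plays the role of the latent variable: only the state and the action would be observed, which is why the abstract frames learning the hierarchy as a latent variable estimation problem.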

Details

Language(s):
 Dates: 2012-04
 Publication Status: Published in print
 Pages: -
 Publishing info: -
 Table of Contents: -
 Rev. Type: -
 Identifiers: BibTex Citekey: DanielNP2012
 Degree: -

Event

Title: Fifteenth International Conference on Artificial Intelligence and Statistics (AI Statistics 2012)
Place of Event: La Palma, Canary Islands, Spain
Start-/End Date: 2012-04-21 - 2012-04-23

Source 1

Title: Artificial Intelligence and Statistics, 21-23 April 2012, La Palma, Canary Islands
Source Genre: Proceedings
 Creator(s):
Lawrence, N, Editor
Girolami, M, Editor
Affiliations:
-
Publ. Info: Madison, WI, USA : International Machine Learning Society
Pages: -
Volume / Issue: -
Sequence Number: -
Start / End Page: 273 - 281
Identifier: -

Source 2

Title: JMLR Workshop and Conference Proceedings
Source Genre: Series
 Creator(s):
Affiliations:
Publ. Info: -
Pages: -
Volume / Issue: 22
Sequence Number: -
Start / End Page: -
Identifier: -