Policy Learning: A Unified Perspective with Applications in Robotics

Peters, J; Kober, J; Nguyen-Tuong, D

doi:10.1007/978-3-540-89722-4_17

Datensatz

DATENSATZ AKTIONENEXPORT

Zur Ablage hinzufügen

Lokale TagsFreigabegeschichteDetailsÜbersicht

Freigegeben

Konferenzbeitrag

Policy Learning: A Unified Perspective with Applications in Robotics

MPG-Autoren

/persons/resource/persons84135

Peters, J
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons84021

Kober, J
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Max Planck Institute for Biological Cybernetics, Max Planck Society;

/persons/resource/persons84108

Nguyen-Tuong, D
Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society;
Max Planck Institute for Biological Cybernetics, Max Planck Society;

Externe Ressourcen

https://link.springer.com/content/pdf/10.1007%2F978-3-540-89722-4_17.pdf
(Verlagsversion)

Volltexte (beschränkter Zugriff)

Für Ihren IP-Bereich sind aktuell keine Volltexte freigegeben.

Volltexte (frei zugänglich)

Es sind keine frei zugänglichen Volltexte in PuRe verfügbar

Ergänzendes Material (frei zugänglich)

Es sind keine frei zugänglichen Ergänzenden Materialien verfügbar

Zitation

Peters, J., Kober, J., & Nguyen-Tuong, D. (2008). Policy Learning: A Unified Perspective with Applications in Robotics. In S. Girgin, M. Loth, R. Munos, P. Preux, & D. Ryabko (Eds.), Recent Advances in Reinforcement Learning: 8th European Workshop, EWRL 2008, Villeneuve d’Ascq, France, June 30-July 3, 2008 (pp. 220-228). Berlin, Germany: Springer.

Zitierlink: https://hdl.handle.net/11858/00-001M-0000-0013-C669-B

Zusammenfassung

Policy Learning approaches are among the best suited methods for high-dimensional, continuous control systems such as anthropomorphic robot arms and humanoid robots. In this paper, we show two contributions: firstly, we show a unified perspective which allows us to derive several policy learning algorithms from a common point of view, i.e, policy gradient algorithms, natural-gradient algorithms and EM-like policy learning. Secondly, we present several applications to both robot motor primitive learning as well as to robot control in task space. Results both from simulation and several different real robots are shown.