Peters, J Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society;
Pinsler, R., Akrour, R., Osa, T., Peters, J., & Neumann, G. (2018). Sample and Feedback Efficient Hierarchical Reinforcement Learning from Human Preferences. In 2018 IEEE International Conference on Robotics and Automation (ICRA 2018) (pp. 596-601). Piscataway, NJ: IEEE. doi:10.1109/ICRA.2018.8460907.