Akrour, R., Abdolmaleki, A., Abdulsamad, H., Peters, J., & Neumann, G. (2018). Model-Free Trajectory-based Policy Optimization with Monotonic Improvement. Journal of Machine Learning Research, 19: 14. Retrieved from http://jmlr.org/papers/v19/17-329.html.