Bogdanovic, M., Khadiv, M., & Righetti, L. (2022). Model-free reinforcement learning for robust locomotion using demonstrations from trajectory optimization. Frontiers in Robotics and AI, 9: 854212. doi:10.3389/frobt.2022.854212.