Peters, J Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society; Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society;
Riedmiller, M., Peters, J., & Schaal, S. (2007). Evaluation of Policy Gradient Methods and Variants on the Cart-Pole Benchmark. Proceedings of the 2007 IEEE Internatinal Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2007), 254-261.