Peters, J Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society; Dept. Empirical Inference, Max Planck Institute for Intelligent Systems, Max Planck Society;
Hoffman, M., Freitas ND, Doucet, A., & Peters, J. (2009). An Expectation Maximization Algorithm for Continuous Markov Decision Processes with Arbitrary Reward. Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics (AIStats 2009), 232-239.