Peters, J Department Empirical Inference, Max Planck Institute for Biological Cybernetics, Max Planck Society; Max Planck Institute for Biological Cybernetics, Max Planck Society;
https://link.springer.com/content/pdf/10.1007%2F978-3-540-87536-9_42.pdf (Publisher version)
Wierstra, D., Schaul, T., Peters, J., & Schmidhuber, J. (2008). Episodic Reinforcement Learning by Logistic Reward-Weighted Regression. In V. Kurkova-Pohlova, R. Neruda, & J. Koutnik (Eds.), Artificial Neural Networks - ICANN 2008: 18th International Conference, Prague, Czech Republic, September 3-6, 2008 (pp. 407-416). Berlin, Germany: Springer.