Dann, C., Neumann, G., & Peters, J. (2014). Policy Evaluation with Temporal Differences: A Survey and Comparison. Journal of Machine Learning Research, 15, 809-883. Retrieved from http://www.jmlr.org/papers/volume15/dann14a/dann14a.pdf.