Dayan, P Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society; Max Planck Institute for Biological Cybernetics, Max Planck Society;
https://papers.nips.cc/paper/2020/file/9dd16e049becf4d5087c90a83fea403b-Paper.pdf (Any fulltext)
Tano, P., Dayan, P., & Pouget, A. (2021). A local temporal difference code for distributional reinforcement learning. In H. Larochelle, M. Ranzato, R. Hadsell, M.-F. Balcan, & H.-T. Lin (Eds.), Advances in Neural Information Processing Systems 33 (pp. 13662-13673). Red Hook, NY, USA: Curran.