Dayan, P., Department of Computational Neuroscience, Max Planck Institute for Biological Cybernetics, Max Planck Society
https://papers.nips.cc/paper/2020/file/9dd16e049becf4d5087c90a83fea403b-Paper.pdf
Tano, P., Dayan, P., & Pouget, A. (in press). A local temporal difference code for distributional reinforcement learning. In Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS 2020).