Singla, Adish Group A. Singla, Max Planck Institute for Software Systems, Max Planck Society;
16545 (Publisher version), 554KB
Zhang, X., Bharti, S., Ma, Y., Singla, A., & Zhu, X. (2021). The Sample Complexity of Teaching by Reinforcement on Q-Learning. In AAAI Technical Track on Machine Learning V (pp. 10939-10947). Palo Alto, CA: AAAI. Retrieved from https://ojs.aaai.org/index.php/AAAI/article/view/17306.