Lu, C., Huang, B., Wang, K., Hernández-Lobato, J. M., Zhang, K., & Schölkopf, B. (2020). Sample-Efficient Reinforcement Learning via Counterfactual-Based Data Augmentation. Retrieved from https://offline-rl-neurips.github.io/program/offrl_34.html.