Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations
Xiaoqin Zhang, Huimin Ma
https://arxiv.org/abs/1801.10459
Jan. 31, 2018


Q-Learning
