Learning model-based strategies in simple environments with hierarchical q-networks
Necati Alp Muyesser, Kyle Dunovan, Timothy Verstynen
https://arxiv.org/abs/1801.06689
Jan. 20, 2018


Q-Learning
