MLA Citation
Maxim Lapan. Deep reinforcement learning hands-on: apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more. Birmingham: Packt Publishing, 2018. http://access.bl.uk/ark:/81055/vdc_100062224606.0x000001