Python reinforcement learning projects : eight hands-on projects exploring reinforcement learning algorithms using TensorFlow /: eight hands-on projects exploring reinforcement learning algorithms using TensorFlow. (2018)

Record Type:: Book
Title:: Python reinforcement learning projects : eight hands-on projects exploring reinforcement learning algorithms using TensorFlow /: eight hands-on projects exploring reinforcement learning algorithms using TensorFlow. (2018)
Main Title:: Python reinforcement learning projects : eight hands-on projects exploring reinforcement learning algorithms using TensorFlow
Further Information:: Note: Sean Saito, Yang Wenzhuo, Rajalingappaa Shanmugamani.
Authors:: Saito, Sean
Yang, Wenzhuo
Shanmugamani, Rajalingappaa
Contents:: Cover; Title Page; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Up and Running with Reinforcement Learning; Introduction to this book; Expectations; Hardware and software requirements; Installing packages; What is reinforcement learning?; The agent; Policy; Value function; Model; Markov decision process (MDP); Deep learning; Neural networks; Backpropagation; Convolutional neural networks; Advantages of neural networks; Implementing a convolutional neural network in TensorFlow; TensorFlow; The Fashion-MNIST dataset; Building the network. Methods for building the networkbuild method; fit method; Summary; References; Chapter 2: Balancing CartPole; OpenAI Gym; Gym; Installation ; Running an environment; Atari; Algorithmic tasks; MuJoCo; Robotics; Markov models; CartPole; Summary; Chapter 3: Playing Atari Games; Introduction to Atari games; Building an Atari emulator; Getting started; Implementation of the Atari emulator; Atari simulator using gym; Data preparation; Deep Q-learning; Basic elements of reinforcement learning; Demonstrating basic Q-learning algorithm; Implementation of DQN; Experiments; Summary. Chapter 4: Simulating Control TasksIntroduction to control tasks; Getting started; The classic control tasks; Deterministic policy gradient; The theory behind policy gradient; DPG algorithm; Implementation of DDPG; Experiments; Trust region policy optimization; Theory behind TRPO; TRPO algorithm; Experiments on MuJoCo tasks; … (more)
Publisher Details:: Birmingham : Packt Publishing Ltd
Publication Date:: 2018
Extent:: 1 online resource (287 pages)
Subjects:: 006.31
Algorithms -- Study and teaching
Machine learning
Artificial intelligence
Python (Computer program language)
Algorithms -- Study and teaching
Electronic books
Languages:: English
ISBNs:: 9781788993227
1788993225
Notes:: Note: Print version record.
Access Rights:: Legal Deposit; Only available on premises controlled by the deposit library and to one user at any one time; The Legal Deposit Libraries (Non-Print Works) Regulations (UK).
Access Usage:: Restricted: Printing from this resource is governed by The Legal Deposit Libraries (Non-Print Works) Regulations (UK) and UK copyright law currently in force.
View Content:: Available online (eLD content is only available in our Reading Rooms) ↗
Physical Locations:: British Library HMNTS - ELD.DS.334856
Ingest File:: 02_334.xml