Sutton and Barto Part 3


Published: February 10, 2019

I am currently reading Sutton and Barto (reference below). Along the way I decided to recreate certain experiments cited in each book chapter. This particular example includes the Blackjack problem definition.

The variation in this experiment compares Monte Carlo (MC) methods on Blackjack: on-policy MC, off-policy MC with importance sampling, and MC with exploring starts.
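For readers unfamiliar with the off-policy side of this comparison, here is a minimal sketch of the two importance sampling estimators discussed in Chapter 5 of the book (ordinary and weighted). It is not taken from the notebook: the function names and the `(rho, G)` sample layout are assumptions chosen for illustration, where `rho` is the importance sampling ratio of an episode and `G` is its return.

```python
from typing import Sequence, Tuple

# Hypothetical helper names; these are illustrative and not from the notebook.

def importance_ratio(target_probs: Sequence[float],
                     behavior_probs: Sequence[float]) -> float:
    """Product of pi(a|s) / b(a|s) over the steps of one episode."""
    rho = 1.0
    for pi, b in zip(target_probs, behavior_probs):
        rho *= pi / b  # assumes b > 0 wherever the behavior policy acted
    return rho


def ordinary_importance_sampling(samples: Sequence[Tuple[float, float]]) -> float:
    """Sum of rho * G divided by the number of episodes."""
    if not samples:
        return 0.0
    return sum(rho * g for rho, g in samples) / len(samples)


def weighted_importance_sampling(samples: Sequence[Tuple[float, float]]) -> float:
    """Sum of rho * G divided by the sum of rho (0.0 if all ratios are zero)."""
    total_weight = sum(rho for rho, _ in samples)
    if total_weight == 0:
        return 0.0
    return sum(rho * g for rho, g in samples) / total_weight


if __name__ == "__main__":
    # Three made-up episodes as (rho, G) pairs, with Blackjack-style returns of +1 / -1.
    episodes = [(2.0, 1.0), (0.0, -1.0), (4.0, -1.0)]
    print(ordinary_importance_sampling(episodes))
    print(weighted_importance_sampling(episodes))
```

The ordinary estimator is unbiased but can have very high variance, while the weighted estimator is biased (the bias shrinks as more episodes are collected) and usually has much lower variance.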

I have added a notebook of the experiments:

Monte Carlo with Exploring Starts (ES) & Monte Carlo Sample with Off-Policy

Plots 5.1 - 5.3 are here.

Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction. A Bradford Book, Cambridge, MA, USA.


Saran Ahluwalia
