Sutton and Barto Part 2

less than 1 minute read

Published: January 29, 2019

I am currently reading Sutton and Barto (reference below). Along the way I decided to recreate certain experiments cited in each book chapter. This particular example includes the example from Figure 6.2.

I have added a notebook of the experiments:

Temporal Difference Learning with Batch Updates

Plots are here.

Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement Learning: An Introduction. A Bradford Book, Cambridge, MA, USA.

Share on

Twitter Facebook LinkedIn

COVID-19 Reproduction Rates

6 minute read

Published: April 22, 2020

COVID-19 - Keeping an Eye on Reopening of Countries and States

Derivation of Support Vector Machines

less than 1 minute read

Published: March 27, 2020

Recently, I am experimenting with different mechanisms for deriving decision functions for multiclass classification problems. I have spent some time reading and experiementing with examples from the following references:

Nuclear norm minimization

less than 1 minute read

Published: May 01, 2019

Nuclear norm minimization

Natural Language Processing with Deep Learning

less than 1 minute read

Published: April 15, 2019

I recently completed Stanford’s course on natural language processing. This course opened my eyes to different representaions of speech and linguistics.

Saran Ahluwalia

Sutton and Barto Part 2

Share on

You May Also Enjoy

COVID-19 Reproduction Rates

COVID-19 - Keeping an Eye on Reopening of Countries and States

Derivation of Support Vector Machines

Nuclear norm minimization

Natural Language Processing with Deep Learning