Balaji Varatharajan

"The noblest pleasure is the joy of understanding"
~ Leonardo da Vinci

17 February 2024

In this article, we'll briefly discuss the problem of high variance in policy gradients and techniques for reducing it.

Policy gradients demystified

23 April 2023

This article explains policy gradient methods and the REINFORCE algorithm (Monte Carlo policy gradient), which is the simplest policy gradient method.