"The noblest pleasure is the joy of understanding"
~ Leonardo da Vinci

High Variance in Policy Gradients

In this article, we'll briefly discuss the problem of high variance in policy gradients and techniques for reducing it.

Policy gradients demystified

This article explains policy gradient methods and the REINFORCE algorithm (Monte Carlo policy gradient), which is the simplest policy gradient method.