Stop Thinking, Just Do!

Sungsoo Kim's Blog

Natural Policy Gradients, TRPO, PPO

Tags

machine learning

21 March 2019

Article Source

Title: Deep RL Bootcamp Lecture 5: Natural Policy Gradients, TRPO, PPO

Instructor: John Schulman (OpenAI)
Program: Lecture 5, Deep RL Bootcamp, Berkeley, August 2017
Title: Natural Policy Gradients, TRPO, PPO

TRPO Paper Review