Stop Thinking, Just Do!
Sungsoo Kim's Blog
Home
Tags
Categories
Archive
Natural Policy Gradients, TRPO, PPO
Tags
machine learning
1380
21 March 2019
Article Source
Title:
Deep RL Bootcamp Lecture 5: Natural Policy Gradients, TRPO, PPO
Natural Policy Gradients, TRPO, PPO
Instructor: John Schulman (OpenAI)
Program: Lecture 5 Deep RL Bootcamp Berkeley August 2017
Title: Natural Policy Gradients, TRPO, PPO
TPRO Paper Review
Please enable JavaScript to view the
comments powered by Disqus.
comments powered by
Disqus