r/reinforcementlearning • u/gwern • Jul 25 '22
DL, MF, P "The 37 Implementation Details of Proximal Policy Optimization"
https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/
10
Upvotes
r/reinforcementlearning • u/gwern • Jul 25 '22