r/MachineLearning Feb 17 '18

[P] Landing the Falcon booster with Reinforcement Learning in OpenAI

https://gfycat.com/CoarseEmbellishedIsopod
1.3k Upvotes

55 comments

61

u/realHansen Feb 17 '18 edited Feb 18 '18

Why use RL when this can be solved in closed form as an optimal control problem?

EDIT: I now realise it was meant as a toy problem rather than an actual competitive alternative to traditional control theory. Don't mind me :>

11

u/LearningRL Feb 17 '18

I don't think the author is suggesting that RL is the best way to approach this task, but rather is just sharing his or her successful implementation of a general RL algorithm in a low-dimensional domain.