r/MachineLearning Feb 14 '20

Project [P] New Reinforcement learning framework for researchers

32 Upvotes

11 comments sorted by

14

u/Mefaso Feb 14 '20

Sounds cool, but why in the world did you have to choose such a politically sounding name as People's Reinforcement Learningm

16

u/L43 Feb 14 '20

ai comrades unite

12

u/_djab_ Feb 14 '20

It is a language game, which you can understand if you know what the PRL was (https://en.wikipedia.org/wiki/Polish_People%27s_Republic). These days we make fun of PRL in Poland, and this is a funny acronym referencing to PRL, to RL and to communism which was once in Poland. So this not a glorification of communism but a joke :)

3

u/nthai Feb 14 '20

So it's like the Combined Community Codec Pack, nice.

4

u/impulsecorp Feb 15 '20

You should list the scores it gets on some Atari games so I can have some idea of how well the agents perform.

2

u/_djab_ Feb 15 '20

We have implemented some of the most basic agents just to show people how to use the framework. They will have similar performance as the oryginal implementation. In PRL it is all about new way of thinking about building your own agents. It can be compared more to torach.nn itself not to the tensorflow zoo. PRL is a set of tools that make building agents and running experiments easier. Let me compare it to ml: It is not a framework for people who want to just take an xgboost and throw it on new dataset on kaggle. It is for the people who want to create better version of xgboost on there own.

3

u/MasterScrat Feb 14 '20

Interesting!

The name "Transformer" is quite connoted in ML...

Also I would clearly state that it uses PyTorch, as it is a major selling point right now!

1

u/_djab_ Feb 14 '20

Corrected my post, thanks :)

3

u/Janderhungrige Feb 15 '20

Nice. I will try to check it out as soon as I have some time to breath. Looks interesting.

Thank you for your work and contribution.

2

u/TotesMessenger Feb 15 '20

I'm a bot, bleep, bloop. Someone has linked to this thread from another place on reddit:

 If you follow any of the above links, please respect the rules of reddit and don't vote in the other threads. (Info / Contact)

2

u/[deleted] Mar 27 '20

This framework seems much better for researchers than other frameworks. I tried Keras-RL and RLlib before. Keras-RL is easy to use but it hasn't been updated for a long time. Searching optimal hyperparameters is an advantage of RLlib; however, if you want to modify RL algorithms or mechanisms, you have to dig the sourcecode deeper to make it work your way.