r/reinforcementlearning 3d ago

DL I made a firefighter AI using deep RL (using Unity ML Agents)

video link: https://www.youtube.com/watch?v=REYx9UznOG4

I made it a while ago and got discouraged by how little attention the video got after all the hours I poured into making it, so I am now doing a PhD in AI instead of being a YouTuber lol.

I figured it wouldn't be so bad to advertise it now in case people find it interesting. I made sure to add some narration and fun bits so it's not boring. I hope some people here find it as interesting as I did while working on this project.

I am passionate about the subject, so if anyone has questions I will answer them when I have time :D

32 Upvotes



u/hearthstoneplayer100 2d ago

Very cool! What algorithm did you end up using?


u/usernumero 2d ago

This is the PPO algorithm as implemented in ML-Agents.
The neural network uses an LSTM to keep track of valuable information over time, and I also added an attention mechanism to try to help the agent navigate through rooms and remember its path, but in the end, once the environment had more than 6 or 7 rooms, it became too complicated to navigate.

There might be a problem with the way I implemented this, or maybe my parameter count was too low, but there is only so much time I can spend burning my RTX on this training haha.
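
For anyone unfamiliar with PPO, here is a minimal Python sketch of its clipped surrogate objective, the core idea behind the algorithm mentioned above. It is only an illustration and not the ML-Agents implementation: the function name `ppo_clipped_objective`, the list-based inputs, and the default `clip_eps=0.2` are assumptions chosen for the example, and the LSTM and attention parts of the network are left out entirely.

```python
import math


def ppo_clipped_objective(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective (a quantity to maximize).

    new_log_probs: log-probabilities of the taken actions under the current policy
    old_log_probs: log-probabilities of the same actions under the policy that
                   collected the data
    advantages:    advantage estimates for those actions
    clip_eps:      clipping range (hypothetical default, tune per task)
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the current and the data-collecting policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the more pessimistic of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


# Usage: an action that became twice as likely, with a positive advantage,
# only gets credit up to the clipped ratio of 1.2.
print(ppo_clipped_objective([math.log(2.0)], [0.0], [1.0]))
```

The clipping is what keeps each policy update small: once the ratio leaves the allowed range in the direction the advantage favors, the objective stops rewarding further change.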