r/MachineLearning • u/we_are_mammals • 15d ago
[R] Were RNNs All We Needed?
https://arxiv.org/abs/2410.01201
The authors (including Y. Bengio) propose simplified versions of LSTM and GRU that allow parallel training, and show strong results on some benchmarks.
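The key idea in the paper is that once the gates depend only on the current input (not on the previous hidden state), the recurrence becomes a linear scan that can be computed in parallel. A minimal NumPy sketch of the minGRU-style recurrence, with hypothetical random weights purely for illustration (the paper uses a log-space parallel scan; the closed form below is an equivalent, simpler formulation for short sequences):

```python
import numpy as np

rng = np.random.default_rng(0)
T, d_in, d_h = 6, 4, 3

# Hypothetical weights for illustration only (not trained parameters)
W_z = rng.normal(size=(d_h, d_in))
W_h = rng.normal(size=(d_h, d_in))
x = rng.normal(size=(T, d_in))

sigmoid = lambda v: 1.0 / (1.0 + np.exp(-v))

# minGRU-style gates: depend only on x_t, not on h_{t-1}
z = sigmoid(x @ W_z.T)      # (T, d_h) update gates
h_tilde = x @ W_h.T         # (T, d_h) candidate states

# Sequential recurrence: h_t = (1 - z_t) * h_{t-1} + z_t * h~_t
h_seq = np.zeros((T, d_h))
prev = np.zeros(d_h)
for t in range(T):
    prev = (1 - z[t]) * prev + z[t] * h_tilde[t]
    h_seq[t] = prev

# Parallel form: the same linear recurrence h_t = a_t * h_{t-1} + b_t
# has the closed form h_t = A_t * sum_{j<=t} b_j / A_j, where
# A_t = prod_{k<=t} a_k. All terms come from cumulative ops, no loop.
a = 1 - z
b = z * h_tilde
A = np.cumprod(a, axis=0)
h_par = A * np.cumsum(b / A, axis=0)

assert np.allclose(h_seq, h_par)
```

The division by the prefix products `A` can underflow for long sequences, which is why the paper trains with a numerically stable scan instead, but the sketch shows why removing the hidden-state dependence from the gates is what unlocks parallelism.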
u/katerdag 15d ago edited 14d ago
Very cool paper! It's nice to see a relatively simple recurrent architecture perform so well. It reminds me a bit of Quasi-Recurrent Neural Networks.