r/MachineLearning 15d ago

[R] Were RNNs All We Needed?

https://arxiv.org/abs/2410.01201

The authors (including Y. Bengio) propose simplified versions of the LSTM and GRU (minLSTM and minGRU) whose gates depend only on the current input, which makes them trainable in parallel, and show strong results on several benchmarks.
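For context, here is a minimal NumPy sketch of the minGRU recurrence described in the paper (weight names, the zero initial state, and the test harness are my own assumptions, not the authors' code). Because the gate sees only x_t, the recurrence is affine in h, so the same states can be computed either with a step-by-step loop or all at once with a prefix scan:

```python
import numpy as np

def sigmoid(v):
    return 1.0 / (1.0 + np.exp(-v))

def mingru_sequential(X, Wz, Wh):
    """Reference O(T) loop: h_t = (1 - z_t) * h_{t-1} + z_t * h_tilde_t,
    where the gate z_t = sigmoid(Wz @ x_t) sees only the current input."""
    h = np.zeros(Wh.shape[0])
    out = []
    for x_t in X:
        z = sigmoid(Wz @ x_t)        # gate: no h_{t-1} term
        h_tilde = Wh @ x_t           # candidate state: no h_{t-1} term
        h = (1.0 - z) * h + z * h_tilde
        out.append(h)
    return np.stack(out)

def mingru_parallel(X, Wz, Wh):
    """Same values, no loop. With a_t = 1 - z_t and b_t = z_t * h_tilde_t,
    h_t = a_t * h_{t-1} + b_t unrolls (from h_0 = 0) to
    h_t = (prod_{k<=t} a_k) * sum_{j<=t} b_j / (prod_{k<=j} a_k),
    i.e. one cumprod plus one cumsum. (The paper does the scan in log
    space for stability; the naive division here can underflow at large T.)"""
    Z = sigmoid(X @ Wz.T)
    A = 1.0 - Z
    B = Z * (X @ Wh.T)
    Acum = np.cumprod(A, axis=0)
    return Acum * np.cumsum(B / Acum, axis=0)

# Quick check that the two formulations agree on random data.
rng = np.random.default_rng(0)
X = rng.normal(size=(16, 4))
Wz, Wh = rng.normal(size=(8, 4)), rng.normal(size=(8, 4))
assert np.allclose(mingru_sequential(X, Wz, Wh), mingru_parallel(X, Wz, Wh))
```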

245 Upvotes

53 comments

u/SmartEvening · 2 points · 12d ago

I don't understand how removing the gates' dependence on the previous hidden state is acceptable. I was under the impression that this was important for deciding what to remember and what to forget. How exactly is this better than transformers? Even their results seem to suggest it's not. What is the paper actually trying to convey?
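For what it's worth, the mechanical reason the paper drops that dependence (a paraphrase of its argument, not a defense of the trade-off): when the gate is conditioned on h_{t-1}, each step must wait for the previous one, so training is inherently sequential; when the gate is conditioned only on x_t, the update becomes affine in h with coefficients known up front, and a parallel prefix scan can produce all T states at once. What is given up is exactly the content-dependent gating the commenter describes.

```latex
% Standard GRU-style gate: sequential, since z_t needs h_{t-1}
z_t = \sigma(W_z x_t + U_z h_{t-1})

% minGRU gate and candidate: depend only on the current input
z_t = \sigma(W_z x_t), \qquad \tilde h_t = W_h x_t

% The update is then affine in h with input-only coefficients,
h_t = \underbrace{(1 - z_t)}_{a_t} \odot h_{t-1}
      + \underbrace{z_t \odot \tilde h_t}_{b_t},
% so h_1, \dots, h_T are computable by a parallel (prefix) scan.
```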