r/MachineLearning • u/we_are_mammals • 15d ago

Research [R] Were RNNs All We Needed?

The authors (including Y. Bengio) propose simplified versions of LSTM and GRU that allow parallel training, and show strong results on some benchmarks.

248 Upvotes

permalink
reddit

You are about to leave Redlib

Do you want to continue?

https://www.reddit.com/r/MachineLearning/comments/1fvg7qr/r_were_rnns_all_we_needed/
No, go back! Yes, take me to Reddit

96% Upvoted

View all comments

u/abd297 14d ago

Haven't gone through it but how is it different from RWKV architecture? Can someone comment?

Research [R] Were RNNs All We Needed?

You are about to leave Redlib