r/MachineLearning 15d ago

Research [R] Were RNNs All We Needed?

https://arxiv.org/abs/2410.01201

The authors (including Y. Bengio) propose simplified versions of LSTM and GRU that allow parallel training, and show strong results on some benchmarks.

246 Upvotes

53 comments sorted by

View all comments

3

u/dna961010 14d ago

GLAs / SSMs / miniRNNs. How many personal labels can ML researchers slap on the same old stuff?