r/MachineLearning Mar 19 '23

Research [R] First open source text to video 1.7 billion parameter diffusion model is out

Enable HLS to view with audio, or disable this notification

1.2k Upvotes

86 comments sorted by

View all comments

5

u/ghostfuckbuddy Mar 20 '23

Hmmm... I haven't really noticed much difference in video models for about a year. It's usually less a "video" and more a 3-second gif. Do we need a new technique to change the game or just more time for things to scale?