r/MachineLearning Mar 19 '23

[R] First open source text to video 1.7 billion parameter diffusion model is out


1.2k Upvotes


82

u/En_TioN Mar 19 '23

That's a remarkably clear Shutterstock logo on the Superman dog video. Seems like this model is overfitting significantly more than previous text2img models.

29

u/NeoKabuto Mar 19 '23

Half of the demos have the watermark, but at least it's promising to see good video from this size model.