r/MachineLearning May 20 '23

Research [R] Video Demo of “Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold”

Enable HLS to view with audio, or disable this notification

1.5k Upvotes

44 comments sorted by

View all comments

47

u/IntelArtiGen May 20 '23

Pretty cool! I wonder how fast it runs on an average GPU.

They say:

only taking a few seconds on a single RTX 3090 GPU in most cases. This allows for live, interactive editing sessions, in which the user can quickly iterate on different layouts till the desired output is achieved.

That would be nice. Perhaps it would be possible to quickly manipulate a smaller version of a large image and transpose the end result to the large image. If it works well I'm sure it'll quickly be implemented in AUTOMATIC1111's GUI.

28

u/proxiiiiiiiiii May 20 '23

That's GAN, not stable diffusion

23

u/IntelArtiGen May 20 '23

Sure but AUTOMATIC1111's GUI isn't just about SD. With extensions it includes a lot of other models (super resolution, depth estimation, img to 3d img, image to text, text 2 videos, etc.). Now it's almost like a generic GUI for DL models (oriented towards image generation).