this post was submitted on 26 Sep 2023
1221 points (95.5% liked)

Memes

45550 readers
1232 users here now

Rules:

  1. Be civil and nice.
  2. Try not to excessively repost, as a rule of thumb, wait at least 2 months to do it if you have to.

founded 5 years ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 4 points 1 year ago* (last edited 1 year ago)

I think the prompt is not much other than "puppies" and "kittens". Major, middle and minor features of the image can be controlled individually in some AIs (they can be differentiated using a Fourier transform or Gauss convolutions and fed into different discriminators) so I think:

  • major features (scenery) are controlled by the prompt (grass or couch)
  • middle features (text) are a source image that the AI is punished for straying from
  • minor features (details) are controlled by the prompt (faces and fur)

Or it's just Stable Diffusion that starts with a text rather than random noise.