Why is txt2img AI so bad with hands? (lemmygrad.ml)

submitted 1 year ago by [email protected] to c/[email protected]


Pls explain


[–] [email protected] 10 points 1 year ago (3 children)

Hands are really complicated, even to draw. Everything else is relatively easy for an AI to guess: faces are usually looking at the camera or sideways, but hands have like a thousand different positions and poses. It's hard for the AI to guess what the hands should look like and where the fingers should be. It doesn't help that people have historically been bad at drawing hands, so there's a lot of garbage in the data.

[–] [email protected] 1 points 1 year ago (2 children)

That's true, but I would have thought the models would be able to "understand" hands, since I'm assuming they have seen millions of photographs with hands in them by now.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago)

I think it's helpful to remember that the model doesn't have a skeleton; it's literally skin deep. It doesn't understand hands, it understands pixels. Without an understanding of the actual structure, all the AI can do is guess where the pixels go based on neighboring pixels.
