Technology

59312 readers

4528 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

[email protected]

618

ChatGPT, how do I use OCR in Word? (lemdro.id)

submitted 11 months ago by [email protected] to c/[email protected]

91 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 9 points 11 months ago (2 children)

Yeah is this linked with dall-e?

[–] [email protected] 13 points 11 months ago

It is. The paid version (GPT-4) is integrated with DALLE-3.

[–] [email protected] 3 points 11 months ago (3 children)

This has all the hallmarks of "human pretending to be an AI" rather than actual AI output

[–] [email protected] 7 points 11 months ago (1 children)

I disagree. This is as you say Precisely the type of thing that happens when an image generator is asked to make a chart/diagram, so to me it seems a really wild leap to go from "This looks like exactly what happens when X" to "someone must have designed this to look like what happens when X".

If it were human designed, I think it would be intentionally funny (which realistically would backfire, but anyway...)

(And besides, paid ChatGPT does indeed connect to DALL-E 3 now)

[–] [email protected] 0 points 11 months ago (1 children)

Tbf I thought DALL-E3 was still just available via bing image creator, missed the memo that ChatGPT was hooked up to it too.

Still, for me though it still looks like it's human generated to try and be funny (it's just haha-AI-so-silly isn't groundbreakingly funny any more). It's mostly the information continuity throughout the image that I've not really seen from an image generating AI before (especially when not even prompted for it), and I've had a play around with DALL-E3 so I would expect the ChatGPT version to be equivalent.

Maybe I'm too cynical, but this just reeks of fake to me.

[–] [email protected] 2 points 11 months ago* (last edited 11 months ago) (2 children)

I tried the same prompts as OP, it didn't generate an image at first instance - had to ask it to generate one. This is the image I got:

@[email protected]

[–] [email protected] 2 points 11 months ago

Ropy from pituge

[–] [email protected] 2 points 11 months ago

ChatGPT takes the liberty of creating a DALL-E prompt that it doesn't feel the need to share with the user. You can, however, ask ChatGPT to share the exact prompt and seed with you to reproduce the image. Here is the actual prompt and seed DALL-E ended up working with:

Prompt: "A step-by-step visual guide on using Optical Character Recognition (OCR) in Microsoft Word. The guide includes steps like opening Microsoft Word, inserting an image into a Word document, selecting the image, and using the OCR feature to convert the text in the image into editable text. The layout should be clear and easy to follow, with each step labeled and illustrated in a user-friendly manner, catering to users with basic proficiency in Microsoft Word."

Seed: 3993182816

To be clear, ChatGPT decided on its own to create and send this prompt to DALL-E in response to my request for tech support.

[–] [email protected] 2 points 11 months ago

That's how you know the AI is good! actually.

[–] [email protected] 2 points 11 months ago (1 children)

Why do you think that?