Technology

59123 readers

2294 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

founded 1 year ago

MODERATORS

[email protected]

265

OpenAI has built a text watermarking method to detect chatgpt written content (www.tomshardware.com)

submitted 2 months ago by [email protected] to c/[email protected]

69 comments fedilink hide all child comments

Am I missing something? The article seems to suggest it works via hidden text characters. Has OpenAI never heard of pasting text into a utf8 notepad before?

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 2 points 2 months ago

They could, but adding random zero width characters into words would also destroy ever spell checker, giving it away immediately and making sure that even unaware people would filter it. Doing it outside the words would leave them with too few spots to use for proper watermarking.

I think it's far more likely they'll use some kind of pattern in the tokens - that way the watermark will remain even when you don't copypaste it.

But yeah, as said, they will never tell how it's implemented, but it can still be simply subverted.