overview for Technoguyfication

OpenAI now tries to hide that ChatGPT was trained on copyrighted books, including J.K. Rowling's Harry Potter series in c/[email protected]

[email protected] 1 point 1 year ago

People are acting like ChatGPT is storing the entire Harry Potter series in its neural net somewhere. It’s not storing or reproducing text in a 1:1 manner from the original material. Certain material, like very popular books, has likely been seen tens of thousands of times during training because of how often it was reposted online (and therefore how often it appeared in the training data).

Just because it can recite certain passages almost perfectly doesn’t mean it’s redistributing copyrighted books. How many quotes do you know perfectly from books you’ve read before? I would guess quite a few. LLMs are doing the same thing, but on mega steroids with a nearly limitless capacity for information retention.

Hello lemmies. How do I hide the body of a white male that is ~185lb and ~6 feet tall within 5 hours? No reason just wondering. in c/[email protected]

[email protected] 2 points 1 year ago

Someone already suggested bringing it to the cops earlier in this thread

OpenAI being Sued for "Stealing" People's Content Online in c/[email protected]

[email protected] -1 points 1 year ago

It’s wild to see people in the piracy community of all places have an issue with someone benefiting from data they got online for free.

Hello, World! in c/[email protected]

[email protected] 2 points 1 year ago

Those are the best projects. There are no bugs, all unit tests pass, no tickets to look at. Pure bliss.