this post was submitted on 23 Feb 2024
612 points (96.8% liked)

Reddit said in a filing to the Securities and Exchange Commission that its users’ posts are “a valuable source of conversation data and knowledge” that has been and will continue to be an important mechanism for training AI and large language models. The filing also states that the company believes “we are in the early stages of monetizing our user base,” and proceeds to say that it will continue to sell users’ content to companies that want to train LLMs and that it will also begin “increased use of artificial intelligence in our advertising solutions.”

On Wednesday, Reuters reported that Reddit has entered a contract with Google, which will license its content for $60 million a year in order to train Google’s AI models.

[–] [email protected] 10 points 8 months ago (10 children)

Question: Wouldn't Lemmy instances easily be able to do this without many users knowing?

And would they also be able to sell data from other instances, because they can load data from federated instances?

[–] [email protected] 4 points 8 months ago (4 children)

Technically? Probably, yes. Legally? I don't think so (never looked into it)

[–] [email protected] 4 points 8 months ago (2 children)

Why do you believe they wouldn't legally be able to?

[–] [email protected] 3 points 8 months ago

It's the whole copyright question. Users own the copyright on their own posts, and it's the terms of service that are supposed to say what the server and other federated servers are allowed or not allowed to do with them. I don't even remember if there were terms of service when I joined Lemmy... But assuming there were, and they didn't explicitly say whether the home server or federated servers can use user content to train AI, then it becomes a legal question that only the courts can settle.

Note that this determination will only apply in the country/state where that court is.

IANAL

[–] [email protected] 2 points 8 months ago (1 children)

And why would anyone believe they'd stop if it wasn't legal?

[–] [email protected] 2 points 8 months ago