this post was submitted on 19 Dec 2023
1134 points (91.5% liked)

Technology

60052 readers
2865 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 69 points 1 year ago (1 children)

You are posting publicly online. It's all scrapeable.

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (5 children)

Yes, but I doubt Meta scrapes Reddit or Lemmy, for instance. With this change we’ll just be delivering it to them on a platter. And, knowing Meta, they’ll find ways to use the data.

[–] [email protected] 19 points 1 year ago

Even if Meta doesn't do it themselves there are likely hundreds of companies that do, and Meta can pay them for the data they want.

[–] [email protected] 18 points 1 year ago

If it's visible, you're best assuming that Meta, Google, Amazon, the CIA, everyone, has a copy of it and are linking it all together behind the scenes.

At least this way they don't get your IP address or linked advertising cookies. Here you're just a username and whatever you post. Unless you browse and post directly on threads that is. Those guys get all their milkshake drunk.

[–] [email protected] 11 points 1 year ago

but I doubt Meta scrapes Reddit or Lemmy, for instance

Why do you doubt that?

[–] [email protected] 6 points 1 year ago

If it's on the darkweb or deepweb then MAYBE they are not, but the reason the rest of the web is not considered part of those groups is because Google/Meta/Microsoft/etc scrape it, categorize it, and process it.

[–] [email protected] 1 points 1 year ago* (last edited 1 year ago)

If they don't scrape Reddit, it's because scraping Reddit costs money. Because they closed down their free-of-charge APIs, remember? Which they did because people were scraping their data for free.

Scraping Lemmy is free, and most probably will always remain that way.