this post was submitted on 17 Apr 2024
180 points (96.4% liked)

Technology

59174 readers
3700 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Internet-scraping outfit Spy.pet claims to have harvested more than four billion public messages made by nearly 620 million users on more than 14,000 Discord chat servers – and is selling access to this trove.

The website presents the data it's collected in several ways. Each known user has a profile, which contains all known aliases, pronouns, connected accounts to other platforms such as Steam and GitHub, Discord servers joined, and public messages. If you wanted to quite literally spy on a Discord user or users, Spy.pet lets you do that, for a fee.

top 19 comments
sorted by: hot top controversial new old
[–] [email protected] 64 points 6 months ago (3 children)

Hmm they could have used that for good and solved Discords inherent un-searchability and un-archivability (Like old school forums used to have)

But no, they choose evil instead :(

[–] [email protected] 26 points 6 months ago

Forums need to come back, I die a little inside whenever I see a Discord server linked for support. Ugh.

[–] [email protected] 12 points 6 months ago

Yea no clue why the fuck people jumped to discord over Lemmy or old school forums. It's a shit medium for that style of postings.

[–] [email protected] 6 points 6 months ago

Not enuf money in being good, being evil pays

[–] [email protected] 31 points 6 months ago (1 children)

Discord users are being tracked by Discord anyway.

[–] [email protected] 18 points 6 months ago (1 children)

They are, but only discord has that data. Now any dick that wants to harass somebody can get a LOT of info on people very easily.

Discord shouldn't have all that data either but you get my point.

[–] [email protected] 10 points 6 months ago (2 children)

Always the case with public information, everything is scraped. Anyone can join a public discord server, so anyone can see every message posted there. The real crime is the lack of encryption in private messages.

[–] [email protected] 1 points 6 months ago (1 children)

Yeah but generally it's unfeasible to find out every (public) server a particular user is in, now you can just search for them.

It, at the very least, lowers the barrier to stalking by a lot.

[–] [email protected] 2 points 6 months ago* (last edited 6 months ago)

Yeah, but that's an unfortunate side effect of a public internet, they tell you to be careful online for a reason. You should have no expectation of privacy when using Discord.

[–] [email protected] 1 points 6 months ago (1 children)

It says public messages are scraped in the article, but doesn't mention anything about private ones either way so it seems at the very least, PMs are harder to get.

[–] [email protected] 3 points 6 months ago

Yeah, they are just slapping a bot in public servers and scraping all public facing data such as profile data and chat messages. Still, no reason for PMs to be plain-text in general, but hey, that's Discord!

[–] [email protected] 11 points 6 months ago (1 children)

Good thing I have 5 Gmail accounts and never use my main for anything.

Go ahead. Track [email protected] and all my preferences.

[–] [email protected] 20 points 6 months ago (1 children)

never use my main for anything

Are you sure it's your main?

[–] [email protected] 5 points 6 months ago (1 children)

I use it for professional and personal things. Friends, family, official forms like for work, etc. and it's 5 characters, readable, no numbers. I could legit sell it for >$1000 and it doesn't get any spam beyond shit from my CC offering me deals and other stuff that makes a tangible amount of sense to be on my main email.

But like, not my Lemmy or former reddit account(s), or Facebook (yes family, etc)...

I even have low value Gmail accts that I use for like... Testing scams or something.

[–] [email protected] 2 points 6 months ago (1 children)

Testing scams

Any non-scams yet? Or is it safe to assume anything that looks like a scam, is one?

[–] [email protected] 4 points 6 months ago

Nah I've found plenty of non-scams. Like $1 VSTs on some random guys domain (used a visa gift card), stuff like that.

But yeah lots of scams.

[–] [email protected] 7 points 6 months ago

This is the best summary I could come up with:


Updated Internet-scraping outfit Spy.pet claims to have harvested more than four billion public messages made by nearly 620 million users on more than 14,000 Discord chat servers – and is selling access to this trove.

Yes, all the info is already public in a way – Discord is kinda like IRC on steroids – and it's a reminder that it's not impossible to gather up all this chatter using bots for various purposes (if not surveillance then training AI models.)

Each known user has a profile, which contains all known aliases, pronouns, connected accounts to other platforms such as Steam and GitHub, Discord servers joined, and public messages.

As a side note, the footer of Spy.pet has some interesting content, such as a link to a video of TempleOS developer Terry Davis dancing, a "Transparency" page that just says the word "transparency," and a link to the "Request Removal" page that actually just plays the meme clip of newspaper editor J. Jonah Jameson laughing at Peter Parker in the 2004 movie Spider-Man 2.

Speaking of which, Spy.pet has a potentially interesting interpretation of the European Union's General Data Protection Regulation (GDPR), as pointed out by the Stack Diary blog this week.

The US FTC also doesn't take the harvesting and selling of children's data lightly, as it just opened a lawsuit against Meta in November on this topic.


The original article contains 644 words, the summary contains 228 words. Saved 65%. I'm a bot and I'm open source!

[–] [email protected] -2 points 6 months ago

Yes, it's that discord . com website

[–] [email protected] -4 points 6 months ago

looks like i have to wait to use discord