this post was submitted on 25 Oct 2023
117 points (89.3% liked)

Technology

34889 readers
313 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 13 points 1 year ago (6 children)

Appending reddit to google search has become the only way to get meaningful search results, without it it's a shitshow of clickbait garbage, I can't imagine what it will become if it's not allowed anymore to index reddit data.

I understand companies not wanting data to be scraped for AI training for free, it's not only reddit according to the article, also news sites, I think it's a legit concern.

I believe at this point governments should wake up and regulate the matter of AI training globally, leaving it to individual companies will only damage users all over the world.

[–] [email protected] 4 points 1 year ago (2 children)

If you regulate AI, you kill any open source or small time endeavors and turn the whole thing into a shit show. You need vast amounts of data to train models and only a few companies either have it or can afford what they are missing.

Our whole economy is going to be AI driven soon, google and Microsoft would literally own us.

I also think Reddit just aggregated that content. Us, the consumer, don't deserve to get shafted and see AI costs explode just so spez can make a fat pay day off the content we created.

[–] [email protected] 4 points 1 year ago (1 children)

Regulating doesn't mean blocking, AI needs to be regulated, it should have been already done, look at stuff like deep fakes, some done even with dead people, fakes with actors faces and voices without their consent, and so on, it's not just about training, it's also about how the results are effectively used.

And the fact the training is expensive doesn't mean everyone should have free reign about it, especially when noone cares about the reliability of the datasets they're using, of the ethical aspects of it.

As for reddit, we've been already shafted, that's why we're on lemmy now.

[–] [email protected] 2 points 1 year ago

You mentioned regulating right after scraping so I thought it pertained to that.

Also when I say expensive, I mean prohibitively so in a way that creates a soft monopoly. And when you couple that with the very real possibility that AI replaces most desk work in the coming decades, its bleak.

That being said, I totally agree deepfakes and all that need to be regulated but only on the platforms distributing it imo. Most seem to want to regulate how the technology itself works, gimping it and forcing filters on the user. All of which can really only be done by stopping users from running it locally.

I think anything other than the lightest touch would be disastrous for both us and the product.

I'm curious where you would start. I have some thoughts but mainly only a strict opt out policy for individuals.

load more comments (3 replies)