this post was submitted on 03 Sep 2024
1580 points (97.8% liked)
Technology
59347 readers
5956 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
So search engines shouldn't exist?
Perhaps. Or perhaps not in the way they do today. Perhaps if you profit from placing ads among results people actually want, you should share revenue with those results. Cause you know, people came to you for those results and they're the reason you were able to show the ads to people.
Case law has been established in the prevention of actual image and text copyright infringement with Google specifically. Your point is not at all ambiguous. The distinction between a search engine and content theft has been made. Search engines can exist for a number of reasons but one of those criteria is obeisance of copyright law.
I mean, their goal and service is to get you to the actual web page someone else made.
What made Google so desirable when it started was that it did an excellent job of getting you to the desired web page and off of google as quickly as possible. The prevailing model at the time was to keep users on the page for as long as possible by creating big messy "everything portals".
Once Google dropped, with a simple search field and high quality results, it took off. Of course now they're now more like their original competitors than their original successful self ... but that's a lesson for us about what capitalistic success actually ends up being about.
The whole AI business model of completely replacing the internet by eating it up for free is the complete sith lord version of the old portal idea. Whatever you think about copyright, the bottom line is that the deeper phenomenon isn't just about "stealing" content, it's about eating it to feed a bigger creature that no one else can defeat.
I really think it's mostly about getting a big enough data set to effectively train an LLM.
I mean, yes of course. But I don't think there's any way in which it is just about that. Because the business model around having and providing services around LLMs is to supplant the data that's been trained on and the services that created that data. What other business model could there be?
In the case of google's AI alongside its search engine, and even chatGPT itself, this is clearly one of the use cases that has emerged and is actually working relatively well: replacing the internet search engine and giving users "answers" directly.
Users like it because it feels more comfortable, natural and useful, and probably quicker too. And in some cases it is actually better. But, it's important to appreciate how we got here ... by the internet becoming shitter, by search engines becoming shitter all in the pursuit of ads revenue and the corresponding tolerance of SEO slop.
IMO, to ignore the "carnivorous" dynamics here, which I think clearly go beyond ordinary capitalism and innovation, is to miss the forest for the trees. Somewhat sadly, this tech era (approx MS windows '95 to now) has taught people that the latest new thing must be a good idea and we should all get on board before it's too late.
No, it legitimately is better. Do you know what Google could never do but that Copilot Search and Gemini Search can? Synthesize one answer from multiple different sources.
Sometimes the answer to your question is inherently not on a single page, it's split across the old framework docs and the new framework docs and stack overflow questions and the best a traditional search engine can ever do is maybe get some of the right pieces in front of you some of the time. LLMs will give you a plain language answer immediately, and let you ask follow up questions and modifications to your original example.
Yes Google has gotten shitty, but it would never have been able to do the above without an LLM under the hood.
Sure, but IME it is very far from doing the things that good, well written and informed human content could do, especially once we're talking about forums and the like where you can have good conversations with informed people about your problem.
IMO, what ever LLMs are doing that older systems can't isn't greater than what was lost with SEO ads-driven slop and shitty search.
Moreover, the business interest of LLM companies is clearly in dominating and controlling (as that's just capitalism and the "smart" thing to do), which means the retention of the older human-driven system of information sharing and problem solving is vulnerable to being severely threatened and destroyed ... while we could just as well enjoy some hybridised system. But because profit is the focus, and the means of making profit problematic, we're in rough waters which I don't think can be trusted to create a net positive (and haven't been trust worthy for decades now).