this post was submitted on 06 Feb 2024
119 points (98.4% liked)

Technology

34698 readers
459 users here now

This is the official technology community of Lemmy.ml for all news related to creation and use of technology, and to facilitate civil, meaningful discussion around it.


Ask in DM before posting product reviews or ads. All such posts otherwise are subject to removal.


Rules:

1: All Lemmy rules apply

2: Do not post low effort posts

3: NEVER post naziped*gore stuff

4: Always post article URLs or their archived version URLs as sources, NOT screenshots. Help the blind users.

5: personal rants of Big Tech CEOs like Elon Musk are unwelcome (does not include posts about their companies affecting wide range of people)

6: no advertisement posts unless verified as legitimate and non-exploitative/non-consumerist

7: crypto related posts, unless essential, are disallowed

founded 5 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 1 points 8 months ago (1 children)

I agree, but I think that computational power requirements already do that – complex models that do interesting stuff need a bunch of special v-cards to train for days, and they need a lot of data to train on – so it's natural that those who already have data and money to process it get there first.

I think, their argument is not even about their monopoly, but to shut down the question of why and how to trust THEM with policing their LLMs before it happened. Open system can be investigated and we can find out that they over or underregulated some stuff, made it biased, find copyrighted materials, personal information, gore or CSAM in their training samples et cetera. They save metric tons of possible lawsuits by making it a rule in the industry that no one can see under the roof of their machines.

[–] [email protected] 1 points 8 months ago

Initial training of the models is expensive, but a trained model can be run on a laptop from that point. The problem of initial training can also be addressed by doing it in distributed fashion. There are also open source projects, such as Petals, that allow you running distributed models Bittorrent style. Other approaches like LoRA allow taking existing models and turning them for a particular task without the need to do training from scratch. There's a pretty good article from Steve Yegge on the recent advances in open source models.

I do agree that avoiding regulation and scrutiny are most definitely additional goals these companies have. They want to keep this tech opaque and frame themselves as responsible guardians of the technology that shouldn't fall into the hands of unwashed masses who can't be trusted with it.