this post was submitted on 15 Mar 2024
491 points (95.4% liked)

Technology

60033 readers
2683 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 128 points 9 months ago (2 children)

They know what they fed the thing. Not backing up their own training data would be insane. They are not insane, just thieves

[–] [email protected] 18 points 9 months ago (5 children)

Everyone says this but the truth is copyright law has been unfit for purpose for well over 30 years now. And the lords were written no one expected something like the internet to ever come along and they certainly didn't expect something like AI. We can't just keep applying the same old copyright laws to new situations when they already don't work.

I'm sure they did illegally obtain the work but is that necessarily a bad thing? For example they're not actually making that content available to anyone so if I pirate a movie and then only I watch it, I don't think anyone would really think I should be arrested for that, so why is it unacceptable for them but fine for me?

[–] [email protected] 22 points 9 months ago (1 children)

if I pirate a movie and then only I watch it, I don't think anyone would really think I should be arrested for that

There are definitely people out there that think you should be arrested for that.

[–] [email protected] 2 points 9 months ago (1 children)

Even the police are unsure if it's actually a crime though. Crimes require someone to lose something and no one can point to a lost product so it's difficult to really quantify.

And it's not even technically breach of copyright since you're not selling it.

[–] [email protected] 2 points 9 months ago (2 children)

But they ARE selling it ... Every answer Chat GPT makes came from possibly stolen material

[–] [email protected] 2 points 9 months ago (1 children)

Isn't that true of every opinion you have. All the knowledge you have is based on works of others that came before you.

[–] [email protected] 3 points 9 months ago (1 children)

Not untill I bill you for it

Also, no there is such a thing as an original thought or opinion... Even if it's informed on other knowledge

There is a difference between reinterpreting other knowledge and just Frankensteining multiple work together

[–] [email protected] 1 points 9 months ago

I don't know enough about LLMs but Neural networks are capable of original thought. I suspect LLMs are too because of their relationship to Neural Networks.

[–] [email protected] 0 points 9 months ago (2 children)

You're using the word 'stolen' which doesn't fit. It would be accurate to say 'every answer comes from possibly unlicensed material '.

[–] [email protected] 1 points 9 months ago

Allegedly possibly maybe accidentally whoopsie not quite licensed fully material.

[–] [email protected] 0 points 9 months ago

Yeap, the real term (I think) would be copyright infringement

[–] [email protected] 14 points 9 months ago

That is a bad thing if they want to be exempt from the law because they are doing a big, very important thing, and we shouldn't.

The copyright laws are shit, but applying them selectively is orders of magnitude worse.

[–] [email protected] 2 points 9 months ago (1 children)

if I pirate a movie and then only I watch it, I don't think anyone would really think I should be arrested for that, so why is it unacceptable for them but fine for me?

Because it's more analogous to watching a video being broadcasted outdoors in the public, or looking at a mural someone painted on a wall, and letting it inform your creative works going forward. Not even recording it, just looking at it.

As far as we know, they never pirated anything. What we do know is it was trained on data that literally anybody can go out and look at yourself and have it inform your own work. If they're out here torrenting a bunch of movies they don't own or aren't licencing, then the argument against them has merit. But until then, I think all of this is a bunch of AI hysteria over some shit humans have been doing since the first human created a thing.

[–] [email protected] 2 points 9 months ago (1 children)

An AI (in its current form) isn't a person drawing inspiration from the world around it, it's a program made by people with inputs chosen by those people. If those people didn't ask permission to use other people's licensed work for their product, then they are plagiarising that work, and they should be subject to the same penalties that, for example, a game company using stolen art in their game should face. An AI doesn't become inspired, it copies existing things to predict what it thinks its user wants to see. If we produce a real thinking AI at some point in the future, one with self determination and whatnot, the story will be different, but for now it isn't.

[–] [email protected] 2 points 9 months ago

What is web scraping if not gathering information from around the world? As long as you're not distributing copyrighted content (and the models in question here don't, btw), then fair use is at play. I'm not plagiarizing the news by reading it or by talking about what I learned, but I would be if I just copy/pasted my response from the article.

Reading publicly available data isn't a copyright violation, and it certainly isn't a violation of fair use. If it were, then you just plagiarized my comment by reading it before you responded.

[–] [email protected] -1 points 9 months ago (2 children)

Because the actual comparison is that you stole ALL movies, started your own Netflix with them and are lining up to literally make billions by taking the jobs of millions of people, including those you stole from

[–] [email protected] 2 points 9 months ago

I would say it is closer to watching all the movies, regardless of how you got them, then taught a film class at UCLA.

[–] [email protected] 1 points 9 months ago* (last edited 9 months ago) (1 children)

If I paint a melty clock hanging off of a table, how have I stolen from Salvador Dali? What did I "steal" from Tolkien when I drew this?

you stole ALL movies, started your own Netflix with them

The model in question can't even try to distribute copyrighted material. You could have easily checked for yourself, but once again I find myself having to do the footwork for you guys.

[–] [email protected] -2 points 9 months ago (1 children)

If you sell your melty clock yes, it not "stealing" but you are violating copyright, that's how it works

The "model in question" is a bit of a prototype, I thought is was clear we are talking about where these models are going.... Maybe you'd get it if you came down of your high horse

[–] [email protected] 2 points 9 months ago

Dali doesn't own the concept of a melting clock. If I include a melting clock in my own work, as long as it's not his melting clock with all the other elements of his painting, it's fair use.

GPT hasn't been a prototype since before 2018, and the copyright restrictions are only getting tighter every time it's updated so idk what you're on about.

[–] [email protected] 2 points 9 months ago* (last edited 9 months ago)

That's really not how it works though, it's a web crawler they're not going to download the whole internet

And a reason they don't is it would actually potentially be copywrite infringement in some cases where as what they do legally isn't (no matter how much people wish the law was set based on their emotions)