this post was submitted on 07 Oct 2023
[–] [email protected] 3 points 1 year ago* (last edited 1 year ago)

This isn't true, provided the training set is large enough. These models are stochastic: they sample their output rather than look it up, so with enough parameters and enough training data they can generate genuinely novel content. For example, I strongly doubt you'd be able to find anything remotely resembling the following anywhere, ever (look up what the movie is about, and watch it, to understand the absurdity of my request), and yet it was generated by ChatGPT:

https://chat.openai.com/share/803f2633-8682-45f0-b999-3bede5c02c21

If you read interviews with the people who developed these models, you'll see them say what the link above makes clear: with a large enough training set, these models start to learn something about the organization of language itself, and about how to generate novel content.

The architecture these models are built on is loosely modeled on how our brains work, and the process by which they learn language isn't entirely unlike how we learn it.
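
To make the "stochastic" point a bit more concrete, here's a rough sketch of what sampling the next word looks like. It's a toy, not how ChatGPT is actually implemented: the function name `sample_next_token`, the `toy_logits` scores, and the example sentence are all made up for illustration. A real model computes scores like these over its whole vocabulary with a neural network.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, rng=random):
    """Sample one token from a dict of token -> unnormalized score."""
    tokens = list(logits)
    # Softmax with temperature; subtract the max for numerical stability.
    scaled = [logits[t] / temperature for t in tokens]
    peak = max(scaled)
    weights = [math.exp(s - peak) for s in scaled]
    # Draw one token at random, in proportion to its weight.
    return rng.choices(tokens, weights=weights, k=1)[0]


# Hypothetical scores for the word after "The cat sat on the".
toy_logits = {"mat": 2.0, "sofa": 1.5, "moon": 0.2, "spreadsheet": -1.0}

rng = random.Random(0)  # seeded only so the run is repeatable
samples = [sample_next_token(toy_logits, temperature=1.0, rng=rng) for _ in range(1000)]
counts = {token: samples.count(token) for token in toy_logits}
print(counts)
```

The likely words come up most often, but the unlikely ones still get picked now and then. Repeat that over hundreds of words in a row and you end up with a sequence that almost certainly doesn't appear anywhere in the training data. Lowering `temperature` toward zero makes the choice nearly deterministic, and raising it makes the output more varied.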