Ask Lemmy

26778 readers

1359 users here now

A Fediverse community for open-ended, thought provoking questions

Please don't post about US Politics. If you need to do this, try [email protected]

Rules: (interactive)

1) Be nice and; have fun

Doxxing, trolling, sealioning, racism, and toxicity are not welcomed in AskLemmy. Remember what your mother said: if you can't say something nice, don't say anything at all. In addition, the site-wide Lemmy.world terms of service also apply here. Please familiarize yourself with them

2) All posts must end with a '?'

This is sort of like Jeopardy. Please phrase all post titles in the form of a proper question ending with ?

3) No spam

Please do not flood the community with nonsense. Actual suspected spammers will be banned on site. No astroturfing.

4) NSFW is okay, within reason

Just remember to tag posts with either a content warning or a [NSFW] tag. Overtly sexual posts are not allowed, please direct them to either [email protected] or [email protected]. NSFW comments should be restricted to posts tagged [NSFW].

5) This is not a support community.

It is not a place for 'how do I?', type questions. If you have any questions regarding the site itself or would like to report a community, please direct them to Lemmy.world Support or email [email protected]. For other questions check our partnered communities list, or use the search function.

Reminder: The terms of service apply here too.

Partnered Communities:

Logo design credit goes to: tubbadu

founded 1 year ago

MODERATORS

[email protected]

154

If you could ask one question and get a 100% true answer, what would it be and why? (lemmy.blahaj.zone)

submitted 1 year ago by [email protected] to c/[email protected]

257 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 1 points 1 year ago (1 children)

This is a pet peeve of mine right up there with the never ending stream of people calling machine learning AI. We do not have any real kind of AI at all at the moment but I digress.

LLM is literally just a probability engine. LLM's are trained on huge libraries of content. What they do is assign a token(id) to each word (or part of word) and then note down the frequency of the words before and after the word as well as looking specifically for words that NEVER come before or after the word in question.

This creates a data set that can be compared to other tokenized words. Words with vary similar data sets can often be replaced with each other with no detriment to the sentence being created.

There is something called a transformer that has changed how efficiently LLM'S work and has allowed parsing of larger volumes by looking at the relation of each tokenized word to every word in the sentence simultaneously instead of one at a time which generates better more accurate data.

But the real bread and butter comes when it starts generating new text it starts with a word and literally chooses the most probable word to come next based off of its extensive training data. It does this over and over again and looks at the ending probability of the generated text. If it's over a certain threshold it says GOOD ENOUGH and there is your text.

You as a human (I assume)do this kind of thing all ready. If someone walked up too you and said "Hi! How are you..." by the time they got there you have probably already guessed that the next words are going to be "doing today?" Or some slight variation thereof. Why were you able to do this? Because of your past experiences, aka, trained data. Because of the volume of LLM'S data set it can guess with surprisingly good accuracy what comes next. This however is why the data it is trained on is important. If there were more people writing more articles,more papers,more comments about how the earth was flat vs people writing about it being round then the PROBABLE outcome is that the LLM would output that the earth is flat because that's what the data says is probable.

There are variations called the Greedy Search and the Beam Search but they are difficult for me to explain but still just variations of a probability generator.

[–] [email protected] 2 points 1 year ago

I mean yeah, and if I were trained on more articles and papers saying the earth was flat then I might say the same.

I'm not disputing what you've written because it's empirically true. But really, I don't think brains are all that more complex when it comes down to decision making and output. We receive input, evaluate our knowledge and spit out a probable response. Our tokens aren't words, of course, but more abstract concepts which could translate into words. (This has advantages in that we can output in various ways, some non-verbal - movement, music - or combine movement and speech, e.g. writing).

Our two major advantages: 1) we're essentially ongoing and evolving models, retrained constantly on new input and evaluation of that input. LLMs can't learn past a single conversation, and that conversational knowledge isn't integrated into the base model. And 2) ongoing sensory input means we are constantly taking in information and able to think and respond and reevaluate constantly.

If we get an LLM (or whatever successor tech) to that same point and address those two points, I do think we could see some semblance of consciousness emerge. And people will constantly say "but it's just metal and electricity", and yeah, it is. We're just meat and electricity and somehow it works for us. We'll never be able to prove any AI is conscious because we can't actually prove we're conscious, or even know what that really means.

This isn't to disparage any of your excellent points by the way. I just think we overestimate our own brains a bit, and that it may be possible to simulate consciousness in a much simpler and more refined way than our own organically evolved brains, and that we may be closer than we realize.