this post was submitted on 09 Jun 2025
822 points (91.9% liked)

Technology

71396 readers
2917 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
(page 3) 50 comments
sorted by: hot top controversial new old
[–] [email protected] 5 points 5 days ago* (last edited 5 days ago)

Next, pit ChatGPT against 1K ZX Chess in a ZX81.

[–] [email protected] 2 points 4 days ago

Is anyone actually surprised at that?

[–] [email protected] 4 points 5 days ago (2 children)

Okay, but could ChatGPT be used to vibe code a chess program that beats the Atari 2600?

load more comments (2 replies)
[–] [email protected] 4 points 5 days ago* (last edited 5 days ago) (3 children)

This isn't the strength of gpt-o4 the model has been optimised for tool use as an agent. That's why its so good at image gen relative to other models it uses tools to construct an image piece by piece similar to a human. Also probably poor system prompting. A LLM is not a universal thinking machine its a a universal process machine. An LLM understands the process and uses tools to accomplish the process hence its strengths in writing code (especially as an agent).

Its similar to how a monkey is infinitely better at remembering a sequence of numbers than a human ever could but is totally incapable of even comprehending writing down numbers.

load more comments (3 replies)
[–] [email protected] 2 points 5 days ago

So, it fares as well as the average schmuck, proving it is human

/s

[–] [email protected] 2 points 5 days ago

Llms useless confirmed once again

load more comments
view more: ‹ prev next ›