this post was submitted on 27 Mar 2025
77 points (85.3% liked)

Technology

68245 readers
6566 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related news or articles.
  3. Be excellent to each other!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
  9. Check for duplicates before posting, duplicates may be removed
  10. Accounts 7 days and younger will have their posts automatically removed.

Approved Bots


founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 24 points 1 week ago

Looks like a math improvement? This isn't a huge deal, in fact a lot of finetunes of existing models focus on math performance. InternLM just released some really interesting ones.

Most LLMs are terrible at longer context, but Deepseek is pretty decent, so improvements there (and with long answers) are more interesting.

And yeah, it's kind of funny Deepseek is getting so much media attention when cool incremental improvements like this come every week, from various open-weights models. It's awesome that they are releasing the weights, but still.