[–] [email protected] 51 points 6 days ago (4 children)

Code written by AI is really poorly written. A couple of smells I’ve noticed:

  • Instead of handling error cases, it leans heavily on try/catch blocks, making critical bugs invisible but still present. Dangerous shit. (See the sketch after this list.)
  • It doesn’t reuse code that already exists in the project. You have to do a lot of extra work letting it know that your helper functions or CSS utility classes exist.
  • It writes things in a very “brute force” way. If it finds any solution, it charges forward with the implementation even if there is a much simpler way. It never thinks “but is there a better way?”
  • Likewise, it rarely uses the actual documentation for your library. It goes entirely off of gut instincts. Half the time if you paste in a documentation page, it finally shapes up and builds the code right. That should be default behavior.
  • It has a strong tendency to undo manual changes I have made, because it doesn’t know the reasons why I made them.
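
To make the try/catch smell concrete, here’s a minimal Python sketch (the function and file names are made up for illustration) of the error swallowing I mean, next to the fail-fast version:

```python
import json

# The smell: a blanket try/except that swallows everything.
# The caller gets None, and the real bug surfaces far away, or never.
def load_settings_swallowed(path):
    try:
        with open(path) as f:
            return json.load(f)
    except Exception:
        return None  # missing file? bad JSON? permissions? who knows

# Fail fast: don't catch what you can't handle,
# so the traceback points at the actual problem.
def load_settings(path):
    with open(path) as f:      # FileNotFoundError propagates loudly
        return json.load(f)    # json.JSONDecodeError propagates loudly
```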

On the other hand, if you’re in a greenfield project and need to throw up some simple, dirty CSS/HTML for a quick marketing page, sure, let the AI bang it out. Some projects don’t need to be done well, they just need to be done fast.

And the autocomplete features can be a time saver in some cases regardless.

[–] [email protected] 24 points 6 days ago* (last edited 6 days ago) (3 children)

I just spent about a month using Claude 3.7 to write a new feature for a big OSS product. The change ended up being about 6k LOC, plus about 14k lines of tests, added to an existing codebase that already had a test framework to use as a reference.

For context I'm a principal-level dev with ~15 years experience.

The key to making it work for me was treating it like a junior dev. That includes priming it ("accuracy is key here; we can't swallow errors, we need to fail fast where anything could compromise it") as well as making it explain itself, show architecture diagrams, and reason based on the results.

After every change there's always a pass of "okay but you're violating the layered architecture here; let's refactor that; now tell me what the difference is between these two functions, and shouldn't we just make the one call the other instead of duplicating? This class is doing too much, we need to decompose this interface." I also started a new session, set its context with the code it just wrote, and had it tell me about assumptions the code base was making, and what failure modes existed. That turned out to be pretty helpful too.
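
As a toy example of the “make the one call the other” pass (the names here are invented, not from the actual PR), this is the shape of the de-duplication I kept asking for:

```python
# Before: the model duplicated the formatting logic in both functions.
def format_user(user):
    return f"{user['last'].upper()}, {user['first']}"

def format_users_duplicated(users):
    return [f"{u['last'].upper()}, {u['first']}" for u in users]

# After the review pass: the plural version just calls the singular one.
def format_users(users):
    return [format_user(u) for u in users]
```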

In my own personal experience it was actually kinda fun. I'd say it made me about twice as productive.

I would not have said this a month ago. Up until this project, I only had stupid experiences with AI (Gemini, GPT).

[–] [email protected] 15 points 6 days ago

Agreed. I use it in my daily workflow but you as the senior developer have to understand what can and cannot be delegated, and how to stop it from doing stupid things.

For instance when I work in computer vision or other math-heavy code, it’s basically useless.

[–] [email protected] 13 points 6 days ago (1 children)

Typically, working with a junior on a project is slower than not working with them at all. It's a little odd that you describe it as being like that and also say it made you faster.

[–] [email protected] 12 points 6 days ago

I don't think it's odd, because LLMs are just way faster than any junior (or senior) dev. So it's more like working with four junior devs, but with the benefit of having tasks done sequentially, without the additional overhead of handing tasks out to individual juniors and context switching to review their changes.

(Obviously, there are a whole lot of new pitfalls, but there are real benefits in some circumstances.)

[–] [email protected] 3 points 6 days ago (1 children)
[–] [email protected] 3 points 6 days ago

The PR isn't public yet (it's in my fork) but even once I submit it upstream I don't think I'm ready to out my real identity on Lemmy just yet.

[–] [email protected] 11 points 6 days ago

Sounds about right. I had a positive experience when I told my local LLM to refactor a function and add a single argument.

I would not dare let it loose on a whole source file, because it changes random things, giving you more code to review.

In my view, current LLMs do an acceptable job with:

  • Adding comments
  • Writing docstrings (see the sketch after this list)
  • Writing git commit messages
  • Simple tasks on small pieces of code
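
For the docstring case, the size of task I mean is something like this (toy function, purely illustrative):

```python
def clamp(value, lo, hi):
    """Clamp value to the inclusive range [lo, hi].

    Args:
        value: The number to clamp.
        lo: Lower bound of the range.
        hi: Upper bound of the range.

    Returns:
        value if it lies within [lo, hi], otherwise the nearest bound.
    """
    return max(lo, min(value, hi))
```
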
[–] [email protected] 10 points 6 days ago

Yeah, likewise. I think it shows that the primary weakness of LLMs right now is not skill or understanding, but context.

It can't use functions or docs that it doesn't know about. Neither can I. RAG systems are supposed to solve this, but in practice they don't seem to work that great.
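
The core RAG idea is just “look up the relevant docs and paste them into the prompt.” A toy sketch of that loop, with naive keyword overlap standing in for a real embedding search (the docs and names here are invented):

```python
# Toy RAG loop: retrieve the most relevant doc snippet, then
# prepend it to the prompt so the model can actually see it.
DOCS = {
    "helpers.md": "format_user(user) renders a user as 'LAST, First'.",
    "css.md": "Use the .stack utility class for vertical spacing.",
}

def retrieve(query, k=1):
    # Real systems use embeddings; word overlap is enough for a sketch.
    def score(text):
        return len(set(query.lower().split()) & set(text.lower().split()))
    return sorted(DOCS.values(), key=score, reverse=True)[:k]

def build_prompt(query):
    context = "\n".join(retrieve(query))
    return f"Project docs:\n{context}\n\nTask: {query}"

print(build_prompt("how should I format a user for display"))
```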

[–] [email protected] 5 points 5 days ago

> It doesn’t reuse code that already exists in the project

I had a pissing contest with one of the junior guys over this. He didn’t seem to understand why we should use the existing function, and he had learned so little about the codebase that he didn’t know where to find it. He’s gone.

The more interesting flaw in his AI code was that it hallucinated an entirely different mocking tool for the unit tests.
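
For the curious, the failure looked roughly like this (the test is a made-up stand-in; the hallucinated package at the bottom is exactly the kind of thing that doesn’t exist):

```python
# What the codebase actually uses: the stdlib mocking tool.
import json
from unittest.mock import patch

def test_parse_uses_loads():
    with patch("json.loads", return_value={"ok": True}) as fake_loads:
        assert json.loads("ignored") == {"ok": True}
        fake_loads.assert_called_once()

# What the generated tests reached for instead -- an import that
# fails because no such package was ever in the project:
# from mockingjay import stub
```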