Programming

26048 readers

61 users here now

Welcome to the main community in programming.dev! Feel free to post anything relating to programming here!

Cross posting is strongly encouraged in the instance. If you feel your post or another person's post makes sense in another community cross post into it.

Hope you enjoy the instance!

Rules

Follow the programming.dev instance rules
Keep content related to programming in some way
If you're posting long videos try to add in some form of tldr for those who don't want to watch videos

Wormhole

Follow the wormhole through a path of communities !webdev@programming.dev

founded 2 years ago

MODERATORS

snowe@programming.dev

Ategon@programming.dev

UlrikHD@programming.dev

bugsmith@programming.dev

Spyro@programming.dev

Newer AI Coding Assistants Are Failing in Insidious Ways (spectrum.ieee.org)

submitted 2 months ago by brianpeiris@lemmy.ca to c/programming@programming.dev

17 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] Kissaki@programming.dev 13 points 2 months ago (1 children)

A task that might have taken five hours assisted by AI, and perhaps ten hours without it, is now more commonly taking seven or eight hours, or even longer.

What kind of work do they do?

in my role as CEO of Carrington Labs, a provider of predictive-analytics risk models for lenders. My team has a sandbox where we create, deploy, and run AI-generated code without a human in the loop. We use them to extract useful features for model construction, a natural-selection approach to feature development.

I wonder what I have to imagine this is doing and how. How do they interface with the loop-without-a-human?

Either way, they do seem to have a (small, narrow) systematic test case and the product variance to be useful at least anecdotally/for a sample case.

[–] sukhmel@programming.dev 3 points 2 months ago (1 children)

I have a feeling that their test case is also a bit flawed. Trying to get index_value instead of index value is something I can imagine happening, and asking an LLM to ‘fix this but give no explanation’ is asking for a bad solution.

I think they are still correct in the assumption that output becomes worse, though

[–] VoterFrog@lemmy.world 4 points 2 months ago (1 children)

It just emphasizes the importance of tests to me. The example should fail very obviously when you give it even the most basic test data.

[–] sukhmel@programming.dev 3 points 2 months ago (1 children)

Yeah, if only QA vere not the first ‘replaced’ by AI 😠

[–] VoterFrog@lemmy.world 2 points 2 months ago (1 children)

This isn't even a QA level thing. If you write any tests at all, which is basic software engineering practice, even if you had AI write the tests for you, the error should be very, very obvious. I mean I guess we could go down the road of "well what if the engineer doesn't read the tests?" but at that point the article is less about insidious AI and just about bad engineers. So then just blame bad engineers.

[–] sukhmel@programming.dev 1 points 2 months ago

Yeah, I understand that this case doesn't require a QA, but in the wild companies seem to increasingly think that developers are necessary (yet), but QA are surely not

It's not even bad engineers, it's just squeezing of productivity as dry as possible, as I see it