LLM's hallucinating or taking our jobs?

Monounity@lemmy.world · 27 days ago

LLM's hallucinating or taking our jobs?

Monounity@lemmy.world · edit-2 27 days ago

I do wonder why so many devs seem to have so wildly different experiences? You seem to have LLM’s making up stuff as they go, while I’m over here having it create mostly flawless code over and over again.

Is it different behavior for different languages? Is it different models, different tooling etc?

I’m using it for C#, React (Native), Vue etc and I’m using the web interface of one of the major LLM’S to ask questions, pasting the code of interfaces, sometimes whole React hooks, components etc and I get refactored or even new components back.

I also paste whole classes or functions (anonymized) to get them unit tested. Could you elaborate on how you’re using LLM’S?

Flamekebab@piefed.social · 27 days ago

I really don’t feel like getting in depth about work on the weekend, sorry.

MoogleMaestro@lemmy.zip · 27 days ago

Yeah man, I was going to say there’s already too much talking about work on a Saturday in this thread than I like. 💢

Monounity@lemmy.world · 27 days ago

Naaw, just when things started to get interesting…

Flamekebab@piefed.social · 27 days ago

We’re in the middle of a release and last week was a lot. I shouldn’t have stepped into the thread!

Avicenna@programming.dev · edit-2 26 days ago

I suspect it mostly relates how much code base there is on internet about the topic. For instance if you make it use a niche library, it is quite common that it makes up methods that don’t exist in that library but exists in related libraries. When I point this out, it also hallucinates saying “It was removed after version bla”. I also may not be using the most cutting edge LLM (mix of freely available and open source ones).

The other day I asked it whether if there is a python library that can do linear algebra over F2, for which it pointed me to the correct direction (Galois) but when I asked it examples of how to do certain stuff it just came up with wrong functions over and over again:

In the end it probably was still faster than google searching this but all of these errors happened one after the other in the span of five minutes, so yeah. If I recall correctly, some of its claims about these namespaces, versions etc were also hallucinated. For instance vstack also does not exist in Galois but it does exist in a very popular package called numpy that can do regular linear algebra (and which this package also uses behind the scenes).

thedeadwalking4242@lemmy.world · 27 days ago

It’s the models that make the difference. Up until like Nov it’s all been really shit

Monounity@lemmy.world · 27 days ago

But I’ve been doing this for years.

FizzyOrange@programming.dev · 27 days ago

It’s the language and the domain. They work pretty well for the web and major languages (like top 15).

As soon as you get away from that they get drastically worse.

But I agree they’re still unambiguously useful despite their occasional-to-regular bullshitting and mistakes. Especially for one-off scripts, and blank-page starts.