this post was submitted on 31 May 2026
1032 points (98.4% liked)

Fuck AI

7221 readers
1606 users here now

"We did it, Patrick! We made a technological breakthrough!"

A place for all those who loathe AI to discuss things, post articles, and ridicule the AI hype. Proud supporter of working people. And proud booer of SXSW 2024.

AI, in this case, refers to LLMs, GPT technology, and anything listed as "AI" meant to increase market valuations.

founded 2 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Zos_Kia@jlai.lu 4 points 1 day ago (2 children)

The reason labs focus on LLMs is that language is a great substrate for generalization. Good luck trying to one-shot out of distribution problems using classic neutral networks. They've tried for decades to make it happen but LLMs surpassed those results in a few years.

[–] placebo@lemmy.zip 5 points 1 day ago (1 children)

Idk. LLMs don't seem like a good solution because of how many resources they need to train and run compared to specialized models.

[–] Zos_Kia@jlai.lu 1 points 1 day ago

I know it's in bad taste to quote myself but i wrote an explanation of why this isn't necessarily a bad solution here

[–] Th4tGuyII@fedia.io 2 points 1 day ago (1 children)

I understand that idea, but at the same time @placebo@lemmy.zip has a point.

There's a good reason why you generally don't get a CPU to do graphics and why FPGAs are usually only put on dev units.

Specialist hardware is generally much more efficient cost and energy wise than generalist hardware for a given task.

And I imagine that must be true for neural networks too, as that layer of language processing on top of any task naturally can't be as efficient/performatative as specialist software/networks made for the job.

[–] Zos_Kia@jlai.lu 4 points 1 day ago

And I imagine that must be true for neural networks too, as that layer of language processing on top of any task naturally can’t be as efficient/performatative as specialist software/networks made for the job.

Oh yeah definitely, a specialized model for each task would be more efficient on the inference side but can you imagine the cost of training a million specialized models ? For example you could think of natural language processing as it was done before : one model for sentiment analysis, one model for chronological analysis, one model for identifying legal terms etc... need to classify color descriptions in natural language ? Well here you go train another model. A small model (comparatively) but also one you'll have to re-train if you want to change the task even slightly.

A LLM has the advantage of being able to generalize a lot of different tasks on the same model, including some that are wildly out of distribution (meaning you hadn't even thought of them and they are not explicitly stated in the training data). So yeah, you pay a big training tax to train one large model, but then it pays off because that same model can perform on a million different tasks.

At least that's the thesis. I'm not qualified to judge whether it is proving worth it, but that's the reason why the industry massively shifted towards LLMs.