this post was submitted on 19 Feb 2026

182 points (96.9% liked)

Technology

82131 readers

3215 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

182

Race for AI is making Hindenburg-style disaster ‘a real risk’, says leading expert (www.theguardian.com)

submitted 1 week ago by themachinestops@lemmy.dbzer0.com to c/technology@lemmy.world

28 comments fedilink hide all child comments

top 28 comments

sorted by: hot top controversial new old

[–] UnspecificGravity@piefed.social 37 points 1 week ago (1 children)

The difference being that the Hindenburg was a perfectly functioning rigid airship that had a lot of inherent risks due to the nature of its design.

AI isn't good enough at its actual job to be in this position. The risk of AI is people pretending that it works when it doesn't. It would be like if you made a blimp and filled it with carbon dioxide and people kept buying tickets and just sitting there waiting for it to take off.

[–] criss_cross@lemmy.world 6 points 1 week ago

With society insisting that it’ll take off in the future and only suckers would leave.

[–] XLE@piefed.social 29 points 1 week ago (1 children)

“It’s the classic technology scenario,” he said. “You’ve got a technology that’s very, very promising, but not as rigorously tested as you would like it to be, and the commercial pressure behind it is unbearable.”

Is it promising though, Michael Wooldridge? Have you recently attended any magic shows and become excited by the potential of invisibility technology?

[–] Zink@programming.dev 6 points 1 week ago* (last edited 1 week ago)

Oh touche, not Michael Woolridge! The technology has created an entire segment of the economy worth many trillions of dollars based on NOTHING BUT promises! We are living in a promise-based economy!

/s but not really

[–] footprint@lemmy.world 12 points 1 week ago

This is a good comparison if all it took for the Hindenburg to explode was just asking it to role-play as a ship that could explode. Conscious effort had to be expended to make the thing fail, but most models start to fail spectacularly if you use it in good-faith for more than like 30 minutes.

[–] ReverendIrreverence@lemmy.world 8 points 1 week ago (2 children)

Except for the one person on the ground, the only people harmed in the Hindenburg disaster were the ones on board. If you're not "on board" when the AI bubbles pops and burns I expect you will not be hurt as much as those blindly taking that ride.

[–] GreenBeard@lemmy.ca 19 points 1 week ago

Unfortunately, we're not all the ones that decide if we're on board or not. Our employers are. We live in a world where profits are privatized and losses are socialized, so when this goes, it's going to hurt the general public a lot more than it will every hurt the Epstein Class.

[–] discocactus@lemmy.world 4 points 1 week ago* (last edited 1 week ago) (2 children)

On board means part of the utility grid and industrial food infrastructure sooooo

[–] entropicdrift@lemmy.sdf.org 2 points 1 week ago

And if you have a retirement account with investments, kinda at all. The entire US economy is hinging on AI at this point, to a deranged degree. Almost more than oil, at this point.

[–] FauxLiving@lemmy.world 1 points 1 week ago

When the AI bubble crashes then they would use less grid power on account of not existing.

[–] RobotToaster@mander.xyz 8 points 1 week ago* (last edited 1 week ago) (2 children)

A disaster that causes a lot of bad publicity despite the majority (62/97) of the passengers surviving, and that may have been caused by sabotage?

[–] AstralPath@lemmy.ca 3 points 1 week ago

No!

Fire BIG. Big fire bad!

Run away!

[–] XLE@piefed.social 1 points 1 week ago

I appreciate the people who help make sure AI doesn't receive an ounce of the credit it doesn't deserve

[–] doug@lemmy.today 6 points 1 week ago (1 children)

“Oh the inhumanity!”

[–] W98BSoD@lemmy.dbzer0.com 1 points 1 week ago

[–] aesthelete@lemmy.world 5 points 1 week ago

At this point, it'll cause a disaster and they'll still keep going.

[–] tal@lemmy.today 4 points 1 week ago (2 children)

Wooldridge sees positives in the kind of AI depicted in the early years of Star Trek. In one 1968 episode, The Day of the Dove, Mr Spock quizzes the Enterprise’s computer only to be told in a distinctly non-human voice that it has insufficient data to answer. “That’s not what we get. We get an overconfident AI that says: yes, here’s the answer,” he said. “Maybe we need AIs to talk to us in the voice of the Star Trek computer. You would never believe it was a human being.”

Hmm. That's probably a pretty straightforward modification for existing LLMs, at least at the token level.

You can obtain token probabilities, so you can give some estimate out-of-band confidence in a response, down to the token level. Don't really need to change anything for that, just expose some data.

And you could make the AI aware of its own neural net's confidence level, feed the confidence back into the neural net for subsequent tokens, see if you can get it to take that information into account.

https://en.wikipedia.org/wiki/Recurrent_neural_network

In artificial neural networks, recurrent neural networks (RNNs) are designed for processing sequential data, such as text, speech, and time series,[1] where the order of elements is important. Unlike feedforward neural networks, which process inputs independently, RNNs utilize recurrent connections, where the output of a neuron at one time step is fed back as input to the network at the next time step. This enables RNNs to capture temporal dependencies and patterns within sequences.

[–] ThirdConsul@lemmy.zip 5 points 1 week ago (1 children)

You can obtain token probabilities, so you can give some estimate out-of-band confidence in a response, down to the token level.

That means literally nothing. You can get wrong answer with 100% token confidence, and correct one with 0.000001% confidence.

[–] tal@lemmy.today 1 points 1 week ago* (last edited 1 week ago) (1 children)

You can get wrong answer with 100% token confidence, and correct one with 0.000001% confidence.

If everything that I've seen in the past has said that 1+1 is 4, then sure

I'm going to say that 1+1 is 4. I will say that 1+1 is 4 and be confident in that.

But if I've seen multiple sources of information that state differing things

say, half of the information that I've seen says that 1+1 is 4 and the other half says that 1+1 is 2, then I can expose that to the user.

I do think that Aceticon does raise a fair point, that fully capturing uncertainty probably needs a higher level of understanding than an LLM directly generating text from its knowledge store is going to have. For example, having many ways of phrasing a response will also reduce confidence in the response, even if both phrasings are semantically compatible. Being on the edge between saying that, oh...an object is "white" or "eggshell" will also reduce the confidence derived from token probability, even if the two responses are both semantically more-or-less identical in the context of the given conversation.

There's probably enough information available to an LLM to do heuristics as to whether two different sentences are semantically-equivalent, but you wouldn't be able to do that efficiently with a trivial change.

[–] ThirdConsul@lemmy.zip 1 points 1 week ago

You do realise that prompts to and responses from the LLM are not as simple as what you wrote "1+1=?". The context window is growing for a reason. And LLMs dont have two dimensional probability of the next token?

[–] Aceticon@lemmy.dbzer0.com 2 points 1 week ago

The problem is that LLMs don't generate "an answer" as a whole, they just generate tokens (generally word-sized, but not always) for the next text element given the context of all the text elements (the whole conversation) so far and the confidence level is per-token.

Further, the confidence level is not about logical correctness, it's about "how likely is this token to appear in this context".

So even if you try using token confidence you still end up stuck due to the underlying problem that the LLMs architecture is that of a "realistic text generator" and hence that confidence level is all about "what text comes next" and not at all about the logical elements conveyed via text such as questions and answers.

[–] BeigeAgenda@lemmy.ca 4 points 1 week ago

And now we hear stories about how easy it is to hack systems with built in LLM's and when you think about it, they are basically trained to be as helpful and forthcoming as possible, and then we give them the keys to the system!

[–] friend_of_satan@lemmy.world 3 points 1 week ago* (last edited 1 week ago) (1 children)

Tangent: that pic reminds me of the terrorizing tit in Everything You Always Wanted to Know About Sex (*But Were Afraid to Ask)

[–] thatradomguy@lemmy.world 1 points 1 week ago

Honestly first thing that came to mind was booby.

[–] UnderpantsWeevil@lemmy.world 2 points 1 week ago

Hindenburg was a hiccup in history relative to the fallout from an AI bust.

[–] W98BSoD@lemmy.dbzer0.com 2 points 1 week ago

[–] Lembot_0006@programming.dev 1 points 1 week ago (1 children)

What? Global interest? Self-driving cars? Hindenburg? Is this professor a cat? Markov chain? The provided info is so crazy that I decided to NOT read the article.

[–] TropicalDingdong@lemmy.world 1 points 1 week ago

Hydrogen buildup?