this post was submitted on 24 Feb 2026

320 points (95.5% liked)

Technology

81802 readers

4456 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related news or articles.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, this includes using AI responses and summaries. To ask if your bot can be added please contact a mod.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

L3s@hackingne.ws

320

'I had to RUN to my Mac mini like I was defusing a bomb': OpenClaw AI chose to 'speedrun' deleting Meta AI safety director's inbox due to a 'rookie error' (www.pcgamer.com)

submitted 12 hours ago* (last edited 11 hours ago) by themachinestops@lemmy.dbzer0.com to c/technology@lemmy.world

110 comments fedilink hide all child comments

top 50 comments

sorted by: hot top controversial new old

[–] xep@discuss.online 4 points 42 minutes ago

This smells like guerilla marketing to me.

[–] echodot@feddit.uk 2 points 10 minutes ago

Yep that's about the level of intelligence I would expect from Meta's AI safety director.

Doing the one thing that you're never supposed to do, letting an AI loose on anything sensitive.

For her next trick she's going to run while holding scissors in one hand and a bottle of boiling acid in the other. What could go wrong.

[–] fruitycoder@sh.itjust.works 2 points 50 minutes ago

What's funny, kind of like people, but saying "do not do xyz" makes it more likely because the context "xyx" is now in the prompt.

[–] ClydapusGotwald@lemmy.world 5 points 1 hour ago

That’s what you get for using ai slop.

[–] LiveLM@lemmy.zip 10 points 2 hours ago* (last edited 2 hours ago) (1 children)

She's lucky all she got were some deleted emails.
Given how insecure this whole ordeal is and the fact that she gave it full access to her REAL Inbox, someone could have phished the ever living fuck out of her and Meta just by sending an email with malicious prompt written on white text or hiding messages zero-width characters and other wacky antics.
Real Looney Tunes shit, congratulations to all involved.

[–] echodot@feddit.uk 1 points 9 minutes ago

You wouldn't even need to hide it since apparently she wasn't paying attention.

[–] Cantaloupe@lemmy.fedioasis.cc 5 points 2 hours ago

Dumb as fuck.

[–] Dultas@lemmy.world 16 points 3 hours ago

The S in OpenClaw stands for security.

[–] nieceandtows@programming.dev 20 points 4 hours ago

Yes I remember. And I violated it.

Asimov rolling in his grave.

[–] renzhexiangjiao@piefed.blahaj.zone 36 points 5 hours ago (3 children)

you can like... enforce this rule programatically? you don't have to say "pretty please" to ai? basically, when AI requests some potentially unwanted thing (like deleting an email), this request goes through a proxy that asks the human for confirmation. Also you can have a safe word set up in the chat interface to act as a killswitch. I thought these are ABCs of ai safety but apparently these are foreign concepts to this "safety director"

[–] underscores@lemmy.zip 4 points 1 hour ago* (last edited 39 minutes ago)

The people that design AI tools don't implement guardrails because then they'd have to admit AI is not ready for the shit they're trying to make

[–] zqps@sh.itjust.works 21 points 4 hours ago

The people who internalize this would never engage with a chatbot in this way in the first place. To them this is another intelligence they're conversing with, where you get what you want by following social decorum and enforcing your will amounts to abuse.

[–] HobbitFoot@thelemmy.club 18 points 4 hours ago

Program? Like a fucking farmer?

[–] Wispy2891@lemmy.world 12 points 5 hours ago (2 children)

Run? Like physically run? You install a server on your hardware without setting up remote access? Even plug and play one-click solutions like tailscale??

[–] LiveLM@lemmy.zip 2 points 2 hours ago (1 children)

You'd think someone with such a high position would know better

[–] Uruanna@lemmy.world 5 points 2 hours ago

No, you would not

[–] Dultas@lemmy.world 3 points 3 hours ago

Wouldn't shock me if it locked that down. Or started changing passwords.

load more comments