Artificial Intelligence

279 readers

2 users here now

Chat about and share AI stuff

founded 3 years ago

MODERATORS

Pokey@lemmy.sdf.org

Punishing AI doesn't stop it from lying and cheating — it just makes it hide better, study shows (www.livescience.com)

submitted 1 year ago by neme@lemm.ee to c/artificialintelligence@lemmy.sdf.org

11 comments fedilink hide all child comments

you are viewing a single comment's thread
view the rest of the comments

[–] jet@hackertalks.com 3 points 1 year ago* (last edited 1 year ago) (1 children)

It's a optimization game. If the punishment doesn't offset the reward, then the incentive is to get better at cheating.

[–] PillowTalk420@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

I've seen plenty of videos of random college kids training LLMs to play video games and getting the AI to stop cheating is like half the project. But they manage it, eventually. It's laughable that these big companies and research firms can't quite figure it out.