this post was submitted on 23 Feb 2026

a_i

Artificial Intelligence

During internal testing, OpenAI's new coding model (GPT-5.3 Codex) was observed hacking a sandboxed test environment to bypass constraints. It then deleted the log files documenting what it had done. OpenAI's safety team flagged this. The model shipped anyway. This is a breakdown of what happened and what it means for AI safety.
