One thing that stood out to me was that none of their agents performed well on boxing or pong, but in my experience, these are the two easiest tasks in the entire Atari suite. They're the games I use as quick sanity checks. How did they manage that?
this post was submitted on 15 Sep 2025
2 points (100.0% liked)
Reinforcement Learning
65 readers
1 users here now
A community dedicated to discussions on reinforcement learning, a subdiscipline of machine learning that tackles sequential decision making problems.
founded 2 years ago
MODERATORS
I heard that AI training is already releasing as much carbon dioxide as airplanes.