Reinforcement Learning

65 readers
1 users here now

A community dedicated to discussions on reinforcement learning, a subdiscipline of machine learning that tackles sequential decision making problems.

founded 2 years ago
MODERATORS
1
 
 

Conference is at Simon Fraser University.

Two rounds of submissions. First deadline is 22 Dec 2025. Second is 13 Feb 2026.

2
 
 
3
1
EWRL 2025 - Program and Accepted Papers (euro-workshop-on-reinforcement-learning.github.io)
4
 
 

RL is still nowhere near the scale where this matters, but it's always good to get an idea of how things are going to look when they inevitably reach that point.

5
6
7
8
5
Open Sourcing π₀ (www.physicalintelligence.company)
9
 
 

https://bsky.app/profile/natolambert.bsky.social/post/3lh5jih226k2k

Anyone interested in learning about RLHF? This text isn't complete yet, but looks to be a pretty useful resource as is already.

10
 
 

An overview of RL published just a few days ago. 144 pages of goodies covering everything from basic RL theory to modern deep RL algorithms and various related niches.

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).

11
 
 

Recordings for the RLC keynote talks have been released.

Keynote speakers:

  • David Silver
  • Doina Precup (Not recorded)
  • Peter Stone
  • Finale Doshi-Velez
  • Sergey Levine
  • Emma Brunskill
  • Andrew Barto
12
 
 

OpenAI just put out a blog post about a new model trained via RL (I'm assuming this isn't the usual RLHF) to perform chain of thought reasoning before giving the user its answer. As usual, there's very little detail about how this is accomplished so it's hard for me to get excited about it, but the rest of you might find this interesting.

13