Reinforcement Learning

1

0

Canadian AI Conference 2026 — Call for papers (22 Dec 2025 Deadline) (www.caiac.ca)

submitted 1 month ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

2 comments

Conference is at Simon Fraser University.

Two rounds of submissions. First deadline is 22 Dec 2025. Second is 13 Feb 2026.

2

0

A constant function. Courtesy of W&B. (lemmy.ca)

submitted 2 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

3

1

EWRL 2025 - Program and Accepted Papers (euro-workshop-on-reinforcement-learning.github.io)

submitted 2 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

4

2

Greener Deep Reinforcement Learning: Analysis of Energy and Carbon Efficiency Across Atari Benchmarks (arxiv.org)

submitted 2 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

2 comments

RL is still nowhere near the scale where this matters, but it's always good to get an idea of how things will look when it inevitably reaches that point.

5

1

Keynote Talks from RLC2025 (www.youtube.com)

submitted 3 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

6

5

Factorio Learning Environment (jackhopkins.github.io)

submitted 9 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

7

1

Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. (www.acm.org)

submitted 9 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

8

5

Open Sourcing π₀ (www.physicalintelligence.company)

submitted 10 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

9

7

A Little Bit of Reinforcement Learning from Human Feedback -- Nathan Lambert (rlhfbook.com)

submitted 10 months ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

https://bsky.app/profile/natolambert.bsky.social/post/3lh5jih226k2k

Anyone interested in learning about RLHF? The text isn't complete yet, but it already looks like a pretty useful resource.

10

1

Reinforcement Learning: An Overview (arxiv.org)

submitted 1 year ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

An overview of RL published just a few days ago. 144 pages of goodies covering everything from basic RL theory to modern deep RL algorithms and various related niches.

This manuscript gives a big-picture, up-to-date overview of the field of (deep) reinforcement learning and sequential decision making, covering value-based RL, policy-gradient methods, model-based methods, and various other topics (including a very brief discussion of RL+LLMs).

11

5

Keynotes from the 2024 Reinforcement Learning Conference (www.youtube.com)

submitted 1 year ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

Recordings for the RLC keynote talks have been released.

Keynote speakers:

David Silver
Doina Precup (Not recorded)
Peter Stone
Finale Doshi-Velez
Sergey Levine
Emma Brunskill
Andrew Barto

12

1

OpenAI: Learning to Reason with LLMs (openai.com)

submitted 1 year ago* (last edited 1 year ago) by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments

OpenAI just put out a blog post about a new model trained via RL (I'm assuming this isn't the usual RLHF) to perform chain-of-thought reasoning before giving the user its answer. As usual, there's very little detail about how this is accomplished, so it's hard for me to get excited about it, but the rest of you might find this interesting.

13

1

Introducing SIMA, a Scalable Instructable Multiworld Agent (deepmind.google)

submitted 2 years ago by howrar@lemmy.ca to c/reinforcement_learning@lemmy.ca

0 comments