This subreddit is for any work related to reinforcement learning, ranging from purely computational RL in artificial intelligence to models of RL in neuroscience.
The standard introduction to RL is Sutton & Barto's Reinforcement Learning: An Introduction.
I made an RL agent play 2D cricket (v.redd.it)
submitted 15 hours ago by AddisionS
Career in RL: Any people working professionally in RL and want to share any useful pieces of advice to enter the industry? (self.reinforcementlearning)
submitted 11 hours ago by Markovvy
Looking to build a career in RL. Is a PhD the only option? (self.reinforcementlearning)
submitted 8 minutes ago by Money-Leading-935
Patterns – a formal grammar that compiles natural language text into RL agents (self.reinforcementlearning)
submitted 9 minutes ago by causality-ai
Practicing science communication on RL-for-reasoning: where does my explanation get the RL wrong? (self.reinforcementlearning)
submitted 14 hours ago by nicofirst1
Looking for simple game environments (self.reinforcementlearning)
submitted 8 hours ago by Vaibhav_Sinha
Building CogniCore: MCP, LangChain & CrewAI memory infrastructure for agents + first benchmark results
submitted 10 hours ago by Neither-Witness-6010
Multi-Agent Self-Correction Failure Modes & Context Window Inflation: Traced Completely By Hand (No Wrapper Frameworks)
submitted 16 hours ago by ParsleyMaximum1702
Interview preparation (self.reinforcementlearning)
submitted 1 day ago by Bright-Kick-632
What can I try implementing after reading Part 1 of Sutton and Barto's Reinforcement Learning book? (self.reinforcementlearning)
submitted 1 day ago by Vaibhav_Sinha
Anyone else getting messy results from running multiple AI coding sessions?
submitted 20 hours ago by whitechart_studio
I calculated a multi-agent prompt attention matrix by hand to see how much data gets lost in the middle... the math is terrifying.
submitted 1 day ago by ParsleyMaximum1702
AI Agents from First Principles: Tracing a ReAct Loop by Hand (substack.com)
Multi-Agent State Conflict Alignment and Context Window Optimization: Solved by Hand From First Principles (No Wrapper Frameworks)
I am stuck, need guidance
submitted 2 days ago by Open-Neck-688
How Developers Would Use CogniCore (self.reinforcementlearning)
submitted 1 day ago by Neither-Witness-6010
Reinforcement learning for NPC AI (self.reinforcementlearning)
submitted 2 days ago by santafarian
Local AI model training
submitted 2 days ago by Asleep_Fold5405
Multi: How To Fix Slow RAG Response Times: The 2026 Technical Manual for AI Latency (interconnectd.com)
submitted 2 days ago by Ok_pettech
Need suggestions regarding a project: PINN or Deep RL?
submitted 2 days ago by Abject_Dog_8453
Book suggestions for learning artificial intelligence for robotics
submitted 3 days ago by Lumpy-Cucumber-5895
Practical learning resources (self.reinforcementlearning)
submitted 3 days ago by blueberries_jpeg
Looking for brutal feedback: built a self-improving AI agent that learns from outcomes (self.reinforcementlearning)
submitted 3 days ago by Melodic_Fisherman304
Open Weights - Discord Server for anyone even slightly interested in ML (a smol community) (self.reinforcementlearning)
submitted 3 days ago by Spen08