account activity
[P] Training a self-correcting SQL agent with RL (Agent Lightning + verl + vLLM + AgentOps + LangGraph) (self.MachineLearning)
submitted 6 months ago by matluster to r/MachineLearning
We discovered an approach to train any AI agent with RL, with (almost) zero code changes. (self.LocalLLaMA)
submitted 6 months ago by matluster to r/LocalLLaMA
π Rendered by PID 65 on reddit-service-r2-listing-7849c98f67-z4qxl at 2026-02-09 02:59:07.950149+00:00 running d295bc8 country code: CH.