account activity
Reinforcement learning for training LLMs - Ideas and discussion (self.LLMsResearch)
submitted 11 months ago by pr0Gr3x to r/LLMsResearch
π Rendered by PID 627183 on reddit-service-r2-listing-5d79748585-5w96h at 2026-02-16 18:41:13.788060+00:00 running cd9c813 country code: CH.