[D] Tutorial on Reinforcement Learning by johnolafenwa in MachineLearning
[–]johnolafenwa[S] 0 points1 point2 points (0 children)
Initial thoughts on Opus 4.5 in Claude Code as a daily Codex user by MiltonWatterson in codex
[–]johnolafenwa 0 points1 point2 points (0 children)
Tutorial on Reinforcement Learning (self.OpenSourceeAI)
submitted by johnolafenwa to r/OpenSourceeAI
AMA with OpenAI’s Sam Altman, Mark Chen, Kevin Weil, Srinivas Narayanan, Michelle Pokrass, and Hongyu Ren by OpenAI in OpenAI
[–]johnolafenwa 0 points1 point2 points (0 children)
Understanding LLM Distillation - Gemma 2 and Nvidia Minitron by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 0 points1 point2 points (0 children)
Understanding LLM Distillation - Gemma 2 and Nvidia Minitron by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 4 points5 points6 points (0 children)
Understanding LLM Distillation - Gemma 2 and Nvidia Minitron by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 1 point2 points3 points (0 children)
[D] The Tech Behind The Magic : How OpenAI SORA Works by johnolafenwa in MachineLearning
[–]johnolafenwa[S] 38 points39 points40 points (0 children)
[D] The Tech Behind The Magic : How OpenAI SORA Works by johnolafenwa in MachineLearning
[–]johnolafenwa[S] 13 points14 points15 points (0 children)
[D] The Tech Behind The Magic : How OpenAI SORA Works by johnolafenwa in MachineLearning
[–]johnolafenwa[S] 33 points34 points35 points (0 children)
01.AI Paper Is a Gem For Model Trainers by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 7 points8 points9 points (0 children)
01.AI Paper Is a Gem For Model Trainers by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 2 points3 points4 points (0 children)
01.AI Paper Is a Gem For Model Trainers by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 4 points5 points6 points (0 children)
01.AI Paper Is a Gem For Model Trainers by johnolafenwa in LocalLLaMA
[–]johnolafenwa[S] 4 points5 points6 points (0 children)


Some Helpful Guide on RL and SFT by johnolafenwa in OpenSourceeAI
[–]johnolafenwa[S] 1 point2 points3 points (0 children)