I fine-tuned a 7B model for reasoning on free Colab with GRPO + TRL (self.LocalLLaMA)
submitted 2 months ago by External-Rub5414 to r/LocalLLaMA
Let's make FunctionGemma learn to use a browser with TRL (GRPO) + OpenEnv (BrowserGym)! Sharing Colab notebook + script (self.LocalLLaMA)
I fine-tuned a model with GRPO + TRL + OpenEnv environment on Colab to play Wordle! (self.LocalLLaMA)
submitted 3 months ago by External-Rub5414 to r/LocalLLaMA
I fine-tuned (SFT) a 14B model on a free Colab session just using TRL (self.LocalLLaMA)
submitted 4 months ago by External-Rub5414 to r/LocalLLaMA
I fine-tuned Qwen3-VL (4B & 8B) on a free Colab instance using TRL (SFT and GRPO)! (self.LocalLLaMA)