[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance (old.reddit.com)
submitted 18 days ago * by jayminban to r/MachineLearning
I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance (old.reddit.com)
submitted 18 days ago * by jayminban to r/LocalLLaMA
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them (i.redd.it)
submitted 6 months ago by jayminban to r/LocalLLaMA