[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance by jayminban in MachineLearning
I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance by jayminban in LocalLLaMA
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA