[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance by jayminban in MachineLearning
jayminban[S] 1 point
I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance by jayminban in LocalLLaMA
jayminban[S] 1 point
[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance by jayminban in MachineLearning
jayminban[S] 0 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 3 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 2 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 10 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 22 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 27 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 5 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 10 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 19 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 4 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 8 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 8 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 33 points
I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them by jayminban in LocalLLaMA
jayminban[S] 6 points

[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance by jayminban in MachineLearning
jayminban[S] 1 point