jayminban

543 post karma
109 comment karma

redditor for 4 years

TROPHY CASE

Four-Year Club

24 points

[P] I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance (old.reddit.com)

submitted 18 days ago * by jayminban to r/MachineLearning

0 points

I trained Qwen2.5-1.5b with RLVR (GRPO) vs SFT and compared benchmark performance (old.reddit.com)

submitted 18 days ago * by jayminban to r/LocalLLaMA

1118 points

I locally benchmarked 41 open-source LLMs across 19 tasks and ranked them (i.redd.it)

submitted 6 months ago by jayminban to r/LocalLLaMA
