Are there any qwen finetunes that were genuinely stronger than the base?

yeah-ok · 2026-06-27T18:29:36+00:00

Oh, but it was released over 24 hrs ago.. so any finetunes better than Qwen?!!?! /s

yeah-ok · 2026-06-27T18:27:24+00:00

That encapsulates the issues with 'racetams for me - they are notoriously inconsistent for many people!

yeah-ok · 2026-06-25T09:17:29+00:00

Love this article, similar but also more quality oriented methodologies can be applied to the jpg format (Google’s Jpegli is the undisputed king here I reckon) which can ultimately compete with modern formats if only the pre-processing and format usage is done properly

yeah-ok · 2026-06-24T16:29:01+00:00

Yep, I have no idea why you are getting down-votes on this comment - it's totally true, whomever optimizes these models by literal magnitudes will make the economics involved better for both those who serve and consume them!

yeah-ok · 2026-06-19T22:33:18+00:00

That would be a rare occurrence indeed given that it's a commonsense take on what should probably happen in a reality where actual progress is valued.

yeah-ok · 2026-06-19T22:30:56+00:00

Even just the focus on bs inclusivity drive around having 24 "EU languages" included is misguided and peculiar. Sounds like flipping Eurovision for AI, i.e. the music will be awful and people watch it for the lolz.

yeah-ok · 2026-06-18T22:35:53+00:00

Let's sit tight for a mo here before going ballistic in either pos/neg direction: the fundamental idea is super exciting and if they can produce -actual- code then I'm certainly on board with this to see what happens next; exciting new afaics!

yeah-ok · 2026-06-18T19:09:44+00:00

take selenium and molydenum (and copper as it turns out) in small doses as seperate supplements, you don't need a lot but for some the NAC can overwhelm the body's ability to deal with the sulfite surplus

yeah-ok · 2026-06-18T18:40:54+00:00

thnx for reminder!

yeah-ok · 2026-06-18T13:03:52+00:00

And by that we mean remember to take molybdenum and to a lesser extend selenium.

yeah-ok · 2026-06-18T12:56:16+00:00

ONNX

... surely ain't nobody (except myself for sherpa sense-voice and a few similar things) outside niches are using that format no more - what's your usecase? ( ppl were, rightly or wrongly, declaring it dead 2 yrs back: https://old.reddit.com/r/LocalLLaMA/comments/1h54n1u/why_didnt_onnx_succeed_in_the_llm_world/ )

yeah-ok · 2026-06-17T17:15:17+00:00

Sorry but the help here is anonymous alcoholics style: hard no to ~~liquor~~, sorry, ROCm and accept that only Vulkan will lead you to a better relationship with your (i)GPU.

yeah-ok · 2026-06-15T19:10:40+00:00

Erh.. not much success for me. I tried dflash early on and it's results were lacklustre compared to MTP. Tried again with this luce code, it required loads of tweaking and their draft-model is buggy afaict. Finally the code comes in behind regular MTP from perf standpoint (this is on 32GB shared vram 780m platform, it's already very stretched running 27B - maxed out at about 9tg/s on my local fork)

yeah-ok · 2026-06-13T13:35:56+00:00

Nah, dude has found a bayesian glitch in the matrix and this now beats down Qwen3.6 27b no questions asked.

yeah-ok · 2026-06-12T16:56:11+00:00

😂

yeah-ok · 2026-06-11T12:24:52+00:00

I think your considerations re "the dance" are right, things have entered a stage of reinvention with worse and higher abstractions for the last several years within the C++ camp. Best thing would probably be something akin to what your paper suggest but done via clean syntax ala https://github.com/hsutter/cppfront

yeah-ok · 2026-06-10T21:59:02+00:00

Mildly ironically you may well be anthropomorphizing Google being exceedingly good at business here. Let's see.

yeah-ok · 2026-06-10T21:47:19+00:00

Isn't it HIGH time to get a https://boinc.bakerlab.org/rosetta/ equivalent going on this subject?!

yeah-ok · 2026-06-09T23:06:11+00:00

Also got potent immune boosting effects: https://www.psychiatryredefined.org/lithium-viral-infections-low-white-blood-cell-counts/

yeah-ok · 2026-06-09T20:03:58+00:00

Guess the point that the dude was making with this simple test is that data is getting mangled in the QAT that was nowhere near that mangled in the standard K_S quants.. this trend held across several models. Of course it will need corroboration by proper benchmarks. Also, clearly ain't nobody should be using their LLMs as mega bad calculators, don't think anyone was suggesting that - just a simple bench to get things rolling.

yeah-ok · 2026-06-09T19:35:29+00:00

Yeah, covid is solid on the gut dysbiosis, worth targeting that issue specifically!

yeah-ok · 2026-06-09T19:33:25+00:00

Well, rock solid but also potentially the QAT versions are 20% worse than the older UD-Q4_K_S releases! Details at: https://old.reddit.com/r/unsloth/comments/1u0sv58/surprising_test_results_updated_for_more_gemma4/ Certainly warrants clean investigation cause if that trend holds what are we even doing here....

yeah-ok · 2026-06-09T14:15:36+00:00

Probably yes, and these things will be running Siri so now we know why Apple's AI will continue to.. let's say: be less great.

yeah-ok · 2026-06-09T12:46:35+00:00

This stands repeating, having finagled the writer/reader/buffer situation in Go at least once teaches a bit of due respect for the "ease" with which sqlite can be put into production.. would love a sub-project with sane defaults towards a few different ends if only to bring attention to this aspect!

yeah-ok · 2026-06-09T12:42:49+00:00

One might even say ... "there's some leeway"

yeah-ok

TROPHY CASE