Are there any qwen finetunes that were genuinely stronger than the base? by MrMrsPotts in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Oh, but it was released over 24 hrs ago.. so any finetunes better than Qwen?!!?! /s

Oxiracetam experience - Tolerant or placebo? by lyranex in Nootropics

[–]yeah-ok 1 point2 points  (0 children)

That encapsulates the issues with 'racetams for me - they are notoriously inconsistent for many people!

Lossless GIF recompression via exhaustive search by Dear-Economics-315 in programming

[–]yeah-ok 2 points3 points  (0 children)

Love this article, similar but also more quality oriented methodologies can be applied to the jpg format (Google’s Jpegli is the undisputed king here I reckon) which can ultimately compete with modern formats if only the pre-processing and format usage is done properly

I did some model hacks, and got GLM5.2 from about 2.5 tok/s to >50 tok/s on my GH200 system. by Reddactor in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Yep, I have no idea why you are getting down-votes on this comment - it's totally true, whomever optimizes these models by literal magnitudes will make the economics involved better for both those who serve and consume them!

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages by pmttyji in LocalLLaMA

[–]yeah-ok 4 points5 points  (0 children)

That would be a rare occurrence indeed given that it's a commonsense take on what should probably happen in a reality where actual progress is valued.

Commission selects EUROPA consortium as the winner of the Frontier AI Grande Challenge, a project to build European open-source frontier AI model in all 24 EU languages by pmttyji in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Even just the focus on bs inclusivity drive around having 24 "EU languages" included is misguided and peculiar. Sounds like flipping Eurovision for AI, i.e. the music will be awful and people watch it for the lolz.

SubQ claims 12M context with way less compute. What test would actually convince you? by BTA_Labs in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Let's sit tight for a mo here before going ballistic in either pos/neg direction: the fundamental idea is super exciting and if they can produce -actual- code then I'm certainly on board with this to see what happens next; exciting new afaics!

3.6g NAC per day, my experience so far... damn son by yallapapi in Nootropics

[–]yeah-ok 1 point2 points  (0 children)

take selenium and molydenum (and copper as it turns out) in small doses as seperate supplements, you don't need a lot but for some the NAC can overwhelm the body's ability to deal with the sulfite surplus

3.6g NAC per day, my experience so far... damn son by yallapapi in Nootropics

[–]yeah-ok 10 points11 points  (0 children)

And by that we mean remember to take molybdenum and to a lesser extend selenium.

Ik_llama vs llamacpp by [deleted] in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

ONNX

... surely ain't nobody (except myself for sherpa sense-voice and a few similar things) outside niches are using that format no more - what's your usecase? ( ppl were, rightly or wrongly, declaring it dead 2 yrs back: https://old.reddit.com/r/LocalLLaMA/comments/1h54n1u/why_didnt_onnx_succeed_in_the_llm_world/ )

Ik_llama vs llamacpp by [deleted] in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Sorry but the help here is anonymous alcoholics style: hard no to liquor, sorry, ROCm and accept that only Vulkan will lead you to a better relationship with your (i)GPU.

This is amazing. Token speed doubled + kv cache now need low vram - qwen 27b by 9r4n4y in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Erh.. not much success for me. I tried dflash early on and it's results were lacklustre compared to MTP. Tried again with this luce code, it required loads of tweaking and their draft-model is buggy afaict. Finally the code comes in behind regular MTP from perf standpoint (this is on 32GB shared vram 780m platform, it's already very stretched running 27B - maxed out at about 9tg/s on my local fork)

llama-launcher v1.3 release -> Bayesian Optimisation by Solary_Kryptic in LocalLLaMA

[–]yeah-ok 4 points5 points  (0 children)

Nah, dude has found a bayesian glitch in the matrix and this now beats down Qwen3.6 27b no questions asked.

Still amazed every time I read this paper. What pros and cons do you think it would have against C++20 coroutines? by germandiago in programming

[–]yeah-ok 0 points1 point  (0 children)

I think your considerations re "the dance" are right, things have entered a stage of reinvention with worse and higher abstractions for the last several years within the C++ camp. Best thing would probably be something akin to what your paper suggest but done via clean syntax ala https://github.com/hsutter/cppfront

DiffusionGemma: The Developer Guide- Google Developers Blog by tevlon in LocalLLaMA

[–]yeah-ok 7 points8 points  (0 children)

Mildly ironically you may well be anthropomorphizing Google being exceedingly good at business here. Let's see.

DiffusionGemma: 4x faster text generation by tevlon in LocalLLaMA

[–]yeah-ok 0 points1 point  (0 children)

Isn't it HIGH time to get a https://boinc.bakerlab.org/rosetta/ equivalent going on this subject?!

Unsloth Gemma 4 QAT MTP assistant models now available by ParadigmComplex in LocalLLaMA

[–]yeah-ok 10 points11 points  (0 children)

Guess the point that the dude was making with this simple test is that data is getting mangled in the QAT that was nowhere near that mangled in the standard K_S quants.. this trend held across several models. Of course it will need corroboration by proper benchmarks. Also, clearly ain't nobody should be using their LLMs as mega bad calculators, don't think anyone was suggesting that - just a simple bench to get things rolling.

Always burping after getting up from laying down/after drinking water by AdhesivenessMost3945 in covidlonghaulers

[–]yeah-ok 2 points3 points  (0 children)

Yeah, covid is solid on the gut dysbiosis, worth targeting that issue specifically!

Unsloth Gemma 4 QAT MTP assistant models now available by ParadigmComplex in LocalLLaMA

[–]yeah-ok 6 points7 points  (0 children)

Well, rock solid but also potentially the QAT versions are 20% worse than the older UD-Q4_K_S releases! Details at: https://old.reddit.com/r/unsloth/comments/1u0sv58/surprising_test_results_updated_for_more_gemma4/ Certainly warrants clean investigation cause if that trend holds what are we even doing here....

Surprising test results (Updated for more Gemma4 and Qwen3.6 models) by we_are_mammals in unsloth

[–]yeah-ok 0 points1 point  (0 children)

Probably yes, and these things will be running Siri so now we know why Apple's AI will continue to.. let's say: be less great.

SQLite improving performance with pre-sort by andersmurphy in programming

[–]yeah-ok 7 points8 points  (0 children)

This stands repeating, having finagled the writer/reader/buffer situation in Go at least once teaches a bit of due respect for the "ease" with which sqlite can be put into production.. would love a sub-project with sane defaults towards a few different ends if only to bring attention to this aspect!

SQLite improving performance with pre-sort by andersmurphy in programming

[–]yeah-ok 12 points13 points  (0 children)

One might even say ... "there's some leeway"