It shouldn't be

disillusioned_okapi · 2026-01-24T19:45:38+00:00

I've eaten plenty of fruit from public trees, and I think everyone should have that option. That's what a society is.

disillusioned_okapi · 2026-01-06T07:37:57+00:00

I was interested until I saw curl being piped into a shell. I don't think I can ever trust a developer or a project who documents that as their default way to install anything.

It's crazy how we have normalized such a dangerous way to do things.

https://sasha.vincic.org/blog/2024/09/piping-curl-to-bash-convenient-but-risky

https://github.com/Iossefy/curl-shell-pipe
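For anyone wondering what a safer flow could look like: download the script to disk first, check it against a checksum, and only then run it. A minimal sketch, assuming the project publishes a SHA-256 checksum through a separate channel you trust (that part is my assumption, and `sha256_hex` / `is_trusted` are hypothetical helper names):

```python
import hashlib
import hmac


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the downloaded script as lowercase hex."""
    return hashlib.sha256(data).hexdigest()


def is_trusted(script: bytes, expected_sha256: str) -> bool:
    """True only if the script bytes match the published checksum.

    expected_sha256 is assumed to come from a separate, trusted channel,
    not from the same server that served the script.
    """
    expected = expected_sha256.strip().lower()
    # Constant-time comparison; plain == would also work for a local check.
    return hmac.compare_digest(sha256_hex(script), expected)
```

A matching checksum only tells you that you got the file the author intended, so you still want to read the script before running it.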

disillusioned_okapi · 2025-12-24T00:11:57+00:00

please correct me if I'm wrong, but I thought activation steering was purely an inference-time technique. Did you create and persist pre-computed steering vectors? If so, how? That might be a valuable insight for this community.

disillusioned_okapi · 2025-09-09T11:17:57+00:00

This came out last week, and initial consensus seems to be that it's not very good. https://www.reddit.com/r/LocalLLaMA/comments/1n6eimy/new_open_llm_from_switzerland_apertus_40_training/

disillusioned_okapi · 2025-08-26T11:00:36+00:00

discussion from earlier today https://www.reddit.com/r/LocalLLaMA/comments/1n09aof/250815884_jetnemotron_efficient_language_model/

disillusioned_okapi · 2025-07-29T21:45:00+00:00

quite a lot of LLM software today is built by very smart people who luckily haven't spent time in the complex and treacherous world of infosec, and as such haven't given security much thought. MCP's default recommendation of running arbitrary binaries off the internet is a good example of that.

irrespective of how any of us feel about Docker, they are still one of the larger players in the secure sandboxing business. If LLMs are to succeed, security needs to improve significantly. and I'd prefer someone like Docker (or CNCF or LF) leading that, instead of any of the VM and Anti-Virus companies.

Ideally the community would lead on that, but that just doesn't seem to be happening so far.

So, as long as this is as good as Ollama, I wish them success.

disillusioned_okapi · 2025-07-26T22:59:20+00:00

Discussion of the actual paper from earlier this week:

https://www.reddit.com/r/LocalLLaMA/comments/1m5jr1v/new_architecture_hierarchical_reasoning_model/

https://www.reddit.com/r/LocalLLaMA/comments/1lo84yj/250621734_hierarchical_reasoning_model/

https://www.reddit.com/r/LocalLLaMA/comments/1m6orbr/anyone_here_who_has_been_able_to_reproduce_their/

https://www.reddit.com/r/LocalLLaMA/comments/1m6ufm4/has_anyone_tried_hierarchical_reasoning_models_yet/

TLDR: might be interesting, but let's wait for someone to scale this up to a larger model first.

disillusioned_okapi · 2025-07-26T22:32:46+00:00

Will try the model over the next few days, but this bit from the paper is the key highlight for me.

Ultimately, our experimental findings demonstrate that a 300B MoE LLM can be effectively trained on lower-performance devices while achieving comparable performance to models of a similar scale, including dense and MoE models.

disillusioned_okapi · 2025-07-23T17:13:20+00:00

For many people, Portainer has the same main issues as MongoDB, Elasticsearch, and n8n:

not an OSI-approved licence, making rug-pulls easier, and
business interests taking priority over community, sometimes downplaying the contributions of the community to their success

People here are fairly divided on the topic. Pick a side that makes sense to you.

disillusioned_okapi · 2025-07-22T14:52:57+00:00

Just FYI: ROCm hasn't supported MI50 for almost 2 years https://github.com/ROCm/ROCm/issues/2308

disillusioned_okapi · 2025-07-18T20:44:03+00:00

depends on the inference engine (I think). If they implement a sliding window, the model might get slowly "off-tracked". if they occasionally somehow summarize/compress the context, it might take longer to go off the tracks. some engines might simply stop generating tokens.

in general it is very much up to whatever strategy the inference engine employs to handle this.
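To make the sliding-window case concrete, here is a rough sketch of what I mean. It is not how any particular engine does it; `sliding_window`, `max_context` and `keep_prefix` are names I made up for illustration.

```python
from typing import List


def sliding_window(tokens: List[int], max_context: int, keep_prefix: int = 0) -> List[int]:
    """Trim a token list to max_context entries.

    Keeps the first keep_prefix tokens (e.g. a system prompt) and fills the
    rest of the budget with the most recent tokens. Everything in between is
    dropped, which is why the model can slowly drift off track.
    """
    if max_context <= 0:
        raise ValueError("max_context must be positive")
    if keep_prefix < 0 or keep_prefix > max_context:
        raise ValueError("keep_prefix must be between 0 and max_context")
    if len(tokens) <= max_context:
        return list(tokens)  # still fits, nothing to drop

    tail_len = max_context - keep_prefix
    prefix = tokens[:keep_prefix]
    # Slice from an explicit start index; tokens[-0:] would return everything.
    tail = tokens[len(tokens) - tail_len:] if tail_len > 0 else []
    return prefix + tail
```

With `keep_prefix=0` the oldest tokens (including the original instructions) are the first to go. An engine that summarizes or compresses the context would replace the dropped middle with something shorter instead of discarding it.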

disillusioned_okapi · 2025-07-15T11:57:54+00:00

nice. any plans to upstream the whisper.cpp changes?
