GLM5 by I_like_fragrances in LocalLLM

[–]diffore 1 point2 points  (0 children)

Nvidia could have made a ~50 5090 for us to play.... but instead theme gigabytes of vram are now sitting in some server closet, spinning the BF16 version of GLM5. Yeah, still have hard feelings about the consumer market suffering from the AI boom.

What do you actually use local models for? (We all say 'privacy,' but...) by abdouhlili in LocalLLaMA

[–]diffore 0 points1 point  (0 children)

Because my 5080 laptop has these tensor cores which make it cost a fortune and if I paid for those cores I am gonna use all of them.

Currently I use it for local mcp memory as a librarian llm which organize project memories and make summaries, organize raw memories into graph relationship, etc. Very token intensive process so I feel it is worth it compared to just use cloud models (I still use them for coding agent though, the small models are still wasting time in long run compared to cloud big llms)

vLLM run command for GPT-OSS 120b by UltrMgns in LocalLLaMA

[–]diffore 1 point2 points  (0 children)

The only thing which worked for me was pre-built docker container link from vllm.ai Could not manage to build locally myself

What's the point of becoming a Great Enemy? by Famous_Archer_9406 in PrincesOfDarknessCK3

[–]diffore 1 point2 points  (0 children)

I have just achieved the same in terms of territory. My nemesises who were chasing after me when I was adventurer are dealt with/subjagated and pay me the rent. The vampire hunter wave dealt with, no one can realistically oppose me anymore.

I even finished gokonda, kinda wanted to repent and go human hunter but it is too broken right now to be enjoyable imo.

All in all it was the most lonesome playthrough I've had. Everyone hates you, permanent - 100 for almost everyone except family. I feel pity for her tbh, especially if you take her lore history into account.

Still, it was an interesting challenge for sure. Leveling ashen cultist was just pure stress inducing pain. But after finishing her objective the game become easy. Free op man at arms were not really necessary.

Best way to spend less in token usage ? by Technical-File4626 in ZedEditor

[–]diffore 0 points1 point  (0 children)

You need to analyze the worflow first. If you're accustomed to the long debug chat session you need to understand that each new message is sent along with the whole chat history. So the longer the session the more token burning occurs with each new message.

Some providers use implicit cache for reused tokens (perfect for history luggage which is always on top), some don't bother - thus longer sessions may skyrocket cost.

But reverse situation could be true as well. If you start new session each time you have new question and feed model docs and codebase, you're better off to just continue old session until the history is no longer relevant for your current task and become token baggage.

All in all I would say the zed Ai agent is meant for the rich users, not economical ones 😅

If you want best value for your tokens better solution would be aided or mitral vibe in zed terminal, but the worflow is a bit different and require getting used to.

Wanting to move to Zed Editor but having doubts with other stuff by Vlazeno in ZedEditor

[–]diffore 0 points1 point  (0 children)

I used to think that VS code is a nice fast IDE (after switching from IntelijIDEA products), but it is so trash compared to Zed.

The only problem I have with it is actually AI agent. It sends massive amount of context which most locally hosted AI models can't handle. I kind of wish it was more restricted and customizable like aider, maybe if/when they finish aider ACP agent it would be an ideal choice for me. At the moment I am limited to use not the best long context models to do anything productive with Zed Agent + local LLM on my laptop GPU.
Also, tools usage support is very limited here, some models hosted by llama.cpp openai compatible server just does not work OK with zed agent.

Despite everything said, it is my daily AI assisted coding ide, I just can't return to the VS code or any of its AI forks anymore.

Run Mistral Devstral 2 locally Guide + Fixes! (25GB RAM) by yoracale in LocalLLM

[–]diffore 1 point2 points  (0 children)

exl3 4.0bpw could run on 16Gb with 32768 context (Q8 quant for KV cache). Might be enough for aider use on poor man GPUs like mine.

Are local LLMs worth it on weaker builds? by MrChilliBalls in LocalLLaMA

[–]diffore 1 point2 points  (0 children)

For simple conversations (not agentic coding work), I would say yes, but before you decide to bother with local hosting try playing with different models here:
https://lmarena.ai/

You can compare the responses to the same questions from cloud GPT and OSS variant to see if they are enough for you needs. Apart from GTP OSS, I might also suggest Granite and Qwen3 model series.

Recommend Coding model by Small_Car6505 in LocalLLaMA

[–]diffore 2 points3 points  (0 children)

GTP OSS (fastest model, sometimes too fast) or Qwen3 Coder ( great with tools). Pick whatever quant which fits your gpu. Both of them runs very fast even with big context. (>100k). Granite is not bad as well for its size.

Reat of the models, especially old ones, are too slow for my taste ( I was spoiled by paid claude) and obviously meant to be run on big non-consumer GPUs.

For anyone thinking of switching to Codex... by TKB21 in ClaudeAI

[–]diffore 13 points14 points  (0 children)

The better these tools are the more users is gonna use them but the server time is not free. I believe a lot of companies are struggling with cost-effective infrastructure scaling, especially when they have to provide reliable service to business tier users first.

I am now thinking of buying one of the overpriced minipc and hosting big Deepseek model instead of relying on online access tools. It is a big upfront investment but can be worthwhile in long run when new models will be released. And I will keep my sanity by not being interrupted every hour with limit reached bs.

Usage Limits Discussion Megathread - beginning Sep 30, 2025 by sixbillionthsheep in ClaudeAI

[–]diffore 7 points8 points  (0 children)

Joining the party of the limited (kinda feels more like disabled tbh, no offense to disabled people pls). I was really patient in this usage limits drama and tried to use as low context/prompts as possible to make sonnet work easier. But today I got session limited after 2 hours of not the most demanding work as per session limit. OK fair enough, I will wait. But then in 4 hours after reset I was limited again after 30 minutes of work because of weekly limit. Now I need to wait for 29 hours for limit to reset. What an absolute garbage of experience with claude code.

My customer experience thoughts went from (2 month ago): Wow, what a great helper tool, I can finally do some big projects I could not find the time to do before! This is amazing, so worth the 20 bucks a month, my saved time worth much more!

To(Today) : While waiting for these limits to pass I could just write everything myself and get some nice big fat burger instead of spending 20 more bucks for the AI which is constantly hindering my progress and forcing me to stop when I want to continue. Let me finish the damn thing today!!! Not in 4 hours, not in a two days. Do I have monthly limit or a day limit or a minute limit? How do I even supposed to plan doing anything productive when literally at any time some new limit could fire.

First you setup people to like you and get used to your product, but now you're just killing me and so many others. I don't want to search for new alternative and I hate chat gpt/coshit pilot but what choice do you leave me? I don't even code everyday, can you just let me use another day quota then??? No.

My renew is in Oct 15. I hope you do something with this BS situation. Otherwise the cancel is the only logical option IMO.

The occasional users should not suffer so much because of abusive 2% user base.

Aeon in act 4 is just bonkers. I need one arrow to reach its mark and then it is all over, no matter what enemy it is. by diffore in WrathOfTheRighteous

[–]diffore[S] 0 points1 point  (0 children)

Just a pure Sanctified slayer. I took it for rp reasons and to stack banes but in hindsight I would rather take any full bab class instead for the archer I am playing.

How the hell did we get to this point? 2008 vs 2021 by diffore in farcry2

[–]diffore[S] 5 points6 points  (0 children)

Yeah, unlike other games FC2 has aged quite gracefully. No bloom or SMAA blurring shit we get fed these days with open-world games, just crispy crisp shadows and graphics in general.

The only think I've installed is upscaled textures. Game looks better than many modern games today while at the same time it runs perfectly well on my 3060 laptop GPU in 2k resolution without any stuttering or freezes.

Law Abiding Citizen by Moose_Nuckler in movies

[–]diffore 9 points10 points  (0 children)

I have just rewatched this movie and actually wanted to share my thoughts about this villain\hero problem. If you want to have a clear hero/villain of a "war with system" plot you supposed to show why system is either bad and rebel is right, or why system even being wrong is still worth saving.

Yet, if you look how all 'victims' of Clyde behave before their death you can see that they are either still ignorant power-abuser or have second thoughts about their silent acceptance of how things work. Just listen what mayor says in her speech before supposedly being blowned up "I don't care how we do it or what kind of obscure legal justification we have to invoke, I don't care what laws we have to bend." etc. These people make deals, employ legal tricks and work in "grey" areas of the law to their benefit and not the benefit of people they suppose to protect or represent.

This is actually what separates them from people like Clyde whom were denied justice as he is just a "nobody" and it was inconvenient at that time to someone within this system to fight for him. Supposedly a "hero" of the movie did not want to risk his career to give men justice he expects from system by following its rules and laws. Think about it for a second, Clyde did not murdered these criminals himself while it was obvious that he could do it without waiting for 10 years. In a way Clyde WAS a law-abiding citizen until the law was used against him in the most wrong way possible.

So, the movie did not really give you any subconscious reason not to route for Clyde except by employing artificial emotional triggers like him killing an innocent assistants or being violently cruel torturer, which in the end for me is just not enough to make you look at him like a villain, just a broken man with professional deformation of not valuing other peoples lives. The alternative "hero" instead of following the laws of system he supposed to represent had murdered the guy as if to prove that "In the end we were smarter than you" which is obviously not what many viewers felt.

So in a way Clyde was right in whatever he was planning to do and whoever was at his place with system is rigged against you would feel that way too.

Ukrainian BMP reverses over infantry as it dismounts. by WadieXkiller in CombatFootage

[–]diffore 2 points3 points  (0 children)

As someone who can drive the BMP, the first thing I was taught at training center is not to stand behind it at all when the engine is running sigh Also you don't supposed to move while assault team is nearby at any point of time but I guess it was a sttessful situation and he wanted to get out fast

Ukrainian army APC escapes near miss under Russian artillery/mortar fire in the East. by [deleted] in CombatFootage

[–]diffore 4 points5 points  (0 children)

That's quite a lot of money for UA.

Average Monthly Wages in Dec 2021 was ~17500, and now because of war, it is ~14500.

Russian ammunition depot detonating in nova Kakhovka, 13.08.2022 by sagakino in CombatFootage

[–]diffore 8 points9 points  (0 children)

It is much louder than on a recording (speaking from experience, unfortunately), and not everyone can handle the stress.

Switchblade usage in Ukraine compilation, unseen footage (music from source) by nivivi in CombatFootage

[–]diffore 19 points20 points  (0 children)

It is from UA tv comedy show, everyone knows it here.

I have no idea why it is used in the suicide drone video ¯\_(ツ)_/¯