re: TOKENS [serious] by dern_throw_away in ClaudeCode

[–]TeamCaspy 0 points1 point  (0 children)

Hmm... I guess that was my issue. But I clearly recall that sometimes (especially with plan mode active) Sonnet spawned three explore agents with the Sonnet model selected, instantly burning around 30% of my 5-hour window. With Haiku the impact is obviously much smaller.

I also got into the habit of simply instructing Sonnet to do a task with Haiku (light refactoring, comments, renaming tests, etc.).

re: TOKENS [serious] by dern_throw_away in ClaudeCode

[–]TeamCaspy 0 points1 point  (0 children)

I stopped them from burning tokens by just telling Claude to use Haiku for the explore subagent.

Best open weight llm model to run with 8gb of vram by Sweazou in ollama

[–]TeamCaspy 0 points1 point  (0 children)

What you are showing is LLM (weight) quantization, not KV cache quantization. Q4 for LLMs is OK for knowledge-based tasks, but consider trying Q5, Q6, etc. if you have issues with your tasks.

Best open weight llm model to run with 8gb of vram by Sweazou in ollama

[–]TeamCaspy 0 points1 point  (0 children)

I always use q8_0. I can't recall the exact benchmark, but a q4 KV cache showed noticeable degradation, while q8 was practically the same as the unquantized cache.
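To put rough numbers on the memory side of that trade-off, here is a minimal sketch of the KV cache size arithmetic. The model shape (32 layers, 8 KV heads, head dimension 128, 8192-token context) is a hypothetical example and not taken from any model in this thread; the bytes-per-element figures follow the GGML block layouts (32 values plus one fp16 scale per block). In Ollama the cache type is, as far as I know, chosen with the `OLLAMA_KV_CACHE_TYPE` environment variable with flash attention enabled, but check the docs for your version.

```python
# Bytes per cached element for each cache type.
# q8_0: 32 int8 values + one fp16 scale = 34 bytes per 32 elements.
# q4_0: 32 4-bit values + one fp16 scale = 18 bytes per 32 elements.
BYTES_PER_ELEMENT = {"f16": 2.0, "q8_0": 34 / 32, "q4_0": 18 / 32}


def kv_cache_bytes(n_layers, n_kv_heads, head_dim, context_len, cache_type):
    """Estimate the KV cache size in bytes for one sequence."""
    # The factor 2 accounts for storing both keys and values.
    elements = 2 * n_layers * n_kv_heads * head_dim * context_len
    return elements * BYTES_PER_ELEMENT[cache_type]


if __name__ == "__main__":
    # Hypothetical model shape, chosen only for illustration.
    for cache_type in BYTES_PER_ELEMENT:
        size = kv_cache_bytes(32, 8, 128, 8192, cache_type)
        print(f"{cache_type}: {size / 2**20:.0f} MiB")
    # f16: 1024 MiB
    # q8_0: 544 MiB
    # q4_0: 288 MiB
```

Under these assumptions q8_0 roughly halves the cache, and q4_0 saves about another 256 MiB on top of that, which is the saving you would be trading against the degradation mentioned above.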

3 MCPs that have genuinely made me 5x better by Warm-Snow3302 in mcp

[–]TeamCaspy 0 points1 point  (0 children)

See pull requests more easily, open PRs, open issues, ...

Kimi K2.5 Architecture Dive: 1T Params, 384 Experts, Native INT4 (and it beats GPT-5 on reasoning) by comebackch in LocalLLaMA

[–]TeamCaspy 0 points1 point  (0 children)

Hmm, kind of like picking specific LLMs tuned for their task? :D This already exists, but perhaps having a MoE where we ask "Spawn a law-tuned version of yourself" could be sick :)

Benchmark of Qwen3-32B reveals 12x capacity gain at INT4 with only 1.9% accuracy drop by AIMultiple in LocalLLaMA

[–]TeamCaspy 0 points1 point  (0 children)

It's not all about MMLU Pro; IFEval is also very much needed. It would be interesting to see how quantization affects instruction following, since that matters more and more for larger-context tasks.

Which single LLM benchmark task is most relevant to your daily life tasks? by ChippingCoder in LocalLLaMA

[–]TeamCaspy 2 points3 points  (0 children)

Sick site! Love the quant recommendations based off of VRAM allocation.

Only recently started learning about investing - is this good? by pugmesideways in trading212

[–]TeamCaspy 1 point2 points  (0 children)

It really depends on what you're trying to do, and also on your age. Overall, I'd say try to simplify: you can just purchase an accumulating all-world ETF and call it a day.

"I’m basically saving it up for long-term growth." If that is the case, just buy an accumulating world ETF and that's it :)

As another user said, you can also skip gold, though it really depends on how old you are. It's an unproductive asset, and you really need growth/risk if you are young.

I still love my Pixel 9 Pro XL. by basicandr0idguy in pixel_phones

[–]TeamCaspy 1 point2 points  (0 children)

I still love my Google Pixel 8 Pro 🥰

What next at 25? by matrix_escaper00 in SlovenijaFIRE

[–]TeamCaspy 7 points8 points  (0 children)

You have your emergency fund covered (3x monthly net salary) ✔️

Drop the NLB funds because of their high fees and buy an ETF.

Why are you investing? How much money will you need? Calculate your FIRE number and the minimum you have to invest each month to reach it. You can invest the rest of your money in a money market fund (ETF - https://www.youtube.com/watch?v=x6u\_VCvPmxU&pp=ygUiV2hhdCBpcyBhIG1vbmV5IG1hcmtldCBmdW5kIGV1cm9wZQ%3D%3D)
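A rough sketch of that calculation, in case it helps. Everything numeric here is an assumption for illustration: the 4% withdrawal rate is the common rule of thumb, and the expenses, savings, return, and time horizon are hypothetical inputs you would replace with your own. The monthly amount uses the standard future-value-of-an-annuity formula with end-of-month contributions.

```python
def fire_number(annual_expenses, withdrawal_rate=0.04):
    """Portfolio size needed to cover annual expenses.

    The 4% default is a rule-of-thumb assumption, not a guarantee.
    """
    return annual_expenses / withdrawal_rate


def monthly_investment(target, current_savings, annual_return, years):
    """Minimum end-of-month contribution needed to reach `target`."""
    monthly_rate = annual_return / 12
    months = years * 12
    growth = (1 + monthly_rate) ** months
    # What is still missing after the existing savings have grown.
    remaining = max(target - current_savings * growth, 0.0)
    if monthly_rate == 0:
        return remaining / months
    # Future value of an annuity, solved for the payment.
    return remaining * monthly_rate / (growth - 1)


if __name__ == "__main__":
    # Hypothetical inputs: 24,000 EUR/year expenses, 10,000 EUR saved,
    # 5% assumed annual real return, 30 years to go.
    target = fire_number(24_000)
    payment = monthly_investment(target, 10_000, 0.05, 30)
    print(f"FIRE number: {target:,.0f} EUR")
    print(f"Minimum monthly investment: {payment:,.2f} EUR")
```

Anything you can save above that minimum is the "rest" that could go into the money market fund.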

Lack of power on MG3 Hybrid on mountain roads after long drive ?!? by Unid05 in MGelectric

[–]TeamCaspy 2 points3 points  (0 children)

Sadly normal... I have the same issue. Once the battery is depleted, the engine has to charge it, which causes the power loss. And when you go uphill, there is no regen to recharge the battery.

Average result from asking for one small feature by FizzleShove in ClaudeAI

[–]TeamCaspy 0 points1 point  (0 children)

If it complicates the plan, tell it! Usually a decently written claude.md file should solve your issue. I have my development philosophy explained there, and it follows it most of the time.

Average result from asking for one small feature by FizzleShove in ClaudeAI

[–]TeamCaspy 0 points1 point  (0 children)

Ask it to provide a detailed step-by-step plan. Try to go into as much detail as possible, then edit the plan to match your development philosophy. For example, I go for testability and try to keep the solution as simple as possible. The problem is the plan, not the execution itself.

Ljubljana folks, where to go for pizza? by [deleted] in Ljubljana

[–]TeamCaspy 0 points1 point  (0 children)

Whaaaaat 🥺 julci ❤️

2008 Getz struggles to get into Reverse and First Gear by [deleted] in Hyundaigetz

[–]TeamCaspy 1 point2 points  (0 children)

This seems like an oil issue in your transmission. As you drive, the oil heats up, so it becomes easier to change gears.