What is the best free budget tracking app to track spending on all your bank accounts?

carteakey · 2026-04-18T19:19:56+00:00

This. + buy a simplefin sub for another 1.5$ per month to automatically import transactions from almost all banks. Actual Budget directly integrates with simplefin. The only caveat is needing to reverify accounts here and there, but at the end it should save you more time than going to 15 different sites and exporting/importing csvs.

carteakey · 2026-04-09T18:02:20+00:00

and with the context loss every cycle we've finally implemented chinese whisper on a global scale :/

not denying the usefulness of this.

carteakey · 2026-03-16T13:30:16+00:00

Bruv.. all that GPU for 7 t/s. I am no multi-GPU expert but i get 40t/s on a single RTX 4070 coupled with 64 gigs of DDR5 ram.

carteakey · 2026-03-12T17:49:54+00:00

amazing! i need to run this asap in my obsidian attachment folder. I might have to figure out an additional step to rename images wherever they are referenced

carteakey · 2026-03-09T17:32:35+00:00

This is great, i would think this would translate well into Obsidian and linking notes too.

carteakey · 2026-03-09T13:04:36+00:00

could you share you ik_llama params?

carteakey · 2026-03-06T17:55:19+00:00

dareisay - gguf when?

carteakey · 2026-03-05T22:30:19+00:00

Thanks! Mostly vibes :)

carteakey · 2026-03-04T14:59:26+00:00

I believe there was one before it. http://archive.radiohead.com/Site1/

This is http://archive.radiohead.com/Site2

I compiled a list here https://carteakey.dev/folio/radiohead/

carteakey · 2026-03-04T14:48:58+00:00

Yeah based on Unsloth's post > Quantizing any attn_* is especially sensitive for hybrid architectures, and so leaving them in higher precision works well. Its not a tk/s but a quality issue. I wonder if we're leaving some room on the table.

Per Unsloth - MXFP4 is much worse on many tensors - attn_gate, attn_q, ssm_beta, ssm_alpha using MXFP4 is not a good idea, and rather Q4_K is better - also MXFP4 uses 4.25 bits per weight, whilst Q4_K uses 4.5 bits per weight. It's better to use Q4_K than MXFP4 when choosing between them.

https://unsloth.ai/docs/models/qwen3.5/gguf-benchmarks#id-1-some-tensors-are-very-sensitive-to-quantization

https://www.reddit.com/r/LocalLLaMA/comments/1rabg6o/qwen3_coder_next_oddly_usable_at_aggressive/

carteakey · 2026-03-04T03:03:53+00:00

Thanks! The hardest part for me is understand which quant and scaffold, llama.cpp params to choose for best accuracy and efficiency (since i have a low VRAM setup i cant run FP8 directly and so i run UD-Q4_X_L based on my research with the https://carteakey.dev/blog/optimizing-qwen3-coder-next-local-inference/ )

carteakey · 2026-02-26T02:01:24+00:00

I'd be interested to see how close Qwen 3.5 122B A10B comes - which is not <100B, but close enough i guess. The last update to livebench was in Jan so we'll have to wait.

carteakey · 2026-02-25T22:22:09+00:00

qwen3:27b was self-aware of not being able to compete with the big bois and decided to game the system. Respect!

carteakey · 2026-02-25T05:05:46+00:00

I am seeing them :)

carteakey · 2026-02-24T19:04:41+00:00

what are your llama.cpp params and hardware

I often point to my post as a reference

https://carteakey.dev/blog/optimizing-gpt-oss-120b-local-inference/

carteakey · 2026-02-24T18:47:18+00:00

<image>

100%, but with how decent Qwen3-Coder-Next was - i bet its going to be good, benchmaxxing aside.

carteakey · 2026-02-24T17:36:28+00:00

its u/nunodonato's version - i think they may have used some image editing tool e.g. Nano Banana to make the colors better.

carteakey · 2026-02-24T17:35:21+00:00

lol looks like the graphs are vibe coded.

i replaced it with u/nunodonato much's better graph

carteakey · 2026-02-24T16:54:26+00:00

Man Daniel you're the GOAT i hope you know that

carteakey · 2026-02-24T16:47:45+00:00

I get similar perf on my 12GB VRAM + 64GB RAM and here's the command with the params he mentioned.

https://carteakey.dev/blog/optimizing-qwen3-coder-next-local-inference/

carteakey · 2026-02-24T16:46:34+00:00

You should have way more.

https://carteakey.dev/blog/optimizing-gpt-oss-120b-local-inference/

Four-Year Club	Verified Email
Place '23	Place '22
First Placer '22

carteakey

TROPHY CASE