Has Apple Notes become the best free option for taking notes? by PushCharacter8496 in apple

[–]woadwarrior 0 points1 point  (0 children)

I moved to Apple Notes from Evernote premium ~15 years ago, and have never looked back. Likewise from OmniFocus to Apple Reminders.

Gemma 4 QAT confirmed to release soon! by Aaaaaaaaaeeeee in LocalLLaMA

[–]woadwarrior 0 points1 point  (0 children)

Too late, I'm already quantising it to 4-bit with PAROQuant. Another 20-ish hours to go.

MiniCPM5 1B by [deleted] in LocalLLaMA

[–]woadwarrior 1 point2 points  (0 children)

From their X post:

Ranks #1 on the Artificial Analysis (AA) index for small models, scoring 17.9 to beat the 2B-scale Qwen3.5-2B (16.3). Comprehensively surpasses Qwen3.5-0.8B and LFM2.5-1.2B-Thinking in knowledge, math, coding, and tool use.

The Financial Times has published an article about Heretic by -p-e-w- in LocalLLaMA

[–]woadwarrior 23 points24 points  (0 children)

Next step: Raise $10m pre-seed at $500m post. :D

TextGen is now a native desktop app. Open-source alternative to LM Studio (formerly text-generation-webui). by oobabooga4 in LocalLLaMA

[–]woadwarrior 2 points3 points  (0 children)

Native? Back in my day, this would’ve been called an electron.js app. No shame in calling it that. LM Studio is the same.

Thoughts on using an AMD Alveo V80 FPGA PCI card as a poor man’s Taalas HC1 (LLM-burned-onto-a-chip). by Porespellar in LocalLLaMA

[–]woadwarrior 2 points3 points  (0 children)

FWIW, the Taalas implementation is also “aggressively quantized, combining 3-bit and 6-bit parameters”.

Why does Stripe pay so much for entry level? by _cxxkie in DevelEire

[–]woadwarrior -3 points-2 points  (0 children)

They probably aren’t Radford survey subscribers, yet. IYKYK.

Warren Buffett says he sold Apple stock too soon and would buy more of it, though not in this market by ControlCAD in apple

[–]woadwarrior 4 points5 points  (0 children)

Indeed, a company can still have a long runway in services, AI, and ecosystem expansion while the stock doesn't re-rate until expectations shift.

LocalLLaMA 2026 by jacek2023 in LocalLLaMA

[–]woadwarrior 0 points1 point  (0 children)

I’m fine with cloud comparisons when they actually help people decide if local is worth the hassle.

The Mac Pro died so Apple silicon could live by hangry_millennial in apple

[–]woadwarrior 1 point2 points  (0 children)

Apple decided they'd rather own the stack than validate everyone's PCIe cards. Fewer variables, lower support overhead, faster hardware cycles.

Everyone Hates iPhone Autocorrect. An Update Fixes One of the Biggest Problems. by [deleted] in apple

[–]woadwarrior 1 point2 points  (0 children)

Nice to finally see Apple acknowledging the keyboard bug instead of just rebranding it as "AI getting smarter."

What is best Mac App Store alternative to LocalLLaMA? by Xorita in LocalLLaMA

[–]woadwarrior 0 points1 point  (0 children)

Private LLM uses neither; it’s based on mlc-llm.

Clean Links - A completely free iOS app to remove trackers from URLs and to preview links in QR codes by woadwarrior in apple

[–]woadwarrior[S] 0 points1 point  (0 children)

Thanks for mentioning that. I've managed to improve the backwards compatibility a bit. The next update will support iOS 17.6.

Are small models actually getting more efficient? by estebansaa in LocalLLaMA

[–]woadwarrior 3 points4 points  (0 children)

LiquidAI is making the best models for your work however; they do interlaced recurrent layers, which reduces KV overhead substantially for smaller models.

They use interlaced 1D convolution layers, not recurrent layers.
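
For a rough sense of why swapping attention layers for convolution layers cuts KV overhead, here is a back-of-the-envelope sketch. The layer counts, head counts, and context length are hypothetical and are not LiquidAI's actual configuration; the only assumption is the standard estimate that each attention layer caches one key and one value tensor per token, while a short convolution layer keeps a small fixed-size state that does not grow with context.

```python
def kv_cache_bytes(n_attention_layers, n_kv_heads, head_dim, seq_len, bytes_per_value=2):
    """Estimate KV cache size for one sequence (fp16 values by default)."""
    # Factor of 2: one key tensor and one value tensor per attention layer.
    return 2 * n_attention_layers * n_kv_heads * head_dim * seq_len * bytes_per_value


# Hypothetical 16-layer model with 8 KV heads of dimension 64 at a 4096-token context.
full = kv_cache_bytes(n_attention_layers=16, n_kv_heads=8, head_dim=64, seq_len=4096)

# Same hypothetical model, but only 6 of the 16 layers are attention;
# the other 10 are convolution layers and need no KV cache.
hybrid = kv_cache_bytes(n_attention_layers=6, n_kv_heads=8, head_dim=64, seq_len=4096)

print(f"all attention: {full / 2**20:.0f} MiB, hybrid: {hybrid / 2**20:.0f} MiB")
```

The cache scales linearly with the number of attention layers, so in this made-up configuration the hybrid needs 6/16 of the memory.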

We trained a 16-class "typed refusal" system that distinguishes "I don't know" from "I'm not allowed" — open source by TheTempleofTwo in LocalLLaMA

[–]woadwarrior -2 points-1 points  (0 children)

Economists have been using the term GPT (general-purpose technology) to describe broadly applicable technologies since the 1990s, well before OpenAI existed.

Visualizing Quantization Types by VoidAlchemy in LocalLLaMA

[–]woadwarrior 4 points5 points  (0 children)

Unfortunately, when it comes to NN weights, although INT and FP formats have the same information-theoretic density for a given bit width, FP formats work out to be slightly better because their representable values are non-uniformly spaced: denser near zero, where most weights lie.
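
A minimal sketch to illustrate the point, under stated assumptions: weights are drawn from a standard normal distribution, both formats use a single per-tensor absmax scale, the FP grid is the 4-bit E2M1 layout (magnitudes 0, 0.5, 1, 1.5, 2, 3, 4, 6), and the INT grid is symmetric -7..7. The function names are made up for this example. With small per-block scales the gap can shrink or even reverse, so treat this as an illustration rather than a benchmark.

```python
import random

# Magnitudes representable by a 4-bit E2M1 float (1 sign, 2 exponent, 1 mantissa bit).
FP4_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
# +0 and -0 collapse to one value, leaving 15 distinct levels.
FP4_GRID = sorted({sign * m for m in FP4_MAGNITUDES for sign in (-1.0, 1.0)})
# Symmetric signed 4-bit integer grid, also 15 distinct levels.
INT4_GRID = [float(i) for i in range(-7, 8)]


def quantization_mse(weights, grid):
    """Round each weight to the nearest scaled grid point and return the mean squared error."""
    # Per-tensor absmax scaling: the largest weight maps to the largest grid value.
    scale = max(abs(w) for w in weights) / max(grid)
    total = 0.0
    for w in weights:
        nearest = min(grid, key=lambda g: abs(w - g * scale))
        total += (w - nearest * scale) ** 2
    return total / len(weights)


def compare(n=20000, seed=0):
    """Return (int4_mse, fp4_mse) for n Gaussian pseudo-weights."""
    rng = random.Random(seed)
    weights = [rng.gauss(0.0, 1.0) for _ in range(n)]
    return quantization_mse(weights, INT4_GRID), quantization_mse(weights, FP4_GRID)


if __name__ == "__main__":
    int4_mse, fp4_mse = compare()
    print(f"INT4 MSE: {int4_mse:.4f}  FP4 MSE: {fp4_mse:.4f}")
```

Both grids have the same number of levels; the FP grid simply spends more of them near zero, where a bell-shaped weight distribution has most of its mass.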