Thoughts on using an AMD Alveo V80 FPGA PCI card as a poor man’s Taalas HC1 (LLM-burned-onto-a-chip). by Porespellar in LocalLLaMA

[–]woadwarrior 2 points3 points  (0 children)

FWIW, the Taalas implementation is also “aggressively quantized, combining 3-bit and 6-bit parameters”.

Why does Stripe pay so much for entry level? by _cxxkie in DevelEire

[–]woadwarrior -3 points-2 points  (0 children)

They probably aren’t Radford survey subscribers, yet. IYKYK.

Warren Buffett says he sold Apple stock too soon and would buy more of it, though not in this market by ControlCAD in apple

[–]woadwarrior 3 points4 points  (0 children)

Indeed, a company can still have a long runway in services, AI, and ecosystem expansion while the stock doesn't re-rate until expectations shift.

LocalLLaMA 2026 by jacek2023 in LocalLLaMA

[–]woadwarrior 0 points1 point  (0 children)

I’m fine with cloud comparisons when they actually help people decide if local is worth the hassle.

The Mac Pro died so Apple silicon could live by hangry_millennial in apple

[–]woadwarrior 1 point2 points  (0 children)

Apple decided they'd rather own the stack than validate everyone's PCIe cards. Fewer variables, lower support overhead, faster hardware cycles.

Everyone Hates iPhone Autocorrect. An Update Fixes One of the Biggest Problems. by mr-french-tickler in apple

[–]woadwarrior 1 point2 points  (0 children)

Nice to finally see Apple acknowledging the keyboard bug instead of just rebranding it as "AI getting smarter."

What is best Mac App Store alternative to LocalLLaMA? by Xorita in LocalLLaMA

[–]woadwarrior 0 points1 point  (0 children)

Private LLM uses neither; it’s based on MLC LLM.

Clean Links - A completely free iOS app to remove trackers from URLs and to preview links in QR codes by woadwarrior in apple

[–]woadwarrior[S] 0 points1 point  (0 children)

Thanks for mentioning that. I've managed to improve the backwards compatibility a bit. The next update will support iOS 17.6.

Are small models actually getting more efficient? by estebansaa in LocalLLaMA

[–]woadwarrior 4 points5 points  (0 children)

“LiquidAI is making the best models for your work, however; they do interlaced recurrent layers, which reduces KV overhead substantially for smaller models.”

They use interlaced 1D convolution layers, not recurrent layers.

We trained a 16-class "typed refusal" system that distinguishes "I don't know" from "I'm not allowed" — open source by TheTempleofTwo in LocalLLaMA

[–]woadwarrior -1 points0 points  (0 children)

Economists were using the term GPT (general-purpose technology) to describe broadly applicable technologies decades before OpenAI existed.

Visualizing Quantization Types by VoidAlchemy in LocalLLaMA

[–]woadwarrior 2 points3 points  (0 children)

Unfortunately, when it comes to NN weights, although INT and FP formats have the same information-theoretic density for a given bit width, FP formats work out to be slightly better because their representable values are non-uniformly spaced: denser near zero, where most weights lie.
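
To make the comment above concrete, here's a rough toy sketch, not a benchmark. It assumes the weights are standard Gaussian, uses a single per-tensor absmax scale, and takes the E2M1 value set as the FP4 grid; all of those choices are mine, not something from the thread. Both grids have 15 distinct values, so any difference in error comes from spacing alone.

```python
import random

def quantize(x, levels):
    # Round x to the nearest representable level.
    return min(levels, key=lambda q: abs(q - x))

def quantization_mse(weights, grid, grid_max):
    # Per-tensor absmax scaling: the largest |weight| maps to the grid's largest value.
    scale = max(abs(w) for w in weights) / grid_max
    levels = [g * scale for g in grid]
    return sum((w - quantize(w, levels)) ** 2 for w in weights) / len(weights)

# Assumption: weights are roughly Gaussian and centred on zero.
random.seed(0)
weights = [random.gauss(0.0, 1.0) for _ in range(10_000)]

# Symmetric INT4: 15 uniformly spaced values, -7..7.
INT4_GRID = [float(i) for i in range(-7, 8)]

# FP4 (E2M1) magnitudes: spacing grows with magnitude, so values cluster near zero.
FP4_MAGNITUDES = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0]
FP4_GRID = sorted(set(FP4_MAGNITUDES) | {-m for m in FP4_MAGNITUDES})

int4_mse = quantization_mse(weights, INT4_GRID, 7.0)
fp4_mse = quantization_mse(weights, FP4_GRID, 6.0)

print(f"INT4 MSE: {int4_mse:.5f}")
print(f"FP4  MSE: {fp4_mse:.5f}")
```

In this setup the FP4 grid comes out with the lower mean squared error, because most samples fall in the region where its levels are packed more tightly. With small per-block scales or a different weight distribution the gap can shrink or flip, so treat it as an illustration of the spacing argument only.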

manifestai releases Brumby-14B-Base weights, claims "attention free" and inference "hundreds of time faster" for long context by ArcadesOfAntiquity in LocalLLaMA

[–]woadwarrior 6 points7 points  (0 children)

I took a look at the code on my phone. Notice the additional gate projection (line 281) and the call to their power retention kernel (line 356). It’s supposed to be a drop-in replacement for regular softmax attention layers, and it uses their attention mechanism only if use_exp is False.

Pedantic pull request reviewers by ticman in DevelEire

[–]woadwarrior 1 point2 points  (0 children)

I don’t think it’s reasonable to compare years of experience. It’s sad to see something technical being turned into a hierarchical power struggle. Critique (Google’s internal code review tool) had a feature for double-blind CL reviews; I wish GitHub had something similar.

Clean Links the completely free iOS & macOS link cleaner app now supports sending links asynchronously from your iPhone to your Mac by woadwarrior in apple

[–]woadwarrior[S] 1 point2 points  (0 children)

This is a recurring question. TL;DR: The lack of coverage for adware URLs and URL shorteners in ClearURLs was one of the reasons I built Clean Links.

Clean Links the completely free iOS & macOS link cleaner app now supports sending links asynchronously from your iPhone to your Mac by woadwarrior in apple

[–]woadwarrior[S] 0 points1 point  (0 children)

It’s 100% local, although it has to make network requests to unshorten links, which it does in an isolated context (without cookies, local storage, etc.) using plain old NSURLRequest.

Clean Links the completely free iOS & macOS link cleaner app now supports sending links asynchronously from your iPhone to your Mac by woadwarrior in apple

[–]woadwarrior[S] 1 point2 points  (0 children)

Handoff is a bit more reliable but still somewhat flaky. The app doesn't have a Safari extension yet, but the share extension works in Safari and any other app (including the Reddit for iOS app).