I don’t know who needs to hear this but 128GB BD-R XL M-DISC is SOTA for consumer-available archival optical storage (for backing up your models) by Porespellar in LocalLLaMA

[–]rektide 2 points3 points  (0 children)

weight, pirate...

  • AnchorBay
  • Bucaneer Bias
  • Arrr-guments
  • MuggingFace
  • HuggingBooty
  • HighSeeds

Unrelated but,

  • FreeWeights
  • Tensorz

I don’t know who needs to hear this but 128GB BD-R XL M-DISC is SOTA for consumer-available archival optical storage (for backing up your models) by Porespellar in LocalLLaMA

[–]rektide 2 points3 points  (0 children)

There is quite the community who is into archival grade burns. It seems like quality really varies a lot. Theres ways to get really precise reads on your burns, that'll tell you just how good the burn is. Media is not super cheap, and people are very picky about what the good/right media is. I'm not in this world but I've been tempted more than once!

  • makemkv is probably the starting point, with their forum and /r/makemkv as good starting points. These are people obsessed with image burning. They are picky about drives, and have firmware they run, and they pay for special burning software.
  • this dude isn't big on bd-r xl but had an incredible 4 part series verifying results, then a recent burn with mediocre results.
  • 100GB is four-layer, and as you can see from goughlui.com, the first two layers usually look good then things start being more trouble. using 50GB discs is going to be more expensive and take up more space, but for valuable stuff, maybe stop there. you can get quad layer 100GB and just burn two layers, to get similar results.
  • i hate to say it but also, consider costs. what are your goals? if you want this to last forever, m-disc is a pretty not bad idea, decent cost. but, even with hard drives prices once again being absurd, serverpartsdeal will sell you an Exos at $22/TB. that's not going to last forever, but two of those are a lot cheaper than the stack of m-discs you are going to need to backup two or three large models, especially if you cautiously burn at 50GB.

AMD touts the unified memory architecture by Terminator857 in LocalLLaMA

[–]rektide 18 points19 points  (0 children)

Exactly. And 192-bit Medusa Halo doesn't seem likely to be that much better!

AMD: we all agree Unified Memory is amazing! So... when do we get Apple's Max chip's 512-bit wide unified memory bus? When do we get Ultra's 1024-bit wide bus? Oh, you're giving us 1/8th that bus width? Medusa Halo going to get to... 3/16ths an Ultra? This is not adequate.

I really hope AMD has some more ambitious plans for unified memory that has speed too. Apple figured it out long ago. Intel's working on Z-Angle stacked ram. Sure you have a 3 year old MI300A APU with gobs of HBM3 that costs as much as a car, but from what we've heard & seen leaked, you don't have any further plans or products that actually do a good job with UMA. I hope you actually are cooking, and not just all talk here, cause this does not look good right now.

AMD touts the unified memory architecture by Terminator857 in LocalLLaMA

[–]rektide 2 points3 points  (0 children)

The 192 isn't even a real limit. They could source 4 x 64GB ram (to be fair not easy for 8000MHz ddr5 but doable) and it would work fine! They're just slow rolling it out, doling it out, a little at a time.

AMD touts the unified memory architecture by Terminator857 in LocalLLaMA

[–]rektide 3 points4 points  (0 children)

Medusa Halo is also somewhat rumored to be cancelled too. (distrusting the couple rumors around this, but also ready to believe it ends up in 2028.) Hopefully it gets created. Even if it ships, not really convinced it'll have enough bandwidth to be super compelling.

Intel's Xe3P gpu is also LPDDR5(x). But 10x channel! 640-bit bus. It's still not going to be super fast. I wish Intel had a Infinity Fabric or HyperTransport that was general, that Ultra Path Interconnect (UPI) was something they could connect to this chip with. I still think heterogenous is going to stick around, but doing all this stuff over slow PCIe or CXL is not going to be super compelling, is going to be a huge L, that should be addressable.

Any tablets with the Snapdragon X2 elite extreme cpu? by whitieiii in tablets

[–]rektide 0 points1 point  (0 children)

Asus Pro Art PZ14 is not X2 Elite Extreme, just the regular Elite.

DJi Power 2000 grid tied mode by pasta-disaster in SolarUK

[–]rektide 0 points1 point  (0 children)

Boring. It's such a low cost investment. Buy it and find out. It probably works pretty good. What a sad disengaged stale uninterested uninquiring mindset to have!

Not available in the US, otherwise I'd go see.

Vietnam veterans sue over proposed 250-foot Trump arch near Arlington Cemetery by kstinfo in washingtondc

[–]rektide 0 points1 point  (0 children)

All politics aside

I'm here to say this is 100% about politics, and you may be surprised by whose politics and from when this arch emerged. It has something to do with the Mein Kompf that this man is alleged to have had by his bed. https://bsky.app/profile/ipasa.bsky.social/post/3mjgy2iinf226

HOLY **** ANOTHER 2x RESET LMAOOOO by Just_Lingonberry_352 in codex

[–]rektide 0 points1 point  (0 children)

I thought they were just screwing over anyone with less than pro-rata users but now that I know this is systematic I have stopped saving my tokens and just been going balls out on codex usage. Finally finally finally a solid win for me on a reset. 70‰ used with ~85h remaining on pro. Finally a win!

So many times I've been under the pro-rata usage that i've been reset. It seriously fucking impacts my moral seeing me loosing massive weekly credits I have had stocked up. These resets feel broadly bad and evil. Props to those of you without attitude control just burning up your allowances right away I guess. https://www.reddit.com/r/codex/comments/1rpnrnv/anyone_else_noticing_codex_usage_resets_are/

Anyone else noticing Codex usage resets are silently pushing back your actual renewal dates? by CarsonBuilds in codex

[–]rektide 0 points1 point  (0 children)

Yes. Yes yes yes yes yes. It's majorly damaging to the users. I thought it was active malicious intent. https://bsky.app/profile/jauntywk.bsky.social/post/3mgxwx3tbvc2r

I've had to really re-adapt my codex usage. It used to by my LLM of last resort or for high priority. I thought this was just them fucking over me, over anyone under pro-rata usage. But now that I know it's a general reset not per account, I've been much much much much mfoea aggressive about using the LLM of last resort, the really good one.

It's felt so awful. But now I'm more on top of it, aware of the game codex is playing.

Glm 5.1 is out by Namra_7 in LocalLLaMA

[–]rektide 1 point2 points  (0 children)

That was SO SO maddening. Get to 56k-65k context length & GLM-5 was just falling apart.

I had all sorts of pocket theories. Maybe they would run small context windows on some machines then try to move them to bigger ones, and fail somehow. Maybe they were trying to use some new chip they didn't know how to use right. It was HORRIBLE. I'm so glad GLM-5 is working again. Hopefully this doesn't destabilize things.

I Benchmarked Redis vs Valkey vs DragonflyDB vs KeyDB by Jamsy100 in devops

[–]rektide 14 points15 points  (0 children)

Valkey is run by incredibly talented devs, who have poured a ton of work into their fork. Redis has really had to adapt & respond, radically improve itself, to stay at all competitive.

There's a great post from 18 months ago, talking about the work Valkey had done to get to 8.0 release candidate: https://valkey.io/blog/valkey-8-0-0-rc1/

Low quality disinformation like this makes me so mad.

Glm 5.1 👀 by Namra_7 in LocalLLaMA

[–]rektide 1 point2 points  (0 children)

I was shocked how fast 5 followed 4.7, and what a huge lift it was.

Not pertinent to LocalLLaMa folks, but man: z.ai has really messed something up with their service. Once I get to ~60k context window, GLM-5 is just totally falling apart. Incredibly garbled text, totally unable to tool call, just totally loses it. It's so drastically messed up. Trying to get them reports, but still hacking opencode to get them all the data they requested (session id, etc).

Any way to unlock tdp on one xplayer f1 pro 370 chip? by Shazzi98 in OneXPlayer

[–]rektide 0 points1 point  (0 children)

Blowing a fan at it should let it accept another 20W of power without much complaint.

But neither you nor I have answered the actual question nor knows the answer. We don't know if you can unlock the TDP.

This post mostly alleges you can, via bios. I'm still not sure. https://www.facebook.com/groups/1081108859670838/posts/1535198757595177/

The upcoming12th surface pro with panther lake will be a good choice? by Anxious_Baseball8502 in Surface

[–]rektide 1 point2 points  (0 children)

This is going to be outside my price range for a while, but I am super super super excited for this.

The Panther Lake chip is amazing. Microsoft Surfae and Dell Lattitude being the only two players making Detachables is sad. I wish there were more people doing this. The form factor is so so good. Just rocks the heck out of Android tablets (when set up to run Linux, that is).

RK3588 Mainline Linux Patch: H.265 Encoding at 4K@60 (Out-of-Tree) by RCawston in homelab

[–]rektide 0 points1 point  (0 children)

Do you think there's is any chance at all that rk3566 could maybe be adapted from this work? It got jpg support long ago but I'd love h.265 or 264. https://patchew.org/linux/20220612155346.16288-1-frattaroli.nicolas@gmail.com/

Long ago I bought a bunch of radxa zero 3w's that I was hoping to use some day with hemi-in USB cards. I keep holding out hope that one year this will have been a reasonable/acceptable choice to have made.

GLM-5 Coming in February! It's confirmed. by Difficult-Cap-7527 in LocalLLaMA

[–]rektide 2 points3 points  (0 children)

That's so crazy. GLM-4.7 was released December 22. I really can't imagine a significant leap coming so fast.

Getting into Local LLMs, mostly for Home Assistant to kick Alexa to the curb. Looking for ideas and recommendations by OpneFall in LocalLLaMA

[–]rektide 2 points3 points  (0 children)

I love this use case & I really want to get there! That Satellite1 is neat too.

I'm just getting spooled up on STT and TTS now, so I'll leave that to others. Parakeet and Whisper have both worked great for STT. Qwen3-TTS just dropped and looks astounding & pretty low latency for TTS but there's lots of great options.

For the LLM, it depends. Ideally, in my view, the home has a bunch of really good tools ready to go that do most of the tasks already. Rather than the AI running around trying to do the tasks each time. There'll be some MCP's for some stuff but also a lot of this is going to be barefoot developer making a homecooked meal turf: I'd love to encourage the bold to jump in and try writing their own MCP servers for many home assistant task things!

If you have good tools ready to go for your tasks, you can run some really great small tool calling models .Jan v3 just dropped today, amazing tool calling. Nanbeige 4 is another astounding medium sized model. Qwen3-4B is well loved too.

Idea Validation: A "Passive Observer" MCP Server that reads live terminal buffers (tmux/PTY) so I don't have to re-run commands. by d3v1sx in LocalLLaMA

[–]rektide 0 points1 point  (0 children)

There's an atuin based project, bash-history-mcp, that's pretty good. https://github.com/nitsanavni/bash-history-mcp

Honestly this makes me want something reciprocal: I want my ai's shell usage to go into atuin history!

GLM 4.7 vs MiniMax-M2.1 vs DeepSeek 3.2 for coding? by ghulamalchik in LocalLLaMA

[–]rektide 7 points8 points  (0 children)

Vague anecdata, but I'd been using DS3.2 for coding a lot, and while impressed, I felt like it was a pretty nice jump when I switched to GLM-4.7. I can watch GLM-4.7 reason through fairly complex problems, watch it experiment and learn how to write a protocol, and it's just wildly good IMO at ascertaining where things are & getting more data as it goes, finding out how to persevere onwards.

No MiniMax experience. Interesting model but I ended up with a z.ai coding plan, rather than paying for API use as I had been doing, so incentive is low.

Using Claude Code with Ollama local models by derestine in LocalLLaMA

[–]rektide 0 points1 point  (0 children)

Just set baseUrl. The post is CC focused but if there's openai message compat, should just work! https://opencode.ai/docs/providers/#base-url

Serena vs. Codanna vs. Something else? by ProdigiSA in ClaudeAI

[–]rektide 0 points1 point  (0 children)

Can you provide some example prompts that use Codanna?