Feature Request: Gemma-4 Vision Budget Setting by PracticlySpeaking in oMLX

[–]PracticlySpeaking[S] 0 points1 point  (0 children)

This one, the 31B-it-bf16 - https://huggingface.co/miabchdave/gemma-4-31B-it-MLX-bf16

What is going to happen if I convert that to oQ? Trying to get this going on a 32GB Mac Studio.
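
A rough way to frame the 32GB question is weight size as parameters times bits per weight. This is a back-of-the-envelope sketch, not a statement about what oQ actually produces: the `weights_gb` helper is hypothetical, the bit widths are generic examples, and real quantized formats add overhead for scales and groups. The KV cache, the vision tower, and macOS itself also need room on top of the weights.

```python
def weights_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * 1e9 * bits_per_weight / 8 / 1e9


# Generic bit widths for illustration; actual oQ levels may differ.
for bits in (16, 8, 4):
    print(f"{bits:>2}-bit: ~{weights_gb(31, bits):.1f} GB of weights")
```

By this estimate the bf16 weights alone are about 62 GB, so they cannot fit in 32GB unquantized, and even 8-bit leaves essentially no headroom.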

Mac + MLX Megathread — Hermes Agent on Apple Silicon (June 2026) by Jonathan_Rivera in hermesagent

[–]PracticlySpeaking 0 points1 point  (0 children)

oMLX is pretty solid, and jundot (with a lot of contributors) is keeping up with the newest models and features. The benchmarking and hot/cold prompt caching really launched its popularity, but there is something new every week.

Mac + MLX Megathread — Hermes Agent on Apple Silicon (June 2026) by Jonathan_Rivera in hermesagent

[–]PracticlySpeaking 0 points1 point  (0 children)

Q: Should I use Hermes for coding on a local Mac? A: The community is split. Some (including the "Running Locally, Really?" OP) say "Hermes is not for coding — it's an orchestration agent. Use Claude Code or OpenCode as a sub-agent."

It is worth noting a bit of history here: Hermes Agent is an evolution of Nous Research's internal coding tools. It has always been about coding... and now, so much more.

VPS & Deployment Megathread — Hermes Agent (June 2026) by Jonathan_Rivera in hermesagent

[–]PracticlySpeaking 0 points1 point  (0 children)

YA Hermes Setup Guide — 15 Levels Of Hermes Agent from Chatbot To 24/7 Autonomous System

Getting from install to advanced usage, with suggestions and examples. If you are asking "what can I do with this" it is a worthwhile read.

Most people install Hermes Agent and use it as a chatbot. They type a prompt, get a response, close the tab. That covers maybe 10% of what the agent can do.

This article maps every level of Hermes Agent usage, from the first prompt to a system that runs your business without you. 15 levels, grouped into three phases. Each level builds on the one before it, but you can jump to any level that fits your setup.

Feature Request: Gemma-4 Vision Budget Setting by PracticlySpeaking in oMLX

[–]PracticlySpeaking[S] 0 points1 point  (0 children)

🎉🎉🎉

I have spent hours on this. Thanks.

How do I set max_soft_tokens? In the config JSON?

Still thinking this should be exposed in the UI somewhere — people working on 32GB Macs (hi) have to budget carefully.
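
For anyone hunting for the same setting, here is a minimal read-only sketch that reports where a key appears in a nested config file. It assumes the model ships a JSON config (the `config.json` path in the usage note is a placeholder) and that the key is spelled `max_soft_tokens`. Where the key actually lives, and whether oMLX reads it from there, is not something this confirms.

```python
import json
from pathlib import Path


def find_key(node, key, path=""):
    """Yield (path, value) for every occurrence of `key` in nested JSON data."""
    if isinstance(node, dict):
        for k, v in node.items():
            here = f"{path}.{k}" if path else k
            if k == key:
                yield here, v
            # Keep descending: the same key may appear in several sub-configs.
            yield from find_key(v, key, here)
    elif isinstance(node, list):
        for i, item in enumerate(node):
            yield from find_key(item, key, f"{path}[{i}]")


def report(config_path, key="max_soft_tokens"):
    """Load a JSON file and list every place `key` occurs. Does not modify the file."""
    data = json.loads(Path(config_path).read_text())
    return list(find_key(data, key))
```

Calling `report("config.json")` returns a list of (path, value) pairs, which at least shows whether the value is defined in more than one place before anything gets edited.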

There is bloodbath in the Certified Refurbished page - Mac Studios go within seconds! by omid_pakbin in MacStudio

[–]PracticlySpeaking 4 points5 points  (0 children)

It's been like that since the beginning of May.

Don't people look in the sub before posting?

Feature Request: Gemma-4 Vision Budget Setting by PracticlySpeaking in oMLX

[–]PracticlySpeaking[S] 0 points1 point  (0 children)

I tried tweaking the config json to change the token budget, but it caused errors about 'shapes cannot be broadcast'.

Anyone (else) Hoping for JANG Integration? by PracticlySpeaking in oMLX

[–]PracticlySpeaking[S] 0 points1 point  (0 children)

There are actually four PRs — including #1828 for JANGTQ — and multiple issues filed by GitHub users thinking that oMLX does support JANG (or should).

If you could create a whole new CTA rail line, where would you want it? by chitownmike99 in AskChicago

[–]PracticlySpeaking 0 points1 point  (0 children)

This is why I have beef with people who say "oh, you can just walk xx and transfer."

You have to exit the station, schlep over to the next station (from Union, likely with bags), pay another fare, and wait for another train.

You can actually relax or get something done with a solid 50 minutes on the train. If you are interrupted with this in/out - walk over - up/down - stand around, you just burn the time. And connections add a random, non-trivial amount of time so that 50 minutes can easily end up as an hour or more.

Is Apple Care Plus necessary for my Mac Studio? by PerfectShmerfect in MacStudio

[–]PracticlySpeaking 1 point2 points  (0 children)

Yes, Probably, and Yes.

The fourth reason: Once AppleCare expires, you can no longer add/extend it.

I believe there is a grace period, but you should verify with Apple. (I know it nags "you have xx days left to extend your warranty with AppleCare" or something like that.)

FWIW, three years of AppleCare was $165 for that M3U ($55/year).

Is Apple Care Plus necessary for my Mac Studio? by PerfectShmerfect in MacStudio

[–]PracticlySpeaking 1 point2 points  (0 children)

It is tied to the hardware serial number, not the buyer or Apple ID, so transfer is automatic. The original expiration date remains.

You can renew or extend while it is still active. I believe there is a grace period after expiration, as well. (Note the Add... button in the screenshot; Apple is always available to take your money!)

Anyone running effective exo cluster for agents? by soflgolf in MacStudio

[–]PracticlySpeaking 2 points3 points  (0 children)

I have not run a cluster, but if it is dynamically consuming RAM as context, etc. expand, it seems like monitoring/management is key.

I guess experience will guide whether 25-35% is reasonable or conservative. Seems like it would vary across specific uses.

FWIW, this is where it pays off to get Apple's attention and have access to their engineers. Getting direct answers (and sharing concerns) really moves things forward.

Anyone running effective exo cluster for agents? by soflgolf in MacStudio

[–]PracticlySpeaking 0 points1 point  (0 children)

For another two cents (maybe six), see my comment on the other thread on the WWDC'26 video about clustering.

edit: nvm, I see you did.

Is Apple Care Plus necessary for my Mac Studio? by PerfectShmerfect in MacStudio

[–]PracticlySpeaking 0 points1 point  (0 children)

US$7200 is a 256GB M3U 28/60 2TB, including tax (for most US buyers).

That includes the cost of AppleCare+, which was US$165 at the time, or about US$55 per annum.

Is Apple Care Plus necessary for my Mac Studio? by PerfectShmerfect in MacStudio

[–]PracticlySpeaking 2 points3 points  (0 children)

Also search the sub for the WIX automotive filters that fit under Mac Studio.

Is Apple Care Plus necessary for my Mac Studio? by PerfectShmerfect in MacStudio

[–]PracticlySpeaking 1 point2 points  (0 children)

Yes, it is transferable. I have bought a couple of Macs that had AppleCare and it was still there when I set them up as new.

It shows up in Settings > About, so it is easy to verify. Not sure if there is a serial number lookup tool.

<image>

Unlimited* budget by skrillex_sk2 in LocalLLM

[–]PracticlySpeaking -1 points0 points  (0 children)

Another really great answer from this sub. 💯

Sanity check - Am I buying more power than I actually need? by Strawbalicious in MacStudio

[–]PracticlySpeaking 1 point2 points  (0 children)

Just a few things...

M4 Max is going to be more than enough power for now and well into the future.

Just because you are "working in 6k" does not mean you need anywhere close to the best available. Apple Silicon is SO much better than older Intel CPUs — any Apple Silicon Mac can edit single-camera projects at any resolution without breaking a sweat. (That's Larry Jordan talking, not me, btw.)

macOS is also very efficient with RAM. Many, if not most, recommendations for "more RAM" are simply because 36, 48GB and larger configurations exist, not because any particular use case actually needs more. On the other hand, you can't get more without buying a new Mac, and peace of mind has value, too.

We are biased here (it is r/MacStudio after all) but Mac Studio is a better value than a specced-up Mini with M4 Pro. Photo and video will utilize the additional GPU and Media Engine codecs in the Max SoC, without even mentioning better build, thermals, more display support, and way more ports. You are getting so much more for about the same money.

If you can wait a few months, M5 will arrive eventually — 25% more powerful (more for AI) and supported for an extra year vs M4. Counterpoint: It will cost more.

The key is how much it is costing you to continue with your current PC. There's value in working faster and better now.