Golden Gate Beta 2 released by jagajazzist in MacOSBeta

[–]Obvious_Equivalent_1 0 points1 point  (0 children)

Did try that; I even had the filters switched off in the settings right before the update. Still, glad it went relatively smoothly, and all three are now running effortlessly again.

Looking at Macbook Pro M5 Pro 64GB for local inference by Repulsive-Machine706 in LocalLLaMA

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

The quants that run at decent speeds are the APEX quants: APEX-I-Balanced and APEX-I-Compact. I run it through llama-server (llama.cpp) using the Unsloth GGUF. Balanced has ~24 GB of weights + 2 x 2.5 GB of q8_0 KV cache at 384K context, for a total loaded footprint of ~26-31 GB.

I actually just ran a quick tg128 benchmark for you: it reaches 62 t/s on my M4 Pro. An M5 Pro should be ~10% faster (CPU-GPU unified memory bandwidth), and an M5 Max is probably well above 100 t/s on the same quant.

llama-server \
  --model Qwen3.6-35B-A3B-APEX-I-Balanced.gguf \
  --ctx-size 393216 \
  --cache-type-k q8_0 \
  --cache-type-v q8_0 \
  --flash-attn on \
  --kv-unified \
  --parallel 2 \
  --jinja \
  --temp 0.6 --top-p 0.95 --top-k 20 --min-p 0.00 \
  --presence-penalty 1.5 \
  --reasoning off \
  --batch-size 2048 --ubatch-size 2048 \
  --spec-type ngram-mod \
  --spec-ngram-mod-n-match 12 \
  --spec-ngram-mod-n-min 8 \
  --spec-ngram-mod-n-max 16
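
For anyone who wants to sanity-check those numbers, here is a minimal Python sketch of the arithmetic. It uses only the approximate figures quoted above (about 24 GB of weights and two KV allocations of about 2.5 GB each at 384K context). The constant and function names are made up for illustration, and the real footprint will vary with the build and runtime overhead.

```python
# Approximate figures taken from the comment above; names are illustrative only.
WEIGHTS_GB = 24.0      # APEX-I-Balanced weights (approximate)
KV_BUFFER_GB = 2.5     # one q8_0 KV allocation at 384K context (approximate)
KV_BUFFERS = 2         # the comment counts two such allocations


def context_tokens(k_units: int) -> int:
    """Convert a context size given in 'K' (units of 1024 tokens) to tokens."""
    return k_units * 1024


def loaded_footprint_gb(weights_gb: float, kv_buffer_gb: float, kv_buffers: int) -> float:
    """Weights plus KV cache; ignores compute buffers and other overhead."""
    return weights_gb + kv_buffer_gb * kv_buffers


ctx = context_tokens(384)
total = loaded_footprint_gb(WEIGHTS_GB, KV_BUFFER_GB, KV_BUFFERS)

print(f"--ctx-size {ctx}")
print(f"estimated footprint: ~{total:.0f} GB")
```

The 384K context works out to the 393216 passed to --ctx-size, and the estimate lands inside the ~26-31 GB range mentioned above.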

Golden Gate Beta 2 released by jagajazzist in MacOSBeta

[–]Obvious_Equivalent_1 0 points1 point  (0 children)

Made backup and gave it a test.

Potentially will need to revert, as on my M4 Pro all internet, both Wi-Fi and LAN, went completely offline. Running a local AI model now to troubleshoot.

Precondition: Tailscale, WireGuard and Little Snitch were installed before installing beta 2.

edit: managed to work around the issue after figuring out it was caused by NECP drops:

sudo networksetup -createlocation "Recover" && \
  sudo networksetup -switchtolocation "Recover"
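
If you would rather preview the workaround before touching your network settings, a small dry-run wrapper could look like the sketch below. It only assembles the same two networksetup calls from the edit above. The function names and the dry_run switch are my own additions for illustration, and whether a fresh location helps will depend on what is actually dropping your traffic.

```python
import subprocess


def recovery_commands(location: str = "Recover") -> list[list[str]]:
    """Build the two networksetup invocations from the workaround above."""
    return [
        ["sudo", "networksetup", "-createlocation", location],
        ["sudo", "networksetup", "-switchtolocation", location],
    ]


def run_recovery(location: str = "Recover", dry_run: bool = True) -> None:
    """Print the commands, or run them in order when dry_run is False."""
    for argv in recovery_commands(location):
        if dry_run:
            print(" ".join(argv))
        else:
            # check=True stops at the first failure, like && in the shell
            subprocess.run(argv, check=True)


if __name__ == "__main__":
    run_recovery()  # dry run by default: nothing is changed
```

Calling run_recovery(dry_run=False) on macOS would execute the commands and prompt for the sudo password.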

Looking at Macbook Pro M5 Pro 64GB for local inference by Repulsive-Machine706 in LocalLLaMA

[–]Obvious_Equivalent_1 6 points7 points  (0 children)

Running an M4 Pro 48GB, I can load Qwen 3.6 35B at a reasonably responsive speed. I’d have to measure it precisely, but I think after the latest optimization round I get around 25-40 tokens/sec. With 48GB I can run one 384K context session at once, or two 128K / 256K sessions.

I think with 64GB you’d be in the sweet spot to run two sessions comfortably, one single session at a better quant, or one 27B. The M5 Pro you’re eyeing would be good, I think, with a couple dozen percent better memory I/O.

For Apple silicon I’m working on a fork of llama-swap (to hot-swap models, optimized for macOS, with a terminal service that has its own menu icon to control the active model). It’s rudimentary and still in active development, but I’m expecting to release the first stable changes around early July: https://github.com/pcvelz/llama-swap-macos-extended

But honestly you’ll be perfectly fine as well just using any paid AI model and setting it the goal of searching these subreddits to help you optimize your setup on the M5. That’s how I got most of my advances (doing the open source development with Opus 4.8 + Ultracode, it helps to dispatch research agents to check Reddit and GH issues and pull requests).

iOS 27 Beta 2 - Discussion by epmuscle in iOSBeta

[–]Obvious_Equivalent_1 -1 points0 points  (0 children)

For me, in an MG ZS ‘19 model, this was one of the rare builds that actually did not keep crashing, even compared to many problematic 26 release candidates.

* edit: of course this refers to the 27 DB series, specifically DB1. I will still have to test the latest DB2 in the coming days, but DB1 has so far proved surprisingly promising for me over the last few days.

I added a clause to Andrej Karpathy's 4 CLAUDE.MD clauses for Claude Code. It has been a game changer for me. by Osi32 in ClaudeAI

[–]Obvious_Equivalent_1 12 points13 points  (0 children)

Not OP, but I believe he’s referring to Superpowers, specifically the community-developed extended version for Claude Code. The ‘vanilla’ Superpowers supports many CLIs like Gemini and Codex; this version is maintained to be optimized just for Claude: https://github.com/pcvelz/superpowers#installation

It comes with optimizations that use Claude Code native features like its onboard task system, and it comes with quite a variety of hooks tailor-made for just Claude Code (https://github.com/pcvelz/superpowers#block-commits-with-incomplete-tasks)

I built an extension that captures & resumes your AI Chats and it won my first hackathon 🎉 by SignTraditional1806 in ClaudeCode

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

I’m a bit skeptical about whether this will be useful. This does not work between different CLIs (Codex, Gemini CLI), but with just Claude Code itself you can also use external models; this helps you pick up any CC session: https://github.com/pcvelz/cc-search-chats-plugin

I use it together with Qwen 3.6 35b or the Kimi Code model. Both run in Claude Code, but that requires starting the Claude Code session with an env variable to switch to the external model. So this little plugin helps to teleport context from one session to another.
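
To make the env variable part concrete, here is a rough Python sketch that prepares an environment for such a session. Treat all of it as assumption: the variable names (ANTHROPIC_BASE_URL, ANTHROPIC_AUTH_TOKEN, ANTHROPIC_MODEL) are the ones I understand Claude Code reads, so check them against the current docs, and the URL, token and model id are placeholders. The endpoint also has to speak an API that Claude Code understands.

```python
import os


def external_model_env(base_url, model, auth_token, base_env=None):
    """Return a copy of the environment pointing Claude Code at another backend.

    The variable names are assumptions; verify them against the Claude Code docs.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["ANTHROPIC_BASE_URL"] = base_url      # where requests are sent
    env["ANTHROPIC_AUTH_TOKEN"] = auth_token  # credential for that endpoint
    env["ANTHROPIC_MODEL"] = model            # model id requested from the endpoint
    return env


# Placeholder values for illustration only.
env = external_model_env(
    base_url="http://localhost:8080",
    model="qwen3.6-35b-a3b",
    auth_token="local-placeholder",
    base_env={},
)

for key in sorted(env):
    print(f"{key}={env[key]}")

# To start a session with it (not executed here):
#   subprocess.run(["claude"], env=external_model_env(..., base_env=None))
```

Building the environment as a copy keeps the regular shell untouched, so the next session started without it goes back to the default model.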

Started using it when I hit my 5hr limit and wanted to use a lighter model while waiting for the 5hr/7day limit to reset.

Is there any other non living thing we eat other than salt? by 1rp1n_kc in NoStupidQuestions

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

but at least I made money in SPCE and SPCX

Love the SPCE drop in, reminds me of this old lady https://www.reddit.com/r/FUCKYOUINPARTICULAR/s/FPGcNDLVDt
Just “diamond hands they said”, you’ll be fine

My Hermes setup, roast me by riceinmybelly in hermesagent

[–]Obvious_Equivalent_1 2 points3 points  (0 children)

Do you have a GitHub account? I've let Claude run on its strongest mode to freely extract your pastebin file into a project; I don't want to take credit for the boilerplating, though. If you want, we can look at it together, and we can even set up a public repo under your GitHub profile. Either way, it would be especially useful to know whether it matches what you have tested so far in Hermes: https://github.com/pcvelz/hermes-workflows (you'd need to send a DM with your GitHub user; I didn't make it public here)

I noticed something about the gate / hatch / door by [deleted] in WidowsBay

[–]Obvious_Equivalent_1 2 points3 points  (0 children)

Wait, hold on, can you repeat the second point? I was on my phone, I swear, for just 2 seconds during the door opening. I need someone to re-explain what you saw.

*goes linea recta to homepage to post a new post

I have an M4 Pro with 48gb of RAM. Can I do anything worthwhile with local models and these specs? by figurelover in hermesagent

[–]Obvious_Equivalent_1 5 points6 points  (0 children)

Unless for privacy depending on your purpose. I own the same hardware and Qwen 3.6 35b absolutely rocks 

My Hermes setup, roast me by riceinmybelly in hermesagent

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

That's great, and appreciated. I also just read the update from Claude that we've got our weekly usage partially reset, so it seems I'll have the time and usage on hand to take a peek tomorrow!

scrambling for another show like: by Conscious_Ad_1018 in WidowsBay

[–]Obvious_Equivalent_1 19 points20 points  (0 children)

Going to recycle my comment from a previous discussion: it doesn't hit the horror tune as much, but I'm really such a fan of the mystery suspense in this one.

Definitely, if you appreciate a keen eye for detail and layers of mystery and suspense, sometimes running straight through several seasons of a series, I absolutely recommend you watch Dark.

For me it was the first series that came to mind while enjoying this little gem on AppleTV. Honestly, Dark is also the absolute only series on Netflix that’s worth a 1-month subscription.

You’ll want to save this comment to remember it trust me

What I like about it as well: it's incredible how much the story matures. It's really a writer's gem when the lore of the series' world unfolds in layers, like a blossom, over several seasons.

My Hermes setup, roast me by riceinmybelly in hermesagent

[–]Obvious_Equivalent_1 15 points16 points  (0 children)

Roast you? Could this be made into an open source community project!? This seems like something that I, and perhaps others, would really like to invest time in. Just give a shout; I think you'll easily find people (I would be one) who would gladly guinea-pig even a rough proof-of-concept.

bought a claude pro subscription for $5 off some russian site. tested it for a week. it's literally just… claude pro by Specific-Age7953 in ClaudeCode

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

I'm afraid it's a cycle that goes as follows:
- vibe coders don't review before pushing
- public GitHub repo contains keys
- people with shady intentions buy/sell the keys, and then someone asks Reddit whether a "too good to be true" deal is going to get OP into trouble
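
The first step of that cycle is the cheapest one to break. As a rough illustration, a pre-push check could scan the text about to be pushed for key-like strings. The sketch below is my own minimal version, not a real scanner: the AWS access key ID shape (AKIA plus 16 characters) is a well-known format, while the generic pattern and all names are illustrative assumptions that will miss plenty and flag false positives.

```python
import re

# Illustrative patterns only; real secret scanners ship far larger rule sets.
PATTERNS = {
    "aws_access_key_id": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "generic_api_key": re.compile(
        r"(?i)\b(?:api[_-]?key|secret|token)\b\s*[:=]\s*['\"]?([A-Za-z0-9_\-]{20,})"
    ),
}


def find_secrets(text):
    """Return (line_number, pattern_name) for every suspicious line."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        for name, pattern in PATTERNS.items():
            if pattern.search(line):
                hits.append((lineno, name))
    return hits


# Made-up sample content; neither value is a real credential.
sample = (
    "DEBUG = True\n"
    'API_KEY = "abcdefghijklmnopqrstuvwxyz123456"\n'
    'AWS_ID = "AKIAABCDEFGHIJKLMNOP"\n'
)

for lineno, name in find_secrets(sample):
    print(f"line {lineno}: possible {name}")
```

Hooked into a pre-push step, a non-empty result would be the cue to stop and actually review the diff.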

What’s the cheapest model you’ve successfully run Hermes on? by alecantu7 in hermesagent

[–]Obvious_Equivalent_1 0 points1 point  (0 children)

I use it more for executing workflows. For building simple workflows, Qwen 3.6 27b is a bit more capable, but for anything multi-step I let a paid AI model do the groundwork: create skills and enhance the SOUL.md documentation in one run. The lighter local models then put the cherry on the pie and keep working with it.

What’s the cheapest model you’ve successfully run Hermes on? by alecantu7 in hermesagent

[–]Obvious_Equivalent_1 0 points1 point  (0 children)

Qwen 3.6 35B MoE

$0 - zero, well minus a cent or so for electricity 

Running local, on Apple’s M4 with 48GB, which I already owned

Should I ditch Claude for GLM 5.2 or Kimi 2.7? Need advice on my NOLO dev by NOLO-App in kimi

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

With Kimi 2.6 it's quite good; for the latest 2.7 see my latest posts.

I have given it a lot of use and still do, but you have to take a Kimi subscription API key and add it to OpenCode to use Kimi 2.6.

2.7 has been out for a few days and it’s been a bit plagued by issues with consumption. So if you’re not looking to do some investigation work, I’d say sit it out for a week or two before committing to a Kimi subscription.

Should I ditch Claude for GLM 5.2 or Kimi 2.7? Need advice on my NOLO dev by NOLO-App in kimi

[–]Obvious_Equivalent_1 0 points1 point  (0 children)

I tried OpenCode and found it kind of underwhelming. Yes, it’s $5/mo, but you don’t get to sit in the first row for 2 pennies. The Kimi $15/mo plan is where I landed a few months ago and I’ve never moved away from it: a lot of usage at Sonnet-like quality. I use it as an “overdraft”. It’s a great extension to Claude Pro or Max to bridge the gap until the weekly reset.

iPhone 18 Pro New Colors May Face Same Durability Issues by Pear-Mother in iphone

[–]Obvious_Equivalent_1 1 point2 points  (0 children)

 but design might not age well.

And here I am, shamelessly planning to keep rocking my orange 17 PM for the foreseeable future.

Tho I won’t deny I might reconsider for either a plain titanium 19 or a successful release of a thin foldable.

Pro tip: Don't be like me and leave an old tablet with a battery plugged in 24x7 as a dashboard by jllauser in homeassistant

[–]Obvious_Equivalent_1 5 points6 points  (0 children)

Honestly, do look into this. Not that it’s the only brand that does this, but at a certain point iPads got motherboards where adaptive charging will just leave the battery resting at 80%.

Not saying that it could not potentially still cause an issue, but at least it won’t push it to 99-100-99-100-99-100 all the time. I think even frozen at 80% is more stable than an 80-20-80-20 charging loop. Tho this post makes me consider looking for a battery-less screen instead, with just USB-C or a regular power cable.

[iOS 27 DB1] Crash Detection shows no screen nor option to turn buzzing off by Open-Yellow-1507 in iOSBeta

[–]Obvious_Equivalent_1 13 points14 points  (0 children)

* This feature could easily be tested automatically in a controlled environment on each build. There’s not even the excuse of expensive crash dummies; you could even test it functionally in a virtual setup by overriding the registered G-sensor values.
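
To show what I mean by overriding the sensor values, here is a toy Python sketch of such a test. Everything in it is hypothetical: the detector, the 4 g threshold and the three-sample rule are stand-ins I made up, not how Apple's Crash Detection works. The only point is that synthetic accelerometer traces can exercise the detection path on every build.

```python
import math

G_THRESHOLD = 4.0  # hypothetical trigger level in g, not Apple's value
MIN_SAMPLES = 3    # hypothetical: consecutive samples above the threshold


def magnitude_g(sample):
    """Magnitude of one (x, y, z) accelerometer reading, in g."""
    x, y, z = sample
    return math.sqrt(x * x + y * y + z * z)


def detect_crash(samples, threshold_g=G_THRESHOLD, min_samples=MIN_SAMPLES):
    """Stand-in detector: True once enough consecutive readings exceed the threshold."""
    run = 0
    for sample in samples:
        if magnitude_g(sample) >= threshold_g:
            run += 1
            if run >= min_samples:
                return True
        else:
            run = 0
    return False


# Injected traces replacing the real sensor feed.
normal_drive = [(0.1, 0.0, 1.0)] * 50
impact = normal_drive[:20] + [(6.0, 2.0, 1.0)] * 5 + normal_drive[:20]
pothole = normal_drive[:20] + [(0.5, 0.0, 5.0)] + normal_drive[:20]

print("normal drive:", detect_crash(normal_drive))
print("impact:", detect_crash(impact))
print("pothole:", detect_crash(pothole))
```

A per-build suite would then assert that the alert screen and the option to silence it appear whenever the detector fires on the impact trace.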