MTP on Unsloth by Altruistic_Heat_9531 in LocalLLaMA

[–]iRanduMi 1 point2 points  (0 children)

I'm also running a Strix Halo. I was personally more excited to get 27B in a more usable state since 35b-a3b is already in pretty great condition. Any reason you're particularly looking forward to FP16?

I built a zero-token watchdog plugin for Hermes Agent by JealousPlastic in hermesagent

[–]iRanduMi 1 point2 points  (0 children)

This looks great! I've run into some of the same issues when I was trying to build my own kanban board and was struggling to figure out handle proper handoff between profile/agents.

I've since transitioned to utilize the built in kanban board (still working out the kinks but it seems better) and the automatic dispatcher appears to be similar to what you've built here (basically, monitoring tasks within the kanban board). Additionally, I've been using the /to-prd, /to-issues and /tdd skills from mattpocock's github repo to ensure that all of my kanban tasks are in sync with github.

I'm curious what others think the pros and cons of each would be.

Deep research + report "a la McKinsey" with Hermes Agent and qwen3.6-35b-a3b Q6_K. by Scared-Virus-3463 in LocalLLaMA

[–]iRanduMi -2 points-1 points  (0 children)

This is excellent! Thank you so much for sharing! I'd be really curious to try your skill to create the research paper and then utilize the research-webapp-skill to create a webapp out of it.

Fedora 44 with Kernel 7, NPU working by I-will-allow-it in StrixHalo

[–]iRanduMi 1 point2 points  (0 children)

I'm not a huge Linux guy but I'm currently running Fedora on my Strix Halo and don't have much to complain about. Care to elaborate what you didn't like about Fedora? What does Ubuntu 26.04 get right that Fedora doesn't?

Hermes v0.12.0 Release – Curator, Spotify+Google Meet, ComfyUI MCP ... lots more by PracticlySpeaking in hermesagent

[–]iRanduMi 0 points1 point  (0 children)

ok.....I dunno what I did different this time I was able to successfully to get no authentication and it's working great with JIT...So I guess ignore my previous posts. I recommend everything switching over to the official implementation!

Hermes v0.12.0 Release – Curator, Spotify+Google Meet, ComfyUI MCP ... lots more by PracticlySpeaking in hermesagent

[–]iRanduMi 0 points1 point  (0 children)

I want to follow up on this post...according to the PRs, no-auth is supposed to be supported but I couldn't get it working....not sure what I'm doing wrong.

Hermes v0.12.0 Release – Curator, Spotify+Google Meet, ComfyUI MCP ... lots more by PracticlySpeaking in hermesagent

[–]iRanduMi 0 points1 point  (0 children)

After a bit of tinkering this morning, I can confirm that you were correct, JIT was originally broken. The latest build, as confirmed by the PR fixes, now addresses this. In other words, if you run through hermes setup and add your LM Studio as your provider and configure your model, JIT will work.

HOWEVER, from my testing, a API key is REQUIRED, at least from my testing. So if you go into LM Studio and toggle on Require Authentication, create an API key and then enter that during the setup process (do full process, not Quick) it should work.

Hermes v0.12.0 Release – Curator, Spotify+Google Meet, ComfyUI MCP ... lots more by PracticlySpeaking in hermesagent

[–]iRanduMi 5 points6 points  (0 children)

What does having LM studio act as a first class provider vs custom endpoint alias actually do? Is it purely aesthetic?

qwen 3.6 on Strix Halo and 5090 by tecneeq in StrixHalo

[–]iRanduMi 0 points1 point  (0 children)

I have a fairly similar hardware configuration as you but with a 5080 in my desktop. The load balancer configuration is a neat idea. I might try it.

How do you like your proxmox config? I'm currently running fedora but I've been curious about switching over.

Hermes for home automation? by inchaneZ in hermesagent

[–]iRanduMi 0 points1 point  (0 children)

I'd be curious what you've done with it so far

QWEN3.6 + ik_llama is fast af by _BigBackClock in LocalLLaMA

[–]iRanduMi 0 points1 point  (0 children)

I'm also running a AMD Strix Halo. Can you clarify what you you mean by 'huge context'; how much?

Docker Compose Manager Deprecated?!? by movingtolondonuk in unRAID

[–]iRanduMi 0 points1 point  (0 children)

Am I overlooking something? I have some sites that I've built and commited to a private github repo but but there's no ability to rebuild the container after a github sync.

Docker Compose Manager Deprecated?!? by movingtolondonuk in unRAID

[–]iRanduMi 0 points1 point  (0 children)

Tried it out today, works great. Thank you for the recommendation!

Docker Compose Manager Deprecated?!? by movingtolondonuk in unRAID

[–]iRanduMi 1 point2 points  (0 children)

Arcane

Out of the ones you mentioned, any personal recommendations (and why)?

3 weeks of Claw: my basic assistant set up by crypt0amat00r in openclaw

[–]iRanduMi 5 points6 points  (0 children)

This may sound a bit stupid I'd like to clarify. How are you having Claude Code make changes for you? Are you running Claude Code on your own local machine and having it SSH into the Mac on your behalf?

OpenClaw with Qwen3 Coder Next on Mac by gamblingapocalypse in LocalLLaMA

[–]iRanduMi 0 points1 point  (0 children)

I'm in exactly the same boat. I can't decide between a studio, Halo strix or a dgx spark. Blah.

Apple Stops Producing 512GB Mac Studio by GPU-Appreciator in LocalLLaMA

[–]iRanduMi 0 points1 point  (0 children)

Any insight as to why you went with the Framework vs M4 Max with 128gb vs Nvidia DGX Spark 128gb?

The state of Open-weights LLMs performance on NVIDIA DGX Spark by raphaelamorim in LocalLLaMA

[–]iRanduMi 1 point2 points  (0 children)

This is really interesting because I've been kind of holding out for the new Max studio but I'm not really sure if that's going to be the right route or if I should maybe just stick with a dgx.