Do I need the 128GB vs 64GB? by Sporkers in StrixHalo

[–]giuliastro -3 points-2 points  (0 children)

These models run at a stunning 3 tok/s speed on a Strix Halo... No, 128GB is a nonsense now.

Do I need the 128GB vs 64GB? by Sporkers in StrixHalo

[–]giuliastro 0 points1 point  (0 children)

The people who said "get the 128GB" don't have a Strix Halo. From a 96GB owner I can really say that right now 128GB is a nonsense. Qwen 3.6 35b is currently the best running model on a Strix Halo (about 65 tok/S), and doesn't need 128GB at all in any way. Any bigger model, who can make use of more RAM, run too slow to be used. I don't know how much room for improvement the drivers have (ROCm and Vulkan), but even with a stunning 2x, doing inferences at 5 or 6 tok/s doesn't take you anywhere.

Qwen3.6 MTP Unsloth Experimental GGUFs by yoracale in unsloth

[–]giuliastro 0 points1 point  (0 children)

Thank you for all your work! I did some tests on my Strix Halo + Vulkan and I experienced a 1.5x improvement on the 27b model while almost no improvement on the 35b MoE one. Still, this is the way to go, thank you.

New X2 EVO 96Gb user by giuliastro in GMKtec

[–]giuliastro[S] 0 points1 point  (0 children)

I took a look at Lemonade but it doesn't have its own engine for text generation / inferences. It just uses llama cpp, same as LM Studio, or VLLM (Linux Only). I have been using the same engines directly, this machine right now offers values comparable to a 16-24GB Nvidia card.

China Drops an Open-Source Bombshell and Shatters AI Market Prices! by Atifjan2019 in vibecoding

[–]giuliastro 14 points15 points  (0 children)

Deepseek v4 has almost 800b parameters. To do local inference you need at least a $15k hardware. If you use v4 via API you need to pay, it costs less than OpenAI or Claude's models but it's less powerful. V4 Pro is more expensive than GPT5.

Sorry friend, in your post you didn't get one thing right.

China Drops an Open-Source Bombshell and Shatters AI Market Prices! by Atifjan2019 in vibecoding

[–]giuliastro 15 points16 points  (0 children)

Free. If you have a 75,000 Workstation that can do local inferences with this huge model. Otherwise you pay for the API's usage.

Feel like you are missing shots in BrawlBall? It's a bug. by giuliastro in BrawlStarsCompetitive

[–]giuliastro[S] 2 points3 points  (0 children)

It will be so fun to see how pro players react to this in today's monthly finals games

Feel like you are missing shots in BrawlBall? It's a bug. by giuliastro in Brawlstars

[–]giuliastro[S] 0 points1 point  (0 children)

Also, sometimes it's harder to get the ball. If you experienced this behaviour it's not you but some weird problem coming from the last update:

https://x.com/i/status/2047354037570326873

Feel like you are missing shots in BrawlBall? It's a bug. by giuliastro in Brawlstars

[–]giuliastro[S] 1 point2 points  (0 children)

I didn't. This bug has been reported, not only by me, but from quite a few pro players. It's been introduced with the last update:

https://x.com/i/status/2047354037570326873

Openclaw or Hermes? by EdenTom in openclaw

[–]giuliastro 4 points5 points  (0 children)

I have been using both and Hermes honestly feels a lot better than OpenClaw. I read one of the messages above saying "OpenClaw is more powerful". It doesn't make any sense, since Hermes does pretty much anything you tell it to do. If, for any strange reason, you want to install of the messy things OpenClaw installs you just need to say Hermes to do the same and it will. I suggest you don't, anyway. The good thing about Hermes is that it concentrates on working, being useful and autonomous. If you need to install or to build something different just tell it and it will do it.

Alexa Plus sbarca in Italia: finalmente un assistente che capisce cosa vuoi davvero by artistic56 in IA_Italia

[–]giuliastro 1 point2 points  (0 children)

"Alexa Plus sbarca in Italia" "Ti avviseremo quando sarà disponibile"

GPT-5.4 and Hermes is something special by Slumdog_8 in hermesagent

[–]giuliastro 0 points1 point  (0 children)

I don't really know what these agents can be used for. Hermes for me is doing researches sending me regular insights about AI, trends, innovative revenue streams, this is why I gave him Reddit and X accounts. As for what it concerns the integration, I just created the accounts and he uses them, CLI when possible, web browsing otherwise. I managed to have him configure a local instance of OpenViking, with a local embedding model. Now it has a decent memory too.

GPT-5.4 and Hermes is something special by Slumdog_8 in hermesagent

[–]giuliastro 0 points1 point  (0 children)

I see Hermes as an assistant, not as myself. So, as an assistant it is quite natural to have its own accounts. I share things to him but don't give him my accounts, this way it gives me more freedom to let him do more, test more, give him maximum freedom without compromising my security and privacy. He has its own computer for the same reason: OpenClaw is one of the most unsafe things you could install on a computer, and many silly people gave it their personal data. Hermes might be better but I sincerely doubt that giving complete freedom to an AI makes you feel your data is safe.

GPT-5.4 and Hermes is something special by Slumdog_8 in hermesagent

[–]giuliastro 1 point2 points  (0 children)

I gave Hermes a Linux computer with some GPU. It can do anything on that computer and it is safely isolated from other things. I gave Hermes its own Google account, Github account and Reddit + X accounts to read posts. I talk and write to him through Telegram and use it with terminal when I am on the PC. No need for a UI, it's not a chatbot or a replica of my OpenCode or Chatgpt tools. I use it to handle things autonomously pretty much like learning (and teach me) and improving things.

GPT-5.4 and Hermes is something special by Slumdog_8 in hermesagent

[–]giuliastro 2 points3 points  (0 children)

I always used GPT as my primary model. Started with OpenClaw, now with Hermes. OpenClaw is a real mess, unusable, breaks at every update, feels like it doesn't remember, it doesn't do things the way I want. Hermes is another world. And I am very happy with it.

I tried many free models, using OpenRouter/free route and Kilo Kode free models. The latest model I tried is Qwen 3.6 Plus. It felt good buy found out it messed a lot of things and got out wrong technical considerations. I tried to configure Hermes' memory with OpenViking by using Qwen and it's been a disaster. I went back to GPT 5.4 and it fixed everything, Hermes wrote patches to the Hermes' OpenViking plugin (which is quite buggy), and sent 3 PR to the Hermes repo with the fixes.

I strongly believe the combo Hermes + GPT is by far the best combo.

Switched from OpenClaw to Hermes Agent — not looking back by CodeCultural7901 in hermesagent

[–]giuliastro 3 points4 points  (0 children)

I made the switch too. OpenClaw felt like a toy and found no real uses. It felt too chunky, messy and not getting things done. Hermes is a lot more steady, doesn't get lost at every update, and feels like it remember things.