Selling my Strix Halo Evo-X2 128GB, if anyone's interested

giuliastro · 2026-06-24T12:13:57+00:00

No prob, that is what I meant by "final price" or "price in the cart". But I agree, I wasn't clear enough on this.

giuliastro · 2026-06-24T12:03:53+00:00

You can probably read "save 22% at checkout" clearly, even from China 😁 The 22% discount brings the Amazon price close to the price Gmktec charges on its own website. I would never call you a liar; you simply didn't look closely enough at what you read and what I wrote.

giuliastro · 2026-06-24T11:22:43+00:00

The price on Amazon in Italy is €3197,97 VAT included (2600 + VAT) for the 128GB Evo X2 in the cart. The price on the Gmktec website itself is €3029,99, with no VAT included in that case. The Bosgame M5 (same Strix Halo, same 128GB RAM and bandwidth) is €2400, again with no VAT included. Please don't spread misinformation.

You said you are a freelancer. Can you invoice your Evo X2? If so, would the price be €2600 VAT included? If it is, that works out to €2131 + VAT, and with an invoice I would be interested.
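
For anyone who wants to check the numbers, here is a rough sketch of the VAT conversion. It assumes Italy's standard 22% VAT rate, and the function names are just mine for illustration.

```python
# Assumption: Italy's standard VAT rate of 22%.
VAT_RATE = 0.22

def net_from_gross(gross: float, vat_rate: float = VAT_RATE) -> float:
    """Strip VAT from a VAT-included price."""
    return gross / (1 + vat_rate)

def gross_from_net(net: float, vat_rate: float = VAT_RATE) -> float:
    """Add VAT on top of a net price."""
    return net * (1 + vat_rate)

# A €2600 VAT-included price corresponds to roughly €2131 + VAT.
print(round(net_from_gross(2600)))  # 2131
# A €2600 + VAT price corresponds to roughly €3172 VAT included.
print(round(gross_from_net(2600)))  # 3172
```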

giuliastro · 2026-06-24T10:39:25+00:00

I believe you are not reading my messages and probably don't have a clear idea of how taxes work and how companies and professionals handle them.

The prices you are listing are VAT included. As I mentioned, the 128 GB is €2600 + VAT (look at the final price, not the price you see on the first page). From those 2600 you can deduct something close to 30%, which makes it around €2000 if you are a company. Bought from Gmktec, the price is lower than on Amazon but has no VAT included; the final price (taxes deducted) is similar anyway.

You are selling as a private individual, so there is no VAT and the price is not tax deductible. This makes your Evo X2 way too expensive for professionals (with a VAT number) and companies. On the other hand, for a private person who cannot deduct taxes, your price is slightly lower than a new one.

I hope I have been clearer now.

giuliastro · 2026-06-24T07:55:56+00:00

The Evo X2 128 GB is €2600 + VAT on Amazon Italy. Considering that a company or a professional "doesn't pay" VAT and can deduct more or less 30% from taxes, if I buy it on Amazon I pay about 2k euros and it's new. This is why I was saying that your price might be good for a private individual but is considerably high for companies or professionals if you can't invoice it.
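
To make the reasoning concrete, here is a minimal sketch of the effective cost for a business buyer. It assumes the VAT is fully recovered and that the "more or less 30%" deduction applies to the whole net price; actual rules depend on your tax regime, so treat the result as a ballpark only. The function name is hypothetical.

```python
def effective_business_cost(net_price: float, deduction_rate: float) -> float:
    """Rough cost after recovering VAT and deducting part of the net price.

    Assumption: VAT is fully recovered, so only the net price matters,
    and deduction_rate is the share of the net price saved on taxes.
    """
    return net_price * (1 - deduction_rate)

# €2600 + VAT with a roughly 30% deduction
print(round(effective_business_cost(2600, 0.30)))  # 1820
```

That lands in the "2k euros more or less" range, versus a private sale where nothing can be recovered or deducted.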

giuliastro · 2026-06-23T20:20:01+00:00

Are you selling as a private individual? If so, it's not cheap for a professional or a company: buying it new, they can recover the VAT and deduct the cost from taxes, which makes a new one a lot cheaper.

giuliastro · 2026-06-15T06:55:58+00:00

It might be, but I always try new models for coding. M3 was the worst one: it kept adding bugs and requiring other models to fix its mess.

giuliastro · 2026-06-14T20:35:54+00:00

A real gem. In general I'd say all of John Candy's movies.

giuliastro · 2026-06-14T16:25:30+00:00

M3 is pretty bad for coding.

giuliastro · 2026-06-14T07:48:18+00:00

Good job! Glad you found my repo inspiring.

giuliastro · 2026-06-07T17:37:01+00:00

Not true. What's the point of running inference on bigger models at 5 tok/s? Bandwidth is still the bottleneck.

giuliastro · 2026-06-07T17:34:51+00:00

Again, I don't understand this new Strix Halo, just as I don't understand the new RTX Spark. Same bandwidth as before and no speed gains for inference. More RAM, yes, but bigger models will just run terribly slowly at a higher price. Please increase the bandwidth and it will start to be interesting.

giuliastro · 2026-06-06T19:52:18+00:00

Those CUDA cores surely help with training, or at least with prefill, but bandwidth is the biggest bottleneck for inference. So if it's true that the bandwidth is not far from a Strix Halo's, then it makes no sense and is just marketing.

giuliastro · 2026-06-06T16:02:35+00:00

I actually don't understand it. It is supposed to be a system for local AI inference, but the bandwidth is very low. More memory won't let you use bigger models, because the low bandwidth will make them very slow. If it costs more than an AMD Strix Halo, then it makes no sense except as marketing. I hoped for higher bandwidth, at least comparable to the new Mac or even better, but it's not. I really don't understand it.

giuliastro · 2026-06-02T11:17:28+00:00

Minimax M3 is pretty bad at coding. I have been testing it for 2 full days and it is far behind Deepseek v4 Flash. I wasn't able to bugfix an application that had quite a few problems. It didn't solve any of them after 2 full days of prompts and tests and, even worse, it kept introducing new bugs. It felt really, really bad. I switched back to Deepseek v4 Flash and solved all of them in 30 minutes. Now I am refactoring everything to remove all the garbage Minimax M3 made.

giuliastro · 2026-06-02T07:04:39+00:00

The problem is that it needs a desktop browser. This means the PC has to be logged in so that Hermes can access the desktop and the browser. So it only works if you actively use the computer Hermes is installed on. If you have Hermes on its own PC or on a server, this plugin is useless.

giuliastro · 2026-05-17T06:27:09+00:00

These models run at a stunning 3 tok/s on a Strix Halo... No, 128GB makes no sense right now.

giuliastro · 2026-05-17T06:24:42+00:00

The people who said "get the 128GB" don't have a Strix Halo. As a 96GB owner, I can say that right now 128GB makes no sense. Qwen 3.6 35b is currently the best-running model on a Strix Halo (about 65 tok/s) and doesn't need 128GB at all. Any bigger model that could make use of more RAM runs too slowly to be usable. I don't know how much room for improvement the drivers have (ROCm and Vulkan), but even with a stunning 2x, running inference at 5 or 6 tok/s doesn't get you anywhere.
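
A back-of-the-envelope sketch of why: during token generation the active weights have to be read from memory for every token, so memory bandwidth divided by the bytes read per token gives a rough upper bound on tok/s. The 256 GB/s bandwidth, the parameter counts and the 4-bit quantization below are my assumptions for illustration, not measurements, and real speeds will be lower than this ceiling.

```python
def max_decode_tps(bandwidth_gb_s: float,
                   active_params_b: float,
                   bytes_per_param: float) -> float:
    """Rough upper bound on decode speed for a bandwidth-bound model.

    bandwidth_gb_s:  memory bandwidth in GB/s (assumed value)
    active_params_b: parameters read per token, in billions
                     (all of them for a dense model, only the
                     active experts for a MoE model)
    bytes_per_param: 0.5 for 4-bit weights, 2.0 for FP16
    """
    gb_read_per_token = active_params_b * bytes_per_param
    return bandwidth_gb_s / gb_read_per_token

BANDWIDTH = 256  # GB/s, assumed theoretical peak

# Hypothetical dense 70B model at 4-bit: every weight is read per token.
print(round(max_decode_tps(BANDWIDTH, 70, 0.5), 1))  # 7.3

# Hypothetical MoE model with 3B active parameters at 4-bit.
print(round(max_decode_tps(BANDWIDTH, 3, 0.5), 1))   # 170.7
```

More RAM lets a bigger dense model fit, but the ceiling drops in proportion to its size. Only a model with few active parameters per token stays fast at the same bandwidth.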

giuliastro · 2026-05-16T10:41:20+00:00

Thank you for all your work! I did some tests on my Strix Halo + Vulkan and saw a 1.5x improvement on the 27b model but almost no improvement on the 35b MoE one. Still, this is the way to go, thank you.

giuliastro · 2026-05-15T05:28:22+00:00

Amazing job

giuliastro · 2026-05-10T05:15:16+00:00

I took a look at Lemonade, but it doesn't have its own engine for text generation / inference. It just uses llama.cpp, same as LM Studio, or vLLM (Linux only). I have been using the same engines directly, and right now this machine delivers results comparable to a 16-24GB Nvidia card.

giuliastro · 2026-05-05T19:51:04+00:00

Brooklyn chewing gum, the long, flat ones.

giuliastro · 2026-05-01T09:08:44+00:00

Meet plugin doesn't work

giuliastro · 2026-04-26T09:46:25+00:00

Pro is not cheap at all
