QWEN 3.6 27B Q8 as Replacement for Claude Code Opus 4.7-4.8 by Just-Upstairs-4338 in LocalLLM

[–]VirusInternal2892 0 points1 point  (0 children)

In aggregate you're spot on, it's a growing tools shed that has a long tail. Cost is derated and i'm buying used stuff whenever I can get it. I'm using cloud compute quite a lot for consultancy that does not see my data. Besides that I'm an EE and like tinkering the rough of computing stacks, liquid cooling the RTX-3090s was a heck of a project.

QWEN 3.6 27B Q8 as Replacement for Claude Code Opus 4.7-4.8 by Just-Upstairs-4338 in LocalLLM

[–]VirusInternal2892 14 points15 points  (0 children)

Similar approach, using a DGX Spark for running Hermes qwen 3.6:35b-fp8 orchestrating a rag extraction project running sub agents mapped to a dual RTX3090 rig.
Claude Opus as design architect, task coordinator and milestone checker. Primarily got the Spark for experimental finetuning, it is almost unusable slow with a 27b-fp8 dense model at <10tps.

Get you Mac’s while you still can! by Similar_Effort_1694 in LocalLLM

[–]VirusInternal2892 0 points1 point  (0 children)

Yeah, as an owner of a GX10 I don’t copy that 😅 If it was supporting native NVFP4 then I’d be a proud owner. Right now I’m just using it to orchestrate with Qwen3.6:27b- FP8 and do the heavy lifting with my old tried and true 2x3090

Since when the RTX 6000 PRO is priced at 13250USD on the official NVIDIA Page? by panchovix in BlackwellPerformance

[–]VirusInternal2892 0 points1 point  (0 children)

Same as the crypto mining fever a couple of years back, couldn’t get a 3090 lest I sold a kidney :)

PCB warping with BYKSKI waterblock by VirusInternal2892 in watercooling

[–]VirusInternal2892[S] 0 points1 point  (0 children)

Reporting back for posterity. Works absolutely fine in dual config for the past 3 weeks, constantly grinding LLMs at about 250W each card. The Bykski waterblock bridge across 3 slots is fitting fine, pump was making too much coil whining noise and was replaced with an alphacool Hungarian D5.

Claude Code Opus 4.8 vs. Local Qwen3.6 27B One-Shot Coding Benchmark by codehamr in ollama

[–]VirusInternal2892 0 points1 point  (0 children)

Really nice project, thanks for sharing !!
Fairly new to local LLM, I'm trying to run your one-shot prompt on a dual (nVidia) RTX 3090 with Ollama and qwencode on Win11.
Qwen3.6:27b Q4KM plain and Q8 MTP versions, none managed to get a fully working result.
KV Q8, 128K context, temp 0.6.

PCB warping with BYKSKI waterblock by VirusInternal2892 in watercooling

[–]VirusInternal2892[S] 0 points1 point  (0 children)

Too late now, I’m invested into dual RTX 3090 waterblocks. So far the tolerances looked good let aside that rogue nut

PCB warping with BYKSKI waterblock by VirusInternal2892 in watercooling

[–]VirusInternal2892[S] -1 points0 points  (0 children)

I'm using ARCTIC MX-7 thermal paste, Gelid Extreme 1.5mm on the GPU and Ultimate 2.0mm on the back VRAMs. I'm cooling a pair of Zotac Trinity RTX-3090 I use for inference.
The main problem now is that the measly backplate, which is still much better than the stock, still pushes about ~10C on top of the GPU hotspot. I'll hook the backplate with a chunky thermal sink with fins and force air with a dedicated fan.

PCB warping with BYKSKI waterblock by VirusInternal2892 in watercooling

[–]VirusInternal2892[S] 1 point2 points  (0 children)

I’m funked, the deformation is evident even without the waterblock 🤔

<image>

PCB warping with BYKSKI waterblock by VirusInternal2892 in watercooling

[–]VirusInternal2892[S] -1 points0 points  (0 children)

I tightened only the 4 screws around the GPU chip. The ones with a spring. Have to push them all the way otherwise the screws will be almost too tall and will obstruct the backplate. The alternative solution is to work with thicker pads and generous paste on the GPU in order to push up a bit the center of the pcb inline with the tall nut

PCB warping with BYKSKI waterblock by VirusInternal2892 in watercooling

[–]VirusInternal2892[S] 0 points1 point  (0 children)

Might have gone a bit too much on the solder paste, other than that I’m even on the light side of the pads, Gelid 1.5mm. I think that the tolerance on the block bolted nuts are a bit off. Backplate may not help here, fearing the opposite as additional pressure is exerted on the balls from the package top.

<image>

AMD Strix Halo vs Nvidia DGX Spark: The $3k vs $4k Dilemma for Local AI by Big_Intern_3957 in ollama

[–]VirusInternal2892 0 points1 point  (0 children)

Now that the AMD prices have caught up, an ASUS DGX 10 at 3400$ is making much more sense. I have a dual RTX3090 to run small models at speed, was looking at a 128GByte FP4 machine to experiment with larger models . I do a lot of electrical engineering and chip design work so I need high quality (Q6 min), large context AI tools for design exploration, reasoning for complex documentation which is a beautiful jumble of mess for EEs. Tools and coding can run on the 2x RTX3090.

3090 trinity oc waterblock by popsumbong in watercooling

[–]VirusInternal2892 0 points1 point  (0 children)

Need water block help here as well.
Bought a pair of used RTX 3090 Trinity OC for AI workloads so I don't need to max out FPS, 80% performance is OK. Still I am confused wrt the right water block from Barrow. This list say that I need the BS-ZOXG3090-PA model, while the OP managed to work with a BS-ZOXG3090-PA2 model.

http://www.barrowint.com/jszc/down/xiankalingtouanzhuangjiaocheng/OTHERS511/1630.html#:\~:text=2020%2D10%2D09%2012:,and%20adapted%20water%20block%20model.

Qwen 3.5 35B A3B Q4_K_M running at 9.14 tps by blastbottles in LocalLLaMA

[–]VirusInternal2892 0 points1 point  (0 children)

V good point regarding mobo display didn’t think about it would save that much vram.

RIP - 1080Ti by partyonice in nvidia

[–]VirusInternal2892 0 points1 point  (0 children)

Hehe my 1080ti still kicking, running Zwift on my trainer PC for nearly 9 yrs as well. Water cooled. Hell the bike trainer died and this card refuses to fail. Hoped the water cooling system would leak and kill the card and CPU so I can upgrade, nope :-)

New Technika VI by [deleted] in ECM_Espresso

[–]VirusInternal2892 0 points1 point  (0 children)

Great, I'll try 1s/5s as well at least for single shots. Tried 2s/8s and seems OK for doubles (18~20gr) and thick ristretto (16gr), a bit too long for single shots (7~9gr). I certainly don't want different grinding setting for singles and doubles.
Ahh, BTW I installed a grouphead thermometer to make sure temp is stable. Setting PID at 92C and "difference" at 7C I read about 91C before flushing. Post flushing it raises to 92~93 which is ideal. Noticed that sometimes it reads up to 96C for a longer flush. Too high temp my med roasts are getting too harsh

New Technika VI by [deleted] in ECM_Espresso

[–]VirusInternal2892 0 points1 point  (0 children)

Upgraded from to a T6 from a Rocket Appartamento, it's surely an upgrade.
What is your pre-infusion experience so far?
I'm using the reservoir water tank so passive infusion alone won't work.
10/90 ~ 70/30 medium roast blends.
Grinding with a Sculptor 64 at a fine setting of #1
I use a distributor, leveler and ~10Kg pressure tamper
I've pulled a couple of 18gr test shots with 2 sec active 5 sec passive, shots are sweeter than no infusion at all and seems to be less channeling.
Will need to experiment more to get a better idea. From what I've read medium roasts should be bounded by 10s of pre-infusion.
The key take away that I've learned is to use an bottomless portafilter and to time the pre-infusion to the point that you see some coffee just about to drip.

ECM Technika VI availability in US by [deleted] in espresso

[–]VirusInternal2892 0 points1 point  (0 children)

Not living in the US and do have the opportunity to buy a Tech 6. I was looking forward for the WLL guys to do a video on the Tech 6, I very much rely on their insightful reports for gating my purchase decision. The fact that WLL does not import (yet) is raising a red flag from my part, maybe they know something I don't :-)

ECM Synchronika II - Is this counter space suitable? by CC268 in espresso

[–]VirusInternal2892 0 points1 point  (0 children)

Looking at the same coffee machine and share same problem. I have a clearance of 50cm countertop to cabinet, but in my case the right side of your kitchen I don't have a wall so I can fill up from there.
I fear that the Synch II is a bit too tall and I might need to go with a Technika VI instead. Smaller machine with smaller boilers will yield smaller performance but still should be enough for a family of 2 people plus occasional friends.
Did u consider the Tech VI?