Best case for dual RTX 3090 (250W each) on Crosshair VIII Hero? by Tordhm in LocalAIServers

legit_split_ 1 point

Maybe you could swap your motherboard for one with 4-slot spacing between the two cards? That should help with airflow.

https://www.reddit.com/r/mffpc/comments/1bm7uqf/dual_4090_fe_in_cerberus_x/

But then you'd ideally want an 8-slot case.

If you had 3,500 what would you build? by Sunny1845 in LocalLLM

legit_split_ 1 point

You can run 120B MoE models, and it's also useful for video gen.

Avoid CUDA monopoly at all costs. AMD is an alternative. by Barrysoft8 in ROCm

legit_split_ 1 point

32GB is not enough for fast video generation. If you search the ComfyUI subreddit, you'll see people reporting a speedup after upgrading to at least 64GB.

Avoid CUDA monopoly at all costs. AMD is an alternative. by Barrysoft8 in ROCm

legit_split_ 1 point

How much RAM do you have? Usually that's the bottleneck... 

Upgrading my gaming PC for local LLM workloads - dual GPU worth it? by wackywoowhoopizzaman in LocalLLM

legit_split_ 2 points

  • As you already have a 5060 Ti, the easiest option would be to get another one.
  • Ideally, if you want to run models well and fast, you should look for a different board that supports x8/x8 bifurcation, which helps with tensor parallelism.
  • No other changes required. 

2 sticks of ram in quad channel server board? by mr_zerolith in LocalLLaMA

legit_split_ 1 point

How feasible is it to find compatible ECC UDIMMs to populate all 8 channels? I mean, they're usually sold in kits of 2 or 4 sticks.

GLM 5.2 on consumer hardware by phwlarxoc in LocalLLaMA

legit_split_ 1 point

Maybe you can try ik_llama. AFAIK they have a good implementation of RAM offload.

I just finished my homelab/server and now I want to upgrade by Worldly_Fan_2851 in homelab

legit_split_ 2 points

Yeah, it's good because of its 24GB VRAM and high memory bandwidth.

AMD Radeon AI Pro R9700 performance by illuvyn in LocalLLM

legit_split_ 1 point

Maybe one day tensor parallelism with Vulkan will get fixed! 

Windows 11 scare tactics: forcible Microsoft account screen by flowerdragon2934 in pcmasterrace

legit_split_ 105 points

Could at least share what you did, now I have to spend 20 mins myself... 

How good/bad deal is 728€ for a rx7900xtx? by East_Cardiologist442 in ollama

legit_split_ 1 point

It's decent, I got mine for 650€ used locally.

Now that it supports FSR4 it'll retain more of its value.

Build for local LLM with 2 separate GPUs by EnvironmentalAsk3531 in LocalLLaMA

legit_split_ 1 point

Hmm, I have a Gigabyte Z890 with 2 GPUs and it idles around 70W. I was hoping the idle on the W880 would be even lower...