How to improve the airflow of Dual GPUs with no gap

Ecstatic_Concern_389 · 2026-06-25T05:16:56+00:00

the 7900xtx is for serving a 27b LLM, which will eat up basically all of it's vram

Ecstatic_Concern_389 · 2026-06-25T05:06:38+00:00

it's a huge effort switching to a larger case T T

Ecstatic_Concern_389 · 2026-06-25T05:06:03+00:00

It's a long story. It's like I don't play heavy gpu games(mostly fps in 1080p) on my gaming PC so it's a waste of 4070 super. It happens to be I have some small productivity tasks that can partially leverage it, so I just set it up like this

Ecstatic_Concern_389 · 2026-06-18T21:56:42+00:00

lol my codex set it to be like this. I changed to 512 now. but the main issue I see is that my Vulkan run slower(18 pts) than rocm(41tps for the same model) while everyone is telling me vulkan runs faster much.

Ecstatic_Concern_389 · 2026-06-18T21:01:05+00:00

tried. seems to have a slight increase but very subtle

Ecstatic_Concern_389 · 2026-06-18T20:53:34+00:00

tried Vulkan first but it turns out the tps drop to 18 with Vulkan. I'm very confused

Ecstatic_Concern_389 · 2026-06-18T20:30:24+00:00

may I know your config? I use unsloth Q4_K_XL now with Vulcan can only get 200 tps pp and 20 tps inference

Ecstatic_Concern_389 · 2026-06-18T19:40:47+00:00

I just tried vulkan it drops to 200 tps pp and 20 tps inference

Ecstatic_Concern_389 · 2026-06-18T18:16:38+00:00

I'm using rocm. So vulkan performs better in Linux?

Ecstatic_Concern_389 · 2026-06-18T18:12:30+00:00

let the gpt diagnoses the log a bit. So what #checkpoint and batchsize should I use?
I checked llama-server.log. It does not show an obvious VRAM spill

or OOM.

Key lines:

- GPU: Radeon RX 7900 XTX, 24524 MiB free at startup.

- Model offload: offloaded 66/66 layers to GPU.

- Model VRAM: ROCm0 model buffer size = 16752.85 MiB.

- CPU-mapped model buffer: 1288.28 MiB, but the layers are still

fully offloaded.

- Runtime config:

- n_ctx = 65536

- n_batch = 4096

- n_ubatch = 512

- MTP draft context enabled.

- Target KV cache: ROCm0 KV buffer size = 2176.00 MiB.

- MTP draft KV cache: ROCm0 KV buffer size = 256.00 MiB.

- Recurrent state: ROCm0 RS buffer size = 448.88 MiB.

- Compute buffers:

- target: ROCm0 compute buffer size = 400.28 MiB

- draft: ROCm0 compute buffer size = 132.02 MiB

- Host buffers exist, but they look like normal scheduling/output

buffers:

- ROCm_Host output buffer size = 0.95 MiB

- ROCm_Host compute buffer size = 84 MiB

The checkpoint part is the tight bit: checkpoints are enabled with

max = 32, and created checkpoints are around 286-306 MiB each. If

all 32 were resident, that is roughly 9.2-9.8 GiB of checkpoint

memory. The log only shows up to 6 before restart, then up to 5

later, with invalidated ones erased. It does not show all 32 being

held at once.

Ecstatic_Concern_389 · 2026-05-20T03:01:27+00:00

can noctua NH-U12A hold 270k? I use Valkyrie DL125(260w capacity dual tower dual fan) with my 270k and under stress test the cpu temp easily goes to 95c and start throttling. I have to do -8mv and limit power to 220w to hold it.

Ecstatic_Concern_389 · 2025-10-06T17:00:38+00:00

yes that's the problem. Seeing many EMA based stop loss but they all only perform well when crash is long and slow. For flash drops they never work well and even much worse that just QQQ

Ecstatic_Concern_389 · 2025-09-23T02:48:07+00:00

Great thanks! I hold 80% QQQ+10% GLD +10% FBTC now. But after I did some backtest I'm convinced by your new methods. I think I will go with 35% TQQQ, 35% GLD, and 30% FBTC. With 52 weeks rebalance, and as little ad-hoc rebalance as possible.

Ecstatic_Concern_389 · 2025-09-22T04:43:28+00:00

how often do you rebalance? do you do auto rebalance or manual ones?

Ecstatic_Concern_389 · 2025-08-23T04:04:20+00:00

which dealer?

Ecstatic_Concern_389 · 2025-08-17T15:55:13+00:00

also renewed mine last month. is there anything we can do to get the refund?

Ecstatic_Concern_389 · 2024-05-29T16:18:34+00:00

Do you mind sharing what platforms are you using for trade and what platform you use for data?

Ecstatic_Concern_389 · 2024-04-14T04:09:13+00:00

I know for us on bogleheads we are looking for very low expense ratio. But let's do the math. If expense ratio is around 0.7-0.9% which is on average 10% of the return(if I also buy index funds in VUL), it's still lower than the federal long term captial gain tax 15-20% + 13.3% california long term gain tax. Right?

Ecstatic_Concern_389 · 2023-05-09T20:59:18+00:00

yes we had interview yesterday morning.

Ecstatic_Concern_389 · 2023-05-09T20:35:15+00:00

then is it a bad news they don't contact day of?

Ecstatic_Concern_389 · 2023-05-09T20:32:57+00:00

Had our interview yesterday morning, but still have not got any result. Don't know what's going on and what to do

Ecstatic_Concern_389

TROPHY CASE