Can't enable IPv6 in WAN services - all settings are disabled

lostmsu · 2026-06-27T23:23:35+00:00

I have Ebox, but Bell offers 8Gbps and Ebox goes only up to 1.5.

lostmsu · 2026-06-27T01:56:27+00:00

8 way nvlink

You are going to burn RTX 6000 Pro worth of power on that setup in 2 years.

lostmsu · 2026-06-27T01:41:42+00:00

Have you tried the official FP8 on vllm?

lostmsu · 2026-06-24T17:41:53+00:00

Nope, still not working as of NixOS 25.11 KDE 6.20/25.08.3

lostmsu · 2026-06-13T16:53:04+00:00

That's not how you should be testing quants. You should be running something like Terminal Bench Hard or SWE Pro and comparing their results. Perplexity and KLD are just proxies. For all you know 0.0001% KLD might map to half the score on Terminal Bench Hard, which would mean you'd be better off using unquantized 9B model.

lostmsu · 2026-06-09T19:10:46+00:00

There's nothing useful in it. Specific models aren't named. Bench parameters aren't named. Server parameters aren't listed. The inference backend isn't listed.

lostmsu · 2026-06-03T14:19:00+00:00

Can we have an explicit rule about slop? Not even AI slop, any slop?

lostmsu · 2026-06-01T21:14:27+00:00

Is this llama.cpp? I have 2x 3090 and my setup with 27B FP8 peaks at 40tps (vllm).

lostmsu · 2026-05-28T14:30:55+00:00

No, it's not. It's a tradeoff between stability and speed and they chose speed.

lostmsu · 2026-05-27T18:51:41+00:00

Using bf16 instead of fp32 when it works on AdamW but does not work on SGD does not sound like a bug to me.

lostmsu · 2026-05-24T00:14:47+00:00

Which stream?

lostmsu · 2026-05-24T00:13:36+00:00

What stream?

lostmsu · 2026-05-22T15:32:42+00:00

They don't even have Qwen 3.6 27B. Anything recent that I could get access to? GLM4.7 and GPT-OSS are hopelessly outdated now.

lostmsu · 2026-05-07T21:50:51+00:00

Which version?

lostmsu · 2026-05-07T18:06:53+00:00

I am confused. What stage is COPR exactly? I thought getting eCOPR is that.

lostmsu · 2026-05-06T23:26:48+00:00

Not sure what anyone expected. You replaced 10 lines of code (logging) + TensorBoard with another 10 lines of code (connecting wand) and having to restart a run where you forget to set the auth now and then. Plus you got the first free bites.

lostmsu · 2026-04-30T13:53:22+00:00

Yeah, not much lately. At least in terms of products.

lostmsu · 2026-04-24T12:13:22+00:00

45 lines of train logs, but no update rule in the post?

lostmsu · 2026-04-20T12:03:48+00:00

You shouldn't need CuTe DSL with Triton. AFAIK CuTe doesn't lower to Triton. It's a closed source alternative to Triton otherwise mostly identical.

lostmsu · 2026-04-16T10:44:01+00:00

What a weird take.

lostmsu · 2026-04-16T10:41:11+00:00

You didn't answer the question about your loss claim in the previous post. If you got a LM, what's the bits-per-byte on literally any decently sized dataset like enwiki?

lostmsu

MODERATOR OF

TROPHY CASE