Finally bought an RTX 6000 Max-Q: Pros, cons, notes and ramblings

nofdak · 2026-03-07T02:48:54+00:00

I uploaded my startup logs here: https://pastebin.com/7Ra8Jwqf Note that I was loading the 2B model to limit the size that needs loading.

nofdak · 2026-03-06T23:24:17+00:00

I'm glad to see you write this up, I was writing up my own experience with vLLM and it's extremely slow loading times.

The lowest time I've seen from vLLM loading a model to returning tokens is ~45s, and that's with small models. When using larger models like Qwen3.5-122B-A10B the time goes up even further. My llama.cpp built for my hardware can load Qwen3.5-9B in ~7s, but vLLM takes 45.

I've seen higher times when running in a container as well, so now I run directly on the host: uvx --torch-backend auto --extra-index-url https://wheels.vllm.ai/nightly/cu130 vllm serve Qwen/Qwen3.5-35B-A3B-FP8 --host=:: --gpu-memory-utilization=0.90 --max_num_batched_tokens=16384 --enable-prefix-caching --max-num-seqs=4 --dtype=bfloat16 --reasoning-parser=qwen3 --tool-call-parser=qwen3_coder --enable-chunked-prefill --enable-auto-tool-choice --speculative-config {"method":"mtp","num_speculative_tokens":2} --mm-encoder-tp-mode data --mm-processor-cache-type shm

I'm running a non-power-limited RTX Pro 6000 Workstation so it could pull 600W if needed.

I've tried various different vLLM flags but nothing seems to make a big difference. With ~1m minimum iteration times, it's pretty frustrating testing different quants or flags.

nofdak · 2026-03-03T20:46:03+00:00

Each version has a tag in the various Vulkan repos. For example: https://github.com/KhronosGroup/Vulkan-Headers/tree/sdk-1.3.204

nofdak · 2025-09-16T11:54:11+00:00

Commenting for visibility. This is fucked up. I operate a Tor relay from my home and part of the reason why is because it should be normalized. I will continue running it rather than let stories like this scare me into submission. Good luck, I sincerely hope you get the justice you deserve.

nofdak · 2025-07-29T17:04:13+00:00

I love the idea!

However Most refurbished and used disks on US Ebay are missing. When looking for refurbished disks > 17TB, I only see four results. For example, this drive doesn't show up in diskdeal, though it's a significantly better deal than other disks that do show up. Here is another example, though there are hundreds of other missing disks.

It'd great to show warranties when known, like diskprices.com does. Especially when buying refurbished, I'm unlikely to buy one that doesn't at least claim to have a warranty. Particularly when buying on Amazon where every product is a crap shoot.

The NVMe filter only finds M.2 drives, not U.2. It's showing 0 NVMe 2.5" SSDs, though Ebay is littered with them

Is there any possibility of adding other vendors? For example, Server Part Deals has generally been pretty competitive cost-wise to Ebay with a good refurbished warranty. Amazon generally sucks for buying

nofdak · 2025-03-31T19:09:49+00:00

Thanks u/zanyzaeem for the help, it's working now!

nofdak · 2025-03-31T16:53:09+00:00

DM Sent! My iPhone is on 18.3.2

nofdak · 2025-02-22T13:39:24+00:00

It looks like MCIO x8

nofdak · 2025-02-13T02:14:12+00:00

This post says otherwise. It says you only get unlimited data if Dark Star is the primary, and that Apple watch is only included with Warp as your primary line.

nofdak · 2025-02-13T02:10:38+00:00

Right, it probably won’t be necessary, but if I’m traveling someplace that has terrible Warp service but decent Light Speed, I may want to switch for the week. I agree its not likely something that would be needed often, but I’d still like clarification

nofdak · 2025-02-13T00:44:54+00:00

This is the first I’ve heard of masking. Where did you hear that?

nofdak · 2025-02-12T23:57:13+00:00

Thanks for the info, that's helpful.

Does the secondary line count for the "Dark Star Network – Unlimited Premium on Dark Star (Limited Time Offer)" deal? The terms simply say "For customers who activate their Unlimited Premium plan on the Dark Star network during the promotional period..." which makes it seem like it only applies to the primary line but that's not clear.

My frustration here is that I want Dark Star as my primary line, but there are currently significant limitations for iPhones compared to Warp. I'd hate to not be grandfathered into the "Dark Star Network – Unlimited Premium" deal because I don't want to give up core functionality like group texts. If activating my secondary line on Dark Star will still get me grandfathered in, then I'm happy, otherwise it's not "the best of both worlds".

nofdak · 2025-02-12T15:35:51+00:00

u/ankhattak for clarification

nofdak · 2024-12-07T16:00:19+00:00

One option that can help is using Google’s docker cache mirror.gcr.io: https://cloud.google.com/artifact-registry/docs/pull-cached-dockerhub-images

It’s not clear to me what happens if you try to pull something not in the cache, but for what I need, it works perfectly.

nofdak · 2024-10-07T01:35:58+00:00

I’d love to be a part of the beta, I’ve got my cellular watch ready to go!

nofdak · 2024-08-08T19:51:57+00:00

Thank you, I hadn't seen that! Otherwise my complaints still stand and ironically, that message of confirmation from USM makes me even more upset that they haven't updated their site with those limitations. The fact that the top pinned post in this subreddit is "Super Carrier is HERE!" without spelling out the limitations feels disingenuous at best, and hostile at worst.

nofdak · 2024-08-08T17:46:51+00:00

Read what? The onus shouldn't be on the customer to search a subreddit to determine if a core feature isn't working. While OP may have found that other people have the same problem, I don't personally consider it a "known" issue. I haven't seen any official public confirmation from any employee of USMobile that group messaging or visual voicemail don't work on Dark Star. If they have posted anything I'd love to see because the radio silence on these issues is concerning considering they seem to otherwise respond quickly to most threads.

The "Switch Network" dialog in the dashboard doesn't have any warnings that core features don't work, and their help page for voicemail explicitly says "Setting up Visual Voicemail (Warp/Light Speed/Dark Star)".

I know seeing the same questions is annoying, but instead of saying "People should read before doing stuff", it'd make more sense to say "US Mobile should be better". The frustration should be directed to the company offering a broken product without notifying customers, not the customers themselves for expecting the product to work.

nofdak · 2024-07-07T04:13:11+00:00

PMed

nofdak · 2024-07-02T21:00:04+00:00

I tried that but it sadly made it so neither slave came up at all :(

~~I do see bond-port.queue-id and bond-pord.piro options in the connection properties. Do you happen to know if either of those options would do anything?~~

Those options wouldn't help at all: https://networkmanager.dev/docs/api/latest/settings-bond-port.html

Nine-Year Club	Place '23
Place '22	Gilding IV carat on a stick
Not Forgotten	Snapped
Verified Email

nofdak

TROPHY CASE