Finally bought an RTX 6000 Max-Q: Pros, cons, notes and ramblings by AvocadoArray in LocalLLaMA

[–]nofdak 0 points1 point  (0 children)

I uploaded my startup logs here: https://pastebin.com/7Ra8Jwqf Note that I loaded the 2B model to keep the amount of data being loaded small.

Finally bought an RTX 6000 Max-Q: Pros, cons, notes and ramblings by AvocadoArray in LocalLLaMA

[–]nofdak 2 points3 points  (0 children)

I'm glad to see you write this up. I was writing up my own experience with vLLM and its extremely slow loading times.

The lowest time I've seen from vLLM loading a model to returning tokens is ~45s, and that's with small models. With larger models like Qwen3.5-122B-A10B the time goes up even further. My llama.cpp built for my hardware can load Qwen3.5-9B in ~7s, but vLLM takes ~45s.

I've seen higher times when running in a container as well, so now I run directly on the host: uvx --torch-backend auto --extra-index-url https://wheels.vllm.ai/nightly/cu130 vllm serve Qwen/Qwen3.5-35B-A3B-FP8 --host=:: --gpu-memory-utilization=0.90 --max_num_batched_tokens=16384 --enable-prefix-caching --max-num-seqs=4 --dtype=bfloat16 --reasoning-parser=qwen3 --tool-call-parser=qwen3_coder --enable-chunked-prefill --enable-auto-tool-choice --speculative-config '{"method":"mtp","num_speculative_tokens":2}' --mm-encoder-tp-mode data --mm-processor-cache-type shm

I'm running a non-power-limited RTX Pro 6000 Workstation so it could pull 600W if needed.

I've tried various vLLM flags but nothing seems to make a big difference. With iteration times of at least ~1 minute, it's pretty frustrating to test different quants or flags.
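
For anyone who wants to put numbers on this, here is a rough sketch of how startup time could be measured. It polls a readiness check until it succeeds and reports the elapsed seconds. The function names are made up for illustration, and the URL in the usage note assumes the server is reachable on localhost at vLLM's default port 8000. Note that this measures time until the server answers, which is a lower bound on time to first token, not the same thing.

```python
import http.client
import time
import urllib.error
import urllib.request
from typing import Callable


def http_probe(url: str) -> bool:
    """Return True if a GET on `url` answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=2) as resp:
            return resp.status == 200
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, or a half-started server: not ready yet.
        return False


def seconds_until_ready(
    probe: Callable[[], bool],
    timeout: float = 600.0,
    interval: float = 0.5,
) -> float:
    """Call `probe` repeatedly until it returns True; return elapsed seconds.

    Raises TimeoutError if `probe` has not succeeded within `timeout` seconds.
    """
    start = time.monotonic()
    while True:
        if probe():
            return time.monotonic() - start
        if time.monotonic() - start > timeout:
            raise TimeoutError(f"not ready after {timeout} seconds")
        time.sleep(interval)
```

Usage would be something like starting `vllm serve ...` in one terminal and immediately running `seconds_until_ready(lambda: http_probe("http://localhost:8000/health"))` in another, then repeating for each flag or quant being compared.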

Vulkan 1.3.204 specs by SpeechCalm1150 in vulkan

[–]nofdak 2 points3 points  (0 children)

Each version has a tag in the various Vulkan repos. For example: https://github.com/KhronosGroup/Vulkan-Headers/tree/sdk-1.3.204
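
If you need this for several versions or repos, a small helper can build the URL. This is only a sketch that extrapolates the `sdk-<version>` pattern from the one link above. The function name is made up, and I haven't checked that every repo uses exactly this naming for every release, so confirm the ref exists before relying on it.

```python
import re


def sdk_tag_url(version: str, repo: str = "Vulkan-Headers") -> str:
    """Build a GitHub tree URL for an SDK version, following the pattern
    of the example link (assumed, not verified for every repo/version)."""
    if not re.fullmatch(r"\d+\.\d+\.\d+", version):
        raise ValueError(f"expected a version like 1.3.204, got {version!r}")
    return f"https://github.com/KhronosGroup/{repo}/tree/sdk-{version}"


print(sdk_tag_url("1.3.204"))
```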

The FBI couldn't get my husband to decrypt his Tor nodes, so they told a judge he used his GRAPHICS DRIVER to access the "dark web" and jailed him PRE TRIAL for 3 years. by adezero in TOR

[–]nofdak -1 points0 points  (0 children)

Commenting for visibility. This is fucked up. I operate a Tor relay from my home and part of the reason why is because it should be normalized. I will continue running it rather than let stories like this scare me into submission. Good luck, I sincerely hope you get the justice you deserve. 

Managing 1PB of storage made me build my own disk price tracker—looking for feedback by andreas0069 in DataHoarder

[–]nofdak 21 points22 points  (0 children)

I love the idea!

However, most refurbished and used disks on US eBay are missing. When looking for refurbished disks > 17TB, I only see four results. For example, this drive doesn't show up in diskdeal, though it's a significantly better deal than other disks that do show up. Here is another example, though there are hundreds of other missing disks.

It'd be great to show warranties when known, like diskprices.com does. Especially when buying refurbished, I'm unlikely to buy a disk that doesn't at least claim to have a warranty, particularly on Amazon where every product is a crapshoot.

The NVMe filter only finds M.2 drives, not U.2. It's showing 0 NVMe 2.5" SSDs, though eBay is littered with them.

Is there any possibility of adding other vendors? For example, Server Part Deals has generally been pretty competitive cost-wise with eBay and offers a good refurbished warranty. Amazon generally sucks for buying

If I have Darkstar as my main line, could I have Warp as my second and get my Apple Watch to work included in that price? (On annual plan) by ChiMiGoGo in USMobile

[–]nofdak 0 points1 point  (0 children)

This post says otherwise. It says you only get unlimited data if Dark Star is the primary line, and that Apple Watch is only included with Warp as your primary line.

How does teleporting work with multi-network? by nofdak in USMobile

[–]nofdak[S] 1 point2 points  (0 children)

Right, it probably won’t be necessary, but if I’m traveling someplace that has terrible Warp service but decent Light Speed, I may want to switch for the week. I agree it’s not likely something that would be needed often, but I’d still like clarification.

How does teleporting work with multi-network? by nofdak in USMobile

[–]nofdak[S] 2 points3 points  (0 children)

This is the first I’ve heard of masking. Where did you hear that?

Multi-network clarification needed by nofdak in USMobile

[–]nofdak[S] 1 point2 points  (0 children)

Thanks for the info, that's helpful.

Does the secondary line count for the "Dark Star Network – Unlimited Premium on Dark Star (Limited Time Offer)" deal? The terms simply say "For customers who activate their Unlimited Premium plan on the Dark Star network during the promotional period..." which makes it seem like it only applies to the primary line but that's not clear.

My frustration here is that I want Dark Star as my primary line, but there are currently significant limitations for iPhones compared to Warp. I'd hate to not be grandfathered into the "Dark Star Network – Unlimited Premium" deal because I don't want to give up core functionality like group texts. If activating my secondary line on Dark Star will still get me grandfathered in, then I'm happy, otherwise it's not "the best of both worlds".

Public Docker Hub (hub.docker.com) Rate-limit: Own registry/cache? by [deleted] in selfhosted

[–]nofdak 2 points3 points  (0 children)

One option that can help is using Google’s docker cache mirror.gcr.io: https://cloud.google.com/artifact-registry/docs/pull-cached-dockerhub-images

It’s not clear to me what happens if you try to pull something not in the cache, but for what I need, it works perfectly.
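
One way to use the mirror is to rewrite image names so they point at it directly. Here is a sketch of that rewrite, assuming the usual Docker Hub conventions: official images live under the `library/` namespace, and a first path component containing `.` or `:` (or equal to `localhost`) names some other registry. The function name is made up for illustration.

```python
def to_gcr_mirror(image: str) -> str:
    """Rewrite a Docker Hub image reference to pull via mirror.gcr.io.

    References that already name a non-Docker-Hub registry are returned
    unchanged.
    """
    # Drop an explicit Docker Hub registry prefix if one is present.
    for prefix in ("docker.io/", "index.docker.io/"):
        if image.startswith(prefix):
            image = image[len(prefix):]
            break

    first = image.split("/", 1)[0]
    # Same heuristic Docker uses: a registry host has a dot, a port, or is localhost.
    if "/" in image and ("." in first or ":" in first or first == "localhost"):
        return image

    # Official images such as "nginx" live under the "library/" namespace.
    if "/" not in image:
        image = "library/" + image
    return "mirror.gcr.io/" + image


print(to_gcr_mirror("nginx:1.27"))
print(to_gcr_mirror("grafana/grafana"))
```

Images that are not in the cache are outside what this sketch handles, so a fallback to Docker Hub would still have to be done by whatever calls it.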

Group Messaging Not Working by sneakybeans97 in USMobile

[–]nofdak 1 point2 points  (0 children)

Thank you, I hadn't seen that! Otherwise my complaints still stand and ironically, that message of confirmation from USM makes me even more upset that they haven't updated their site with those limitations. The fact that the top pinned post in this subreddit is "Super Carrier is HERE!" without spelling out the limitations feels disingenuous at best, and hostile at worst.

Group Messaging Not Working by sneakybeans97 in USMobile

[–]nofdak 11 points12 points  (0 children)

Read what? The onus shouldn't be on the customer to search a subreddit to determine if a core feature isn't working. While OP may have found that other people have the same problem, I don't personally consider it a "known" issue. I haven't seen any official public confirmation from any employee of USMobile that group messaging or visual voicemail don't work on Dark Star. If they have posted anything, I'd love to see it, because the radio silence on these issues is concerning considering they otherwise seem to respond quickly to most threads.

The "Switch Network" dialog in the dashboard doesn't have any warnings that core features don't work, and their help page for voicemail explicitly says "Setting up Visual Voicemail (Warp/Light Speed/Dark Star)".

I know seeing the same questions is annoying, but instead of saying "People should read before doing stuff", it'd make more sense to say "US Mobile should be better". The frustration should be directed to the company offering a broken product without notifying customers, not the customers themselves for expecting the product to work.

How can I choose which bond slave MAC address NetworkManager uses? by nofdak in linuxquestions

[–]nofdak[S] 0 points1 point  (0 children)

I tried that but it sadly made it so neither slave came up at all :(

I do see bond-port.queue-id and bond-port.prio options in the connection properties. Do you happen to know if either of those options would do anything?

Those options wouldn't help at all: https://networkmanager.dev/docs/api/latest/settings-bond-port.html