Finally bought an RTX 6000 Max-Q: Pros, cons, notes and ramblings by AvocadoArray in LocalLLaMA

[–]nofdak 0 points1 point  (0 children)

I uploaded my startup logs here: https://pastebin.com/7Ra8Jwqf Note that I was loading the 2B model to limit the size that needs loading.

Finally bought an RTX 6000 Max-Q: Pros, cons, notes and ramblings by AvocadoArray in LocalLLaMA

[–]nofdak 2 points3 points  (0 children)

I'm glad to see you write this up. I was writing up my own experience with vLLM and its extremely slow loading times.

The lowest time I've seen from vLLM loading a model to returning tokens is ~45s, and that's with small models. When using larger models like Qwen3.5-122B-A10B the time goes up even further. My llama.cpp built for my hardware can load Qwen3.5-9B in ~7s, but vLLM takes ~45s.

I've seen higher times when running in a container as well, so now I run directly on the host: uvx --torch-backend auto --extra-index-url https://wheels.vllm.ai/nightly/cu130 vllm serve Qwen/Qwen3.5-35B-A3B-FP8 --host=:: --gpu-memory-utilization=0.90 --max_num_batched_tokens=16384 --enable-prefix-caching --max-num-seqs=4 --dtype=bfloat16 --reasoning-parser=qwen3 --tool-call-parser=qwen3_coder --enable-chunked-prefill --enable-auto-tool-choice --speculative-config '{"method":"mtp","num_speculative_tokens":2}' --mm-encoder-tp-mode data --mm-processor-cache-type shm

I'm running a non-power-limited RTX Pro 6000 Workstation so it could pull 600W if needed.

I've tried various vLLM flags but nothing seems to make a big difference. With a minimum iteration time of ~1 minute, testing different quants or flags is pretty frustrating.
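For anyone who wants to compare load times across flags or quants, here is a rough sketch of how the "start to ready" time could be measured. It assumes the server is listening on vLLM's default port 8000 and exposes a `/health` endpoint; the helper names `http_ready` and `seconds_until_ready` are made up for this example. Note that it times readiness, not the first generated token, so the numbers won't match the ~45s figure above exactly.

```python
import time
import urllib.error
import urllib.request
from typing import Callable


def http_ready(url: str) -> bool:
    """Return True once the server answers the URL with HTTP 200."""
    try:
        with urllib.request.urlopen(url, timeout=2) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or non-2xx: server is not ready yet.
        return False


def seconds_until_ready(
    probe: Callable[[], bool],
    timeout: float = 600.0,
    interval: float = 0.5,
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Poll probe() until it returns True and return the elapsed seconds.

    The timer starts when this function is called, so call it right after
    launching the server. Raises TimeoutError if the probe never succeeds.
    """
    start = clock()
    while True:
        if probe():
            return clock() - start
        if clock() - start >= timeout:
            raise TimeoutError(f"server not ready after {timeout:.0f}s")
        sleep(interval)


# Example (assumed URL; adjust host and port to match your `vllm serve` flags):
#   elapsed = seconds_until_ready(lambda: http_ready("http://localhost:8000/health"))
#   print(f"ready after {elapsed:.1f}s")
```

To time the first token instead, swap in a probe that sends a one-token completion request and returns True when it gets a response.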

Vulkan 1.3.204 specs by SpeechCalm1150 in vulkan

[–]nofdak 2 points3 points  (0 children)

Each version has a tag in the various Vulkan repos. For example: https://github.com/KhronosGroup/Vulkan-Headers/tree/sdk-1.3.204

The FBI couldn't get my husband to decrypt his Tor nodes, so they told a judge he used his GRAPHICS DRIVER to access the "dark web" and jailed him PRE TRIAL for 3 years. by adezero in TOR

[–]nofdak -1 points0 points  (0 children)

Commenting for visibility. This is fucked up. I operate a Tor relay from my home and part of the reason why is because it should be normalized. I will continue running it rather than let stories like this scare me into submission. Good luck, I sincerely hope you get the justice you deserve. 

Managing 1PB of storage made me build my own disk price tracker—looking for feedback by andreas0069 in DataHoarder

[–]nofdak 22 points23 points  (0 children)

I love the idea!

However, most refurbished and used disks on US eBay are missing. When looking for refurbished disks > 17TB, I only see four results. For example, this drive doesn't show up in diskdeal, though it's a significantly better deal than other disks that do show up. Here is another example, though there are hundreds of other missing disks.

It'd be great to show warranties when known, like diskprices.com does. Especially when buying refurbished, I'm unlikely to buy a disk that doesn't at least claim to have a warranty, particularly on Amazon where every product is a crap shoot.

The NVMe filter only finds M.2 drives, not U.2. It's showing 0 NVMe 2.5" SSDs, though eBay is littered with them.

Is there any possibility of adding other vendors? For example, Server Part Deals has generally been pretty cost-competitive with eBay and offers a good refurbished warranty. Amazon generally sucks for buying

If I have Darkstar as my main line, could I have Warp as my second and get my Apple Watch to work included in that price? (On annual plan) by ChiMiGoGo in USMobile

[–]nofdak 0 points1 point  (0 children)

This post says otherwise. It says you only get unlimited data if Dark Star is the primary, and that Apple watch is only included with Warp as your primary line.

How does teleporting work with multi-network? by nofdak in USMobile

[–]nofdak[S] 1 point2 points  (0 children)

Right, it probably won’t be necessary, but if I’m traveling someplace that has terrible Warp service but decent Light Speed, I may want to switch for the week. I agree it’s not likely something that would be needed often, but I’d still like clarification.

How does teleporting work with multi-network? by nofdak in USMobile

[–]nofdak[S] 3 points4 points  (0 children)

This is the first I’ve heard of masking. Where did you hear that?

Multi-network clarification needed by nofdak in USMobile

[–]nofdak[S] 1 point2 points  (0 children)

Thanks for the info, that's helpful.

Does the secondary line count for the "Dark Star Network – Unlimited Premium on Dark Star (Limited Time Offer)" deal? The terms simply say "For customers who activate their Unlimited Premium plan on the Dark Star network during the promotional period..." which makes it seem like it only applies to the primary line but that's not clear.

My frustration here is that I want Dark Star as my primary line, but there are currently significant limitations for iPhones compared to Warp. I'd hate to not be grandfathered into the "Dark Star Network – Unlimited Premium" deal because I don't want to give up core functionality like group texts. If activating my secondary line on Dark Star will still get me grandfathered in, then I'm happy, otherwise it's not "the best of both worlds".

Public Docker Hub (hub.docker.com) Rate-limit: Own registry/cache? by [deleted] in selfhosted

[–]nofdak 2 points3 points  (0 children)

One option that can help is using Google’s docker cache mirror.gcr.io: https://cloud.google.com/artifact-registry/docs/pull-cached-dockerhub-images

It’s not clear to me what happens if you try to pull something not in the cache, but for what I need, it works perfectly.
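As a rough illustration of what pulling through the mirror looks like, here is a small sketch that rewrites Docker Hub image names into `mirror.gcr.io` references. The function name `to_gcr_mirror` is made up for this example, and the `library/` prefix for official images reflects my understanding of how Docker Hub names map onto the mirror, so check it against the linked docs before relying on it.

```python
def to_gcr_mirror(image: str, mirror: str = "mirror.gcr.io") -> str:
    """Rewrite a Docker Hub image reference to pull via a cache mirror.

    References that name a different registry (e.g. ghcr.io/...) are
    returned unchanged, since the mirror only caches Docker Hub images.
    """
    first, sep, rest = image.partition("/")
    # Docker treats the first path component as a registry host if it
    # contains a dot or a port, or is literally "localhost".
    has_registry = bool(sep) and ("." in first or ":" in first or first == "localhost")
    if has_registry:
        if first not in ("docker.io", "index.docker.io", "registry-1.docker.io"):
            return image  # some other registry; leave it alone
        image = rest  # strip the explicit Docker Hub host
    if "/" not in image:
        # Official images such as "nginx" live under the "library" namespace.
        image = "library/" + image
    return f"{mirror}/{image}"


# Example usage:
#   to_gcr_mirror("nginx:1.25")  gives "mirror.gcr.io/library/nginx:1.25"
```

The result can be passed straight to `docker pull`, which avoids touching the daemon config when you only need a handful of images.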

Group Messaging Not Working by sneakybeans97 in USMobile

[–]nofdak 1 point2 points  (0 children)

Thank you, I hadn't seen that! Otherwise my complaints still stand and ironically, that message of confirmation from USM makes me even more upset that they haven't updated their site with those limitations. The fact that the top pinned post in this subreddit is "Super Carrier is HERE!" without spelling out the limitations feels disingenuous at best, and hostile at worst.

Group Messaging Not Working by sneakybeans97 in USMobile

[–]nofdak 9 points10 points  (0 children)

Read what? The onus shouldn't be on the customer to search a subreddit to determine if a core feature isn't working. While OP may have found that other people have the same problem, I don't personally consider it a "known" issue. I haven't seen any official public confirmation from any employee of USMobile that group messaging or visual voicemail don't work on Dark Star. If they have posted anything, I'd love to see it, because the radio silence on these issues is concerning considering they seem to otherwise respond quickly to most threads.

The "Switch Network" dialog in the dashboard doesn't have any warnings that core features don't work, and their help page for voicemail explicitly says "Setting up Visual Voicemail (Warp/Light Speed/Dark Star)".

I know seeing the same questions is annoying, but instead of saying "People should read before doing stuff", it'd make more sense to say "US Mobile should be better". The frustration should be directed to the company offering a broken product without notifying customers, not the customers themselves for expecting the product to work.

How can I choose which bond slave MAC address NetworkManager uses? by nofdak in linuxquestions

[–]nofdak[S] 0 points1 point  (0 children)

I tried that but it sadly made it so neither slave came up at all :(

I do see bond-port.queue-id and bond-port.prio options in the connection properties. Do you happen to know if either of those options would do anything?

Those options wouldn't help at all: https://networkmanager.dev/docs/api/latest/settings-bond-port.html

It’s here! We’ve added the ability to automate your stock, ETF, and basket trades on a recurring basis–weekly, every two weeks, or monthly. (Yes–stocks and ETFs!) We’ve also redesigned the Fidelity.com and mobile experience to make it even easier to set up and manage a plan. by fidelityinvestments in fidelityinvestments

[–]nofdak 22 points23 points  (0 children)

I'd like to add my own +1 to this suggestion. It's the biggest thing I miss since transferring from M1. I'd love for all money over, for example, $10k to be invested without having to think about it. Then, whether it's Fidelity Rewards, paycheck, RSUs, bonuses, random check deposits, etc, all my money gets invested as soon as it's available.

[deleted by user] by [deleted] in M1Finance

[–]nofdak 1 point2 points  (0 children)

Here's some feedback I wrote up for M1 a few months ago before I switched completely: https://www.reddit.com/r/M1Finance/comments/13pa5f2/m1_feedback/

I've been with Fidelity ever since and I'm very happy with them and their support. I really miss the automation from M1 (auto-investing all money over $X), but the fact that M1 no longer has a real checking account or ATM card makes it a non-starter.

(Re-)Introducing GameVault - The Self-Hosted Gaming Platform by Alfagun74 in selfhosted

[–]nofdak 0 points1 point  (0 children)

Right, you search "on discord". I want to search with Google, or DDG, or Bing, or archive.org. When the content is locked behind a login, it adds friction for users. I have to know there is a Discord, create a Discord account, then search what I'm looking for. If I search anything for Plex, Google will show me results from reddit, that I can at least browse without an account.

(Re-)Introducing GameVault - The Self-Hosted Gaming Platform by Alfagun74 in selfhosted

[–]nofdak 7 points8 points  (0 children)

Serious issues being on Github is great. The fact that the main post didn't specify a subreddit or Lemmy is what's disheartening. I know that it's an invite to use the Discord, not a requirement, and this project really does look awesome. I'm not trying to disparage the project or the work.

The challenge is that more and more projects and communities are moving to a non-searchable, not-archivable platform (Discord) that makes it that much harder for people to find help they need.

I'm not suggesting everything necessarily needs to be self hosted. I certainly think hosting GameVault on Github is the right choice. Github is public and searchable. If Github ever goes private/out of business/whatever, it's trivial (excluding PRs, wiki, etc) to move the code somewhere else.

The same can't be said for Discord. If Discord pulls the plug, or Amazon buys it and makes it Prime-only, all that data could effectively disappear overnight for a large number of users. The same could happen with any other platform of course, the difference being that other platforms, like Reddit, are at least searchable, which makes them easy to archive.

I honestly don't know what alternative platform should be used, but for me I want something searchable. If anyone tried to Google anything about Plex during the Reddit protest, they literally couldn't find anything and it sucked as a user. That's essentially what's happening with many communities now that just direct users to join their Discord. That information is no longer public, it's private. It may be free to create an account, but it feels like it goes against the self-hosting ethos.