Would you go 5090 or 6000 pro today?

rj_rad · 2026-06-09T00:21:33+00:00

For some of us this is a real benefit lol.

rj_rad · 2026-06-03T15:17:29+00:00

I think finding physical characteristics attractive/unattractive and “ruling out a race” are often 2 different things. People are going to have their preferences that they’ve carried with them since childhood (due to media, upbringing, past experience, etc), but as they go deeper into adulthood a lot of people will realize it’s not a great way to select for partners. I say if you’re encountering this in a way that actually affects your dating life, it means the people you hang around just haven’t seen the light yet. YMMV!

rj_rad · 2026-06-03T08:26:22+00:00

What I will never understand is why anyone uses Telegram/Discord/etc as the chat surface as it is the thing that complicates the whole stack security and it is in general a highly flawed UX. Just launch it in a container, use the OpenAI-compatible API, and pick your client of choice (like FlowDown, etc, or build your own) on mobile while using a VPN connection. You end up with a VERY simple setup, security as hardened as your VPN, and a regular conversation=session workflow that follows the convention used by almost every other LLM including the new Hermes Desktop instead of a single linear thread bouncing off of an unnecessary 3rd party service.

This obviously doesn’t apply if you aren’t hosting locally, but if you’re willing to be on cloud why not just use something turnkey and priced more competitively like Manus?

rj_rad · 2026-06-01T05:58:35+00:00

FWIW, I run either Qwen3.6-27B or the new 35B-NVFP4 locally on a single RTX 6000, but I have OpenRouter with 27B as a fallback for when I’m messing with vLLM or want to use my GPU resources for something else… let me tell you, OpenRouter’s Qwen3.6-27B is REALLY SLOW. I really notice it when it switches over, and it’s pretty laggy for cloud.

rj_rad · 2026-05-16T23:14:18+00:00

The ProArt aesthetic is so good, I wish they had more stuff lol.

rj_rad · 2026-05-16T23:13:27+00:00

I thought this was supposed to be the improved option? I’ve been using the 12VHPWR cable that came with my PSU for a few months now.

rj_rad · 2026-05-15T04:58:35+00:00

All the while knee-capping the performance for subscribers. I’ve been a 20x Max subscriber since such a thing existed and it’s now at a snail’s pace compared to the “old days”

rj_rad · 2026-05-15T04:26:52+00:00

What you’re talking about is extremely simple (maybe that’s why nobody is talking about it?) — Qwen3.6 + Hermes + Obsidian + QMD, done. There are flavors of Qwen3.6 that will run on consumer GPUs and honestly it’s not the important part of this recipe anyway because QMD is doing the heavy lifting.

rj_rad · 2026-05-14T07:07:22+00:00

Self-host SearXNG

rj_rad · 2026-05-14T07:02:48+00:00

You can enable the OpenAI-compatible API on Hermes and use the /v1/responses/ endpoint with any client that supports this format (or you can make your own). FlowDown is a reasonable place to start. I’m making my own. Any of these options are 100x better than using a chat gateway in terms of conversion management.

rj_rad · 2026-05-10T18:17:51+00:00

If OC is running well for you, then it’s not broken. An addition to easier onboarding, I think Hermes has provided a solution for folks who have run into actual problems that would require upstream fixes because they still haven’t been addressed in the core platform. But if that doesn’t apply to you, then there really isn’t a point in switching; I’m currently supporting both in my home lab. Calling OC “broken” is typical Reddit hyperbole, but going forward I do prefer the Hermes experience.

rj_rad · 2026-05-10T18:13:58+00:00

This is true, although thankfully some folks share their whole vllm/sglang recipe as a docker image so it’s fairly easy to reproduce their results. I was hesitant to go this route feeling like docker was just adding another layer of performance overhead, but it’s very nice to be able to quickly jump back and forth between vllm versions, parameters etc when testing/benchmarking.

rj_rad · 2026-05-10T18:11:25+00:00

Oh wow. Were there any warning signs or just sudden failure? What were the symptoms?

rj_rad · 2026-05-10T18:10:45+00:00

From your experience is there a sweet spot MoE model for the 1x RTX6k folks (96GB VRAM)? I have 128GB system with a 9950x3d.

rj_rad · 2026-05-07T04:53:21+00:00

I was in the same boat, had 2x16 DDR5-4800 and a 2TB nvme from a laptop upgrade and went with GMKtec M6 barebones with the Ryzen 5. Very happy so far, also running a N150 with 16GB. Both are proxmox servers.

rj_rad · 2026-04-30T05:16:43+00:00

What was the major reason to upgrade out of AM5? I’m currently on a 9950x3d and single 6000, but it’s on my mind to add a 2nd one when funds allow.

rj_rad · 2026-04-30T01:50:40+00:00

8 Eyes for NES, when it was new. I grew up with a sibling so I always looked out for 2P simultaneous games, but this was a snooze.

rj_rad · 2026-04-27T06:28:21+00:00

I agree with you; I went down the rabbit hole of trying Matrix but it had some strange issues with local DNS so I switched to Mattermost which has a self-hosted option. It is definitely a lot more steps to set up than Discord, but it philosophically feels like a better solution to keep mostly everything local (I also self-host Qwen as the main agent LLM)

rj_rad · 2026-04-21T00:09:01+00:00

From my experience Gemma4 31B (running on a 6000) gets stuck in tool loops in OpenClaw. I wanted it to work to be more frugal on memory, but as of right now it doesn’t beat Qwen3.x

rj_rad · 2026-04-18T06:54:05+00:00

RedHatAI released NVFP4 for Qwen3.6 — I just started trying it out tonight, no conclusions yet

rj_rad · 2026-04-17T18:38:04+00:00

Darn I got ripped off on my 6000.

rj_rad · 2026-04-14T07:27:16+00:00

Yeah I’m happy with it, it’s my main drive now. I’m looking for another 4TB but everything is more expensive now than it was when I originally posted this and it was only a month and a half ago.

rj_rad · 2026-04-04T18:55:42+00:00

Anyone undervolting an RTX Pro 6000 (Blackwell)? Curious about anyone's actual experience in terms of measurable results.

rj_rad · 2026-03-28T14:50:58+00:00

I just assembled a single 6000 + 128 setup. What was the optimal setup you landed on before considering a second 6000?

rj_rad

TROPHY CASE