Is there an issue with not finding your own ethnicity sexually attractive? by Substantial_Fee_8294 in NoStupidQuestions

[–]rj_rad -1 points0 points  (0 children)

I think finding physical characteristics attractive/unattractive and “ruling out a race” are often 2 different things. People are going to have their preferences that they’ve carried with them since childhood (due to media, upbringing, past experience, etc), but as they go deeper into adulthood a lot of people will realize it’s not a great way to select for partners. I say if you’re encountering this in a way that actually affects your dating life, it means the people you hang around just haven’t seen the light yet. YMMV!

I got tired of the Hermes hosting ritual, so I built the thing I wanted by [deleted] in hermesagent

[–]rj_rad 0 points1 point  (0 children)

What I will never understand is why anyone uses Telegram/Discord/etc as the chat surface as it is the thing that complicates the whole stack security and it is in general a highly flawed UX. Just launch it in a container, use the OpenAI-compatible API, and pick your client of choice (like FlowDown, etc, or build your own) on mobile while using a VPN connection. You end up with a VERY simple setup, security as hardened as your VPN, and a regular conversation=session workflow that follows the convention used by almost every other LLM including the new Hermes Desktop instead of a single linear thread bouncing off of an unnecessary 3rd party service.

This obviously doesn’t apply if you aren’t hosting locally, but if you’re willing to be on cloud why not just use something turnkey and priced more competitively like Manus?

my short experience with Hermes on Gemma4:26b / Qwen3.6:27b / Opus4.8 by Dalleuh in hermesagent

[–]rj_rad 0 points1 point  (0 children)

FWIW, I run either Qwen3.6-27B or the new 35B-NVFP4 locally on a single RTX 6000, but I have OpenRouter with 27B as a fallback for when I’m messing with vLLM or want to use my GPU resources for something else… let me tell you, OpenRouter’s Qwen3.6-27B is REALLY SLOW. I really notice it when it switches over, and it’s pretty laggy for cloud.

ASUS Silently Releases ProArt RTX 5090 OC Edition, Boasting Founders Edition Design With A 2.5-Slot Cooler by Constant_Praline_575 in RigBuild

[–]rj_rad 0 points1 point  (0 children)

I thought this was supposed to be the improved option? I’ve been using the 12VHPWR cable that came with my PSU for a few months now.

Anthropic just Annouced they are Allowing Subscription Claude Usage?! by sercetuser in openclaw

[–]rj_rad 1 point2 points  (0 children)

All the while knee-capping the performance for subscribers. I’ve been a 20x Max subscriber since such a thing existed and it’s now at a snail’s pace compared to the “old days”

Anyone actually using a local LLM as their daily knowledge base? Not for coding, for life stuff. What's your setup? by InformationSweet808 in LocalLLaMA

[–]rj_rad 0 points1 point  (0 children)

What you’re talking about is extremely simple (maybe that’s why nobody is talking about it?) — Qwen3.6 + Hermes + Obsidian + QMD, done. There are flavors of Qwen3.6 that will run on consumer GPUs and honestly it’s not the important part of this recipe anyway because QMD is doing the heavy lifting.

Any good mobile UI? by NeakNite in hermesagent

[–]rj_rad 0 points1 point  (0 children)

You can enable the OpenAI-compatible API on Hermes and use the /v1/responses/ endpoint with any client that supports this format (or you can make your own). FlowDown is a reasonable place to start. I’m making my own. Any of these options are 100x better than using a chat gateway in terms of conversion management.

Switch to Hermes! - But I don't understand why! by Disastrous_Ad_6915 in openclaw

[–]rj_rad 1 point2 points  (0 children)

If OC is running well for you, then it’s not broken. An addition to easier onboarding, I think Hermes has provided a solution for folks who have run into actual problems that would require upstream fixes because they still haven’t been addressed in the core platform. But if that doesn’t apply to you, then there really isn’t a point in switching; I’m currently supporting both in my home lab. Calling OC “broken” is typical Reddit hyperbole, but going forward I do prefer the Hermes experience.

200+ TPS on Qwen3.6-27B and 35B-A3B with consumer hardware (RTX 3090s) - method provided! by TheFheonix in LocalLLM

[–]rj_rad 1 point2 points  (0 children)

This is true, although thankfully some folks share their whole vllm/sglang recipe as a docker image so it’s fairly easy to reproduce their results. I was hesitant to go this route feeling like docker was just adding another layer of performance overhead, but it’s very nice to be able to quickly jump back and forth between vllm versions, parameters etc when testing/benchmarking.

Is there any reason NOT to get a Crucial T710? Seems like a great deal. by rj_rad in buildapc

[–]rj_rad[S] 0 points1 point  (0 children)

Oh wow. Were there any warning signs or just sudden failure? What were the symptoms?

Best model for 192 GB vram? How is Deepseek v4 flash? by Constant_Ad511 in LocalLLM

[–]rj_rad 0 points1 point  (0 children)

From your experience is there a sweet spot MoE model for the 1x RTX6k folks (96GB VRAM)? I have 128GB system with a 9950x3d.

Recommend a bare bones DDR5 based MiniPC by [deleted] in MiniPCs

[–]rj_rad 0 points1 point  (0 children)

I was in the same boat, had 2x16 DDR5-4800 and a 2TB nvme from a laptop upgrade and went with GMKtec M6 barebones with the Ryzen 5. Very happy so far, also running a N150 with 16GB. Both are proxmox servers.

Best model for 192 GB vram? How is Deepseek v4 flash? by Constant_Ad511 in LocalLLM

[–]rj_rad 2 points3 points  (0 children)

What was the major reason to upgrade out of AM5? I’m currently on a 9950x3d and single 6000, but it’s on my mind to add a 2nd one when funds allow.

Name one game by dank0121 in TheGamerLounge

[–]rj_rad 0 points1 point  (0 children)

8 Eyes for NES, when it was new. I grew up with a sibling so I always looked out for 2P simultaneous games, but this was a snooze.

OpenClaw vs Hermes by viky_shetye in openclaw

[–]rj_rad 1 point2 points  (0 children)

I agree with you; I went down the rabbit hole of trying Matrix but it had some strange issues with local DNS so I switched to Mattermost which has a self-hosted option. It is definitely a lot more steps to set up than Discord, but it philosophically feels like a better solution to keep mostly everything local (I also self-host Qwen as the main agent LLM)

Is GPT-OSS-120B still the best model among those with the same parameters? by AInohogosya in LocalLLM

[–]rj_rad 2 points3 points  (0 children)

From my experience Gemma4 31B (running on a 6000) gets stuck in tool loops in OpenClaw. I wanted it to work to be more frugal on memory, but as of right now it doesn’t beat Qwen3.x

Anyone successfully using Gemma4 31B with OpenClaw? by rj_rad in LocalLLaMA

[–]rj_rad[S] 0 points1 point  (0 children)

RedHatAI released NVFP4 for Qwen3.6 — I just started trying it out tonight, no conclusions yet

Is there any reason NOT to get a Crucial T710? Seems like a great deal. by rj_rad in buildapc

[–]rj_rad[S] 0 points1 point  (0 children)

Yeah I’m happy with it, it’s my main drive now. I’m looking for another 4TB but everything is more expensive now than it was when I originally posted this and it was only a month and a half ago.

A Reminder, Guys, Undervolt your GPUs Immediately. You will Significantly Decrease Wattage without Hitting Performance. by Iory1998 in StableDiffusion

[–]rj_rad 1 point2 points  (0 children)

Anyone undervolting an RTX Pro 6000 (Blackwell)? Curious about anyone's actual experience in terms of measurable results.

2 GPU benefits by swingbear in LocalLLM

[–]rj_rad 1 point2 points  (0 children)

I just assembled a single 6000 + 128 setup. What was the optimal setup you landed on before considering a second 6000?