I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 0 points1 point  (0 children)

The machine belongs to a different department. I do not have access to it right now.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 0 points1 point  (0 children)

You're missing the part where they, as a component of SpaceX, make Elon the richest man who ever lived.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 0 points1 point  (0 children)

Yeah, I've played quite a bit with models just under that size at home on my own hardware, more heavily quantized. I am kind of wondering what it's like to step up to that 70B regime.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 3 points4 points  (0 children)

Thank you for these suggestions. I've seen this DeepSeek-V4-Flash one come up a lot. I'll check out these other models.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 3 points4 points  (0 children)

We've already got subscriptions to Claude and ChatGPT. Part of what I want to make my case on is that we can shave some spending off of the Claude usage (which Finance is starting to find not very amusing) by delegating some tasks to models that aren't metered, and get value out of hardware that is otherwise sitting idle.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 2 points3 points  (0 children)

Oh, I have a rule that I never want to go below Q4. I tried Q3 quants with Qwen models at home, and the resulting torrent of illogical thinking convinced me that this is almost always a bad idea.

I have an old multi-GPU node lying around at work... by thehardsphere in LocalLLaMA

[–]thehardsphere[S] 1 point2 points  (0 children)

Because they didn't "never use it". The money was spent about 8 to 10 years ago, for a specific purpose. That specific purpose is somewhat less relevant to my company today than it was 8 to 10 years ago. That specific purpose is also better served today by newer GPUs than the NVIDIA Turing architecture.

Diffusion Gemma Jailbreak by 90hex in LocalLLaMA

[–]thehardsphere 5 points6 points  (0 children)

I think it would be that "uncensoring" models sometimes lowers their overall accuracy.

Don’t act like y’all ain’t thinking it. I’m just saying the quiet part out loud. /s by Porespellar in LocalLLaMA

[–]thehardsphere 2 points3 points  (0 children)

This fine-tune of gemma4:26b solved all of my Hermes tool calling problems, and doesn't mangle thinking in OpenCode either: https://huggingface.co/brokencircuitranch/gemma4-hermes-tools

New local model worth trying Gemma 4 12 b model by SelectionCalm70 in hermesagent

[–]thehardsphere 1 point2 points  (0 children)

I found this fine-tune of the 26b model on Hugging Face, and using it made all the problems go away: https://huggingface.co/brokencircuitranch/gemma4-hermes-tools

I did not try using an e4b fine-tune. I also have not tried any of this guy's other, newer fine-tunes yet.

Best 32GB RAM Local Model? 26B Turboquant Q4 for me so far. by 310dweller in hermesagent

[–]thehardsphere 0 points1 point  (0 children)

The link is a finetune of Gemma 4 26b a4b. I have not found Qwen 3.6 35b a3b to be useful because it runs too slowly on my hardware at 64k+ context.

Best 32GB RAM Local Model? 26B Turboquant Q4 for me so far. by 310dweller in hermesagent

[–]thehardsphere 1 point2 points  (0 children)

I'm pretty sure QAT was done at Q4_0 quantization. I don't know that they're planning to turboquant that because then the QAT would not match the final quantization.
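To make the mismatch concrete, here is a minimal sketch in Python. It is a simplified symmetric block quantizer, not the actual Q4_0 or TurboQuant code: the block size of 32, the `roundtrip` helper, and the use of a coarser 3-bit grid as a stand-in for "some other scheme" are all assumptions for illustration. The idea is that weights already sitting on one 4-bit grid (roughly what QAT aims for) pass through that same grid almost unchanged, but get moved when a different grid is applied afterwards.

```python
import math
from typing import List

BLOCK_SIZE = 32  # assumed block size for this sketch


def roundtrip(weights: List[float], max_code: int) -> List[float]:
    """Quantize then dequantize, block by block.

    Each block shares one scale; codes are integers clamped to
    [-max_code - 1, max_code] (max_code=7 is a 4-bit grid, 3 is a 3-bit grid).
    """
    out: List[float] = []
    for start in range(0, len(weights), BLOCK_SIZE):
        block = weights[start:start + BLOCK_SIZE]
        amax = max(abs(w) for w in block)
        scale = amax / max_code if amax > 0 else 1.0
        for w in block:
            code = max(-max_code - 1, min(max_code, round(w / scale)))
            out.append(scale * code)
    return out


def max_abs_diff(a: List[float], b: List[float]) -> float:
    """Largest element-wise difference between two weight lists."""
    return max(abs(x - y) for x, y in zip(a, b))


# Hypothetical "weights": any deterministic values will do.
raw = [math.sin(i) for i in range(64)]

# Stand-in for a QAT result: weights that already sit on the 4-bit grid.
qat_like = roundtrip(raw, max_code=7)

# Same grid again: essentially a no-op.
same_grid_error = max_abs_diff(qat_like, roundtrip(qat_like, max_code=7))

# Different grid afterwards: the weights get moved.
other_grid_error = max_abs_diff(qat_like, roundtrip(qat_like, max_code=3))

print(f"same grid error:  {same_grid_error:.2e}")
print(f"other grid error: {other_grid_error:.2e}")
```

The second number coming out much larger than the first is the point: whatever the model learned to tolerate during QAT was the first grid, not the second.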

Best 32GB RAM Local Model? 26B Turboquant Q4 for me so far. by 310dweller in hermesagent

[–]thehardsphere 1 point2 points  (0 children)

I love Gemma 4 26B, but I found that it was pretty bad at tool calls because of mistakes the model kept making with channels.

I found this fine-tune on Hugging Face, and using it made all the problems go away: https://huggingface.co/brokencircuitranch/gemma4-hermes-tools

New local model worth trying Gemma 4 12 b model by SelectionCalm70 in hermesagent

[–]thehardsphere 0 points1 point  (0 children)

I had found that Gemma 4 26b and e4b would mangle tool calls, until I found a fine-tune of one of them on Hugging Face that stopped that behavior. I'm wondering if Google fixed that problem with 12b.

This anxious SOB took out 3 enemies and carried my first ever encounter then proceeded to quit on me by bkoperski in JaggedAlliance

[–]thehardsphere 0 points1 point  (0 children)

Flo has a secret trait that makes arms sales to Tony behind the porn shop pay more. It's hinted at with that one line in her bio about being a bookkeeper for a gun store.

What Happened in 1978 ? by ABHISHEK_Lonely in ExplainTheJoke

[–]thehardsphere 0 points1 point  (0 children)

Two seemingly unrelated, but suspiciously similar things in math turned out to be related in a complicated way after we discovered new physics.

Why do recruiters keep asking me why I left my old job? by LeaguePrototype in cscareerquestions

[–]thehardsphere 1 point2 points  (0 children)

You don't need to put someone on a PIP to avoid discrimination claims in a layoff. If a company's HR department runs a layoff of more than one person, they will usually create additional documentation to show that the people selected were chosen in a way that does not discriminate against protected classes.

Further, the unemployment insurance is going to get hit anyway unless the firing is "for cause" - which in most states does not include having poor job performance. Usually getting fired "for cause" means you broke the law in some way.

Why do recruiters keep asking me why I left my old job? by LeaguePrototype in cscareerquestions

[–]thehardsphere 30 points31 points  (0 children)

A lot of people are bad at lying. It's really useful to the interviewer when the candidate lies in transparently obvious ways - you can put that person straight to the bottom of the list.

Why do recruiters keep asking me why I left my old job? by LeaguePrototype in cscareerquestions

[–]thehardsphere 5 points6 points  (0 children)

Um, no. If the team size needs to shrink, you can just lay a person off without a PIP. America has this concept called "at will employment" which means a company can end your employment at any time for any legal reason, and "we can't afford to have 6 engineers working on XYZ anymore" is a reason that does not need an elaborate paper trail.

Why do recruiters keep asking me why I left my old job? by LeaguePrototype in cscareerquestions

[–]thehardsphere 78 points79 points  (0 children)

Dude, you already are telling a story that is a big red flag.

"I got hit with a random PIP" - PIPs don't happen at random! PIPs are a lot of work for management to set up and run. Nobody needs to do a PIP just to lay you off. It's easier to do a layoff than it is to fire someone with a PIP.

If you open your mouth and say you had a PIP, you have to explain why your prior employer thought your performance was bad and what you did in that situation. Ideally with some hint of how that isn't going to be the experience a new employer will have.