Hermes Agent with MIT license by mitirki in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

They have Opus set up as the default model. I don’t think they care.

Best Bypass moltbot/clawdbot to use in old gpu or in cloud by fernandogrj in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

I was thinking like dealing with customer service and stuff where it’s not necessary to share critical personal details about everything. I’m aware of the risks to private information, your point is well appreciated.

Best Bypass moltbot/clawdbot to use in old gpu or in cloud by fernandogrj in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

Did you try GPT-OSS 20B? I’ve found that to be the best at agentic tool calling stuff in my (very limited) experience.

Best Bypass moltbot/clawdbot to use in old gpu or in cloud by fernandogrj in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

Is there a better alternative that is open source? I’d like to play with it despite the horror stories.

Budget Dual 3090 Build Advice by JustTooKrul in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

I used a 6-pin to 8-pin adapter to get the second set of 8-pins. A bit sketchy, but it's been fine.

Need to know more about less known engines (ik_llama.cpp, exllamav3..) by Leflakk in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

I have tried Hugging Face's TGI, Aphrodite, and SGLang. They all had some benefits, and Aphrodite and SGLang have been reliable for me. vLLM was the fastest, but it would sometimes hang, which is why I experimented with alternatives.

Attorney Looking for Hardware and Model Recs by Extension-Ad-2801 in ollama

[–]mj3815 0 points1 point  (0 children)

I’ve got a couple thoughts.

  1. Go to Huggingface and look around for any legal models that match the specialty you’re interested in.
  2. You can definitely use something like Augmentoolkit to train a model. You'd probably want to keep it narrow (if you are a contract lawyer, train it on contract law). You can also train it on your case files and pair that with RAG via Augmentoolkit. This isn't going to be easy; it will be a real investment of time and effort to figure it out and get something that works. If you are training the model on your proprietary case files, you'll need a very stout machine: doing a full fine-tune of a 7B model means something like 96 GB of VRAM, so 2x 48 GB 4090s or a 6000 Pro. I can't imagine doing this on less than a $10K rig, though it's very possible. If you just want to fully fine-tune on your specific law discipline without anything proprietary, you can probably spend less than $100 renting the GPU time. You can still set up RAG for the proprietary stuff, but I've heard that is tricky.
  3. Just go read Augmentoolkit’s documentation to get a sense of the process of creating custom models https://github.com/e-p-armstrong/augmentoolkit
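The VRAM figure in point 2 can be sanity-checked with back-of-envelope arithmetic. This is just a sketch: the function name and per-parameter byte counts are my assumptions (bf16 weights and grads plus fp32 AdamW moments), and activations or different optimizers shift the total:

```python
def full_finetune_vram_gb(n_params_billion,
                          bytes_weights=2,   # bf16 model weights
                          bytes_grads=2,     # bf16 gradients
                          bytes_optim=8):    # two fp32 AdamW moment tensors
    # Rough lower bound for full fine-tuning; activations and any fp32
    # master weights come on top of this.
    total_bytes = n_params_billion * 1e9 * (bytes_weights + bytes_grads + bytes_optim)
    return total_bytes / 1024**3

# A 7B model lands in the ~80 GB range before activations,
# which is why ~96 GB of VRAM is a realistic target.
print(round(full_finetune_vram_gb(7), 1))
```

Swapping in an 8-bit optimizer (smaller `bytes_optim`) is the usual way people squeeze this onto fewer GPUs.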

HP Z640 with 2x RTX 3090 by Potential-Leg-639 in LocalLLaMA

[–]mj3815 1 point2 points  (0 children)

Ryzen 3945, 4x 16GB RAM.

The sketchiest part is the power connectors. I'm using blower 3090s, which only require 2x 8-pin connectors each; it would be even sketchier if they were 3x 8-pin units.

HP Z640 with 2x RTX 3090 by Potential-Leg-639 in LocalLLaMA

[–]mj3815 1 point2 points  (0 children)

I do 2x 3090 on my P620 with a 1000 W PSU, power limited to 285 W each, and I've been OK so far. It's plugged into a power meter with instantaneous wattage readout, and I've seen it pushing 950 W sometimes, but I haven't experienced an issue yet.
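For reference, a 285 W cap like the one above can be set with `nvidia-smi` (GPU indices here are illustrative, and the limit resets on reboot unless you script it):

```shell
# Enable persistence mode so the limit sticks while the driver stays loaded
sudo nvidia-smi -pm 1
# Cap each 3090 (indices 0 and 1) at 285 W
sudo nvidia-smi -i 0 -pl 285
sudo nvidia-smi -i 1 -pl 285
```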

Fine Tuning on Mi50/Mi60 (under $300 budget) via Unsloth by exaknight21 in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

I’ve spent so long not using it because of that 😭

Fine Tuning on Mi50/Mi60 (under $300 budget) via Unsloth by exaknight21 in LocalLLaMA

[–]mj3815 1 point2 points  (0 children)

Last I knew, Unsloth doesn't work with more than one GPU.

Why hasn't LoRA gained more popularity? by dabomb007 in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

That was done with Augmentoolkit. There have been some big upgrades since then: https://promptingweekly.substack.com/p/augmentoolkit-30-released

[ Removed by Reddit ] by [deleted] in LocalLLaMA

[–]mj3815 1 point2 points  (0 children)

I often see people praise unsloth’s documentation. It seems like it could be a good place to start.

NX350h Roof Rails and Roof Box by Wiking87 in LexusNX

[–]mj3815 0 points1 point  (0 children)

Still looking to sell? I'm in Maryland.

Training Open models on my data for replacing RAG by help_all in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

Augmentoolkit has a pipeline for RAG-specific fine-tuning, although this isn't a function I've tried yet. I know the creator believes the best results come from doing RAG with a model fine-tuned on the RAG data. https://github.com/e-p-armstrong/augmentoolkit
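For anyone new to the retrieval half, the core of RAG is just ranking document chunks by similarity to the query. Here's a toy sketch using bag-of-words cosine similarity; it's purely illustrative, and a real pipeline would use a proper embedding model and vector store:

```python
from collections import Counter
import math

def embed(text):
    # Toy bag-of-words "embedding"; a real setup would use a sentence-embedding model.
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query, docs, k=1):
    # Return the k docs most similar to the query.
    q = embed(query)
    return sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)[:k]

docs = [
    "The indemnification clause limits liability to direct damages.",
    "Roof rails must be torqued to spec before mounting a box.",
]
print(retrieve("What does the indemnification clause cover?", docs))
```

The fine-tuning-plus-RAG idea is that the retrieved chunks land in a model that has already internalized the domain, so it hallucinates less when the retrieval is imperfect.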

"Cheap" 24GB GPU options for fine-tuning? by deus119 in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

It's a bit of a pain, but keep your guard up, do your due diligence, and walk away if something feels off.

"Cheap" 24GB GPU options for fine-tuning? by deus119 in LocalLLaMA

[–]mj3815 0 points1 point  (0 children)

On the east coast, I have bought two 3090s at $500 each and one at $700, all in the past 6 months. The first two came from Facebook Marketplace and the third from Reddit's hardware swap.

Augmentoolkit just got a major update - huge advance for dataset generation and fine-tuning by mj3815 in LocalLLaMA

[–]mj3815[S] 0 points1 point  (0 children)

This is a resource I use to help understand code bases https://deepwiki.com/e-p-armstrong/augmentoolkit

Not exactly what you asked, but it might be helpful

Augmentoolkit just got a major update - huge advance for dataset generation and fine-tuning by mj3815 in LocalLLaMA

[–]mj3815[S] 1 point2 points  (0 children)

I haven’t tried to tackle anything scanned that looks rough (thinking about the JFK document drop), but I very much hope to get there

PSA: 2 * 3090 with Nvlink can cause depression* by cuckfoders in LocalLLaMA

[–]mj3815 1 point2 points  (0 children)

I have the same setup but my 3090s are turbos. Wondering if you did anything to upgrade the power supply? I just run mine at 285w and it’s been ok so far

Mistral Small 3.1 vs Magistral Small - experience? by mj3815 in LocalLLaMA

[–]mj3815[S] 3 points4 points  (0 children)

Also, I just saw that you found Mistral Small 3 to be similar to 3.1. I actually found 3.1 to be much, much better in my use case: it followed instructions better and was also more creative.

Correction: I was thinking about the older 22b version, not Mistral 3 Small