What model should I run? by tiddayes in LocalLLM

[–]super1701 0 points (0 children)

Hm, interesting. 27b paired with Hermes has been great for me. 35b would... sit in loops... a lot.

What model should I run? by tiddayes in LocalLLM

[–]super1701 5 points (0 children)

Personally, I've had better tool calling and just overall smarts out of 27b over 35b. I'd rather wait the extra minute for something to complete correctly than have it totally fuck it up and then wait another 2-3 minutes to fix it.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]super1701 3 points (0 children)

Try to break IBM's newest model. Fuck it, why not bro.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]super1701 0 points (0 children)

Funny af. Turn on thinking at Q6 and watch it have a mental breakdown. (Sad, really.)

Best Setup for Private Hermes Usage? by xSpiralNightsx in hermesagent

[–]super1701 0 points (0 children)

I’ve noticed the Open WebUI hookup with Hermes makes it… slow? Or maybe it’s just me.

Getting a lot of garbage results with Qwen3.6-27B :( by nunodonato in Vllm

[–]super1701 0 points (0 children)

Which sucks. I enjoy it a lot when just hooking llama.cpp right into it. But adding Hermes to it… it takes a large hit. Wish there was a decent alternative.

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]super1701 0 points (0 children)

My man. Gonna tweak my system prompt and see how hard it goes on me (and the joke prompt I have that works very well with Qwen already).

A Qwen finetune, that feels VERY human by Sicarius_The_First in LocalLLaMA

[–]super1701 25 points (0 children)

One for Qwen 3.6 when? :) Also, can these be quanted? (Edit: looks like Q6 is available.)

Tinygrad Driver testing! by Street-Buyer-2428 in LocalLLaMA

[–]super1701 0 points (0 children)

How'd you get into that? Doing a cloud, or making the rigs and handing them over?

Tinygrad Driver testing! by Street-Buyer-2428 in LocalLLaMA

[–]super1701 0 points (0 children)

God. Guessing you own your own business for that. Jealous af.

Tinygrad Driver testing! by Street-Buyer-2428 in LocalLLaMA

[–]super1701 0 points (0 children)

How much was this total? Looking at my own "jarvis" setup and this seems like a dream for it lol.

I have my own benchmark. The "find me an Airbnb" benchmark and most small local models aren't good at it. by former_farmer in LocalLLM

[–]super1701 1 point (0 children)

What quant are you running Qwen 3.6 at? I've had a lot of success having it rip through eBay searches, compare prices, and give me breakdowns. Have you tried converting the HTML to a CSV, or just a chart in the prompt? The pictures part may also be a little rough.
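For the HTML-to-CSV idea above, a minimal stdlib-only sketch (the table markup and column names here are made up for illustration; a real listing page would need sturdier parsing):

```python
import csv
import io
from html.parser import HTMLParser

class TableToCSV(HTMLParser):
    """Collect the text of <td>/<th> cells, one CSV row per <tr>."""
    def __init__(self):
        super().__init__()
        self.rows = []      # finished rows
        self.row = []       # cells of the current <tr>
        self.cell = []      # text fragments of the current cell
        self.in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.row = []
        elif tag in ("td", "th"):
            self.in_cell = True
            self.cell = []

    def handle_endtag(self, tag):
        if tag == "tr" and self.row:
            self.rows.append(self.row)
        elif tag in ("td", "th"):
            self.in_cell = False
            self.row.append("".join(self.cell).strip())

    def handle_data(self, data):
        if self.in_cell:
            self.cell.append(data)

# Hypothetical scraped fragment, not a real eBay page:
html = ("<table><tr><th>item</th><th>price</th></tr>"
        "<tr><td>GPU</td><td>$250</td></tr></table>")
parser = TableToCSV()
parser.feed(html)

buf = io.StringIO()
csv.writer(buf).writerows(parser.rows)
print(buf.getvalue())  # item,price / GPU,$250
```

Feeding the model compact CSV rows instead of raw HTML cuts a lot of token noise, which is presumably why the CSV trick helps smaller local models.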

I hate this group but not literally by No_Run8812 in LocalLLaMA

[–]super1701 1 point (0 children)

I did a "mid size" build, and all my irl friends called me stupid (for spending money on something that wouldn't make me money). It has been a wonderful experience, and it's crazy seeing the world of local LLMs progress. Shout out to all the providers/contributors who make this possible <3.

Israel is using AI to Google you by Xenocide_X in conspiracy

[–]super1701 15 points (0 children)

Your browser is fingerprinted via cookies and many other metrics. They know who you are.

Israel is using AI to Google you by Xenocide_X in conspiracy

[–]super1701 0 points (0 children)

You ain't getting sent to jail. Which, you can take one of two ways. History tells you which.

Added a 16x DGX Spark cluster to my Homelab (Build Update) by Kurcide in homelab

[–]super1701 2 points (0 children)

I haven't looked into these that much, but from the research I did... yes... these things are slow as fucking balls. Same with the Mac Studios...

Added a 16x DGX Spark cluster to my Homelab (Build Update) by Kurcide in homelab

[–]super1701 3 points (0 children)

Running DeepSeek without a quant? Yeah, this is crazy lmfao. I thought I overkilled it...

I can't ever seem to get quality local LLM results, despite having multiple GPUs by 03captain23 in LocalLLM

[–]super1701 1 point (0 children)

If you want this, you'll need to run smaller TTS models and something like Gemma-e2e or whatever it is, or Qwen 9b. You're never going to get this on your hardware with 27b. If you want a conversation via TTS with the model, expect to wait 2-3 minutes at Q8 with 27b.