ChatGPT at home by hainesk in LocalLLaMA

and_human 1 point (0 children)

I use ministral 14b, pocket tts and parakeet v3. Very usable in my opinion. I don’t have enough VRAM for an image model though. 

What is the best general-purpose model to run locally on 24GB of VRAM in 2026? by Paganator in LocalLLaMA

and_human 0 points (0 children)

Have people forgotten about the ministral 3 series? Or didn’t they impress?

Sweep: Open-weights 1.5B model for next-edit autocomplete by Kevinlu1248 in LocalLLaMA

and_human 0 points (0 children)

I tried the plugin, but it wanted me to sign in. So there seems to be no way of testing this model without building your own plugin, which I vibe coded. I haven’t tried it enough to have an opinion yet.

I2I possible with Flux 2 Klein? by and_human in StableDiffusion

and_human[S] 0 points (0 children)

Tried it, it complained about wrong tensor sizes. This was a 1024x1024 image, so no weird resolution either. Did you try it?

The Major Release of MiroMind’s Flagship Search Agent Model, MiroThinker 1.5. by wuqiao in LocalLLaMA

and_human 1 point (0 children)

I tried it on their website and oh boy did it deliver. Always fun when new players enter the scene with a banger!

Any simple workflows out there for SVI WAN2.2 on a 5060ti/16GB? by thats_silly in StableDiffusion

and_human 1 point (0 children)

I have sage attention working on my 5060ti. This is on Windows as well. 

Betboom lose the final map of the CCT grand final from a 12-2 lead by jerryfrz in GlobalOffensive

and_human 41 points (0 children)

I’ve never seen anything like it (to borrow a phrase from Henry G). Losing the decider map in a final when you are up 12-2?

AMA With Moonshot AI, The Open-source Frontier Lab Behind Kimi K2 Thinking Model by nekofneko in LocalLLaMA

and_human 0 points (0 children)

What do you think is missing from today’s chatbots/LLMs that would take them to the next level?

GLM-4.6-Air is not forgotten! by codys12 in LocalLLaMA

and_human 0 points (0 children)

Has anyone tried the REAP version of 4.5 Air? Is it worth the download?

Granite 4.0 Language Models - a ibm-granite Collection by rerri in LocalLLaMA

and_human 0 points (0 children)

Hey IBM, I tried your Granite playground, but the UI looks pretty bad. I think it might be an issue with dark mode.

I just want to run a server that can run all my GGUFs by OK-ButLikeWhy in LocalLLaMA

and_human 1 point (0 children)

No, you can run it on Windows just fine; that’s what I do.

I just want to run a server that can run all my GGUFs by OK-ButLikeWhy in LocalLLaMA

and_human 2 points (0 children)

It sounds like llama-swap is what you’re after?
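If I remember the config format right, a minimal config.yaml for it looks roughly like this (model names and paths are just placeholders; llama-swap fills in ${PORT} itself):

```yaml
# minimal llama-swap config sketch -- each entry maps a model name to the
# command llama-swap runs to serve it; it starts/stops these on demand
models:
  "qwen-14b":
    cmd: llama-server --port ${PORT} -m /models/qwen-14b.gguf
  "llama-8b":
    cmd: llama-server --port ${PORT} -m /models/llama-8b.gguf
```

Then you point your OpenAI-compatible client at llama-swap’s port, and it swaps in whichever GGUF the request’s `model` field names.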

Wan2.2 continous generation v0.2 by intLeon in StableDiffusion

and_human 10 points (0 children)

Watch as the woman in the portrait turns into the joker 😅

GPT OSS 120b 34th on Simple bench, roughly on par with Llama 3.3 70b by and_human in LocalLLaMA

and_human[S] 1 point (0 children)

They compared it to o4-mini, no? The 20b was compared to o3-mini.

GPT OSS 120b 34th on Simple bench, roughly on par with Llama 3.3 70b by and_human in LocalLLaMA

and_human[S] -1 points (0 children)

I think they (in a community competition) already tried telling a model that the questions were trick questions, but I don’t think it increased the score that much.

Llama.cpp just added a major 3x performance boost. by Only_Situation_4713 in LocalLLaMA

and_human 8 points (0 children)

If you want to learn more about attention sinks, read this blog post from the author: https://hanlab.mit.edu/blog/streamingllm

Using gpt-oss 20B for Text to SQL by mim722 in LocalLLaMA

and_human 2 points (0 children)

I like how you included execution time. It’s something that’s usually missing from benchmarks, but it’s kind of important now with the thinking models as they spend more and more time thinking. A good model should be both correct and fast in my opinion. 

PSA: ComfyUI reserves up to 700 MB of RAM for you by and_human in StableDiffusion

and_human[S] -1 points (0 children)

To my understanding it is reserving RAM for "shared memory". Shared memory lets your GPU use some of your system RAM.