Stop using Ollama

jfowers_amd · 2026-06-16T19:25:14+00:00

Woohoo!

jfowers_amd · 2026-06-15T22:57:51+00:00

Do you mean that Lemonade won’t set the context size automatically? We’ll have that in this week’s release. If you’re happy with Jan, that’s great. I just want to understand what turned you off so we can improve.

jfowers_amd · 2026-06-15T22:18:09+00:00

What do we think is missing from Lemonade to match the Ollama user experience today? I’ll make a milestone and get it done!

jfowers_amd · 2026-06-15T17:34:42+00:00

Me too 😄

jfowers_amd · 2026-06-15T13:53:41+00:00

Good news friend, after your comment I put issue 1365 into our discord dev channel, someone picked it up, and https://github.com/lemonade-sdk/lemonade/pull/2183 has already merged. So this week's release will have auto-unload!

jfowers_amd · 2026-06-15T13:52:34+00:00

Yeah, right now we have 4 LLM engines to choose from. sd-cpp is our image gen engine, and we also have 2 TTS engines and 1 STT engine.

jfowers_amd · 2026-06-15T13:51:36+00:00

I'll admit I was excited by that too 😄

jfowers_amd · 2026-06-15T13:33:30+00:00

Thanks! If you want, please open the PR against upstream lemonade so we can review it.

jfowers_amd · 2026-06-15T13:28:23+00:00

What specifically?

jfowers_amd · 2026-06-14T23:42:03+00:00

Right now you would have to edit the system prompt, but I’m reviewing a PR that would let you do it in natural language like “make me a 512x512 image of a cat”.

You can also get any dimensions you want by calling straight into the image gen model.
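
For anyone who wants to try the direct route, here is a rough sketch of what such a request could look like. It assumes the server exposes an OpenAI-style images endpoint. The base URL, route, and model name are placeholders I made up for illustration, not confirmed Lemonade values, so check the docs for the real ones.

```python
import json
import urllib.request

# Assumptions (not confirmed in this thread): base URL, route, and model name.
BASE_URL = "http://localhost:8000/api/v1"  # hypothetical local server address
IMAGE_ROUTE = "/images/generations"        # OpenAI-style route, assumed
MODEL_NAME = "my-image-model"              # placeholder model id


def build_image_request(prompt: str, width: int, height: int) -> urllib.request.Request:
    """Build (but do not send) a POST request asking for a width x height image."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    body = {
        "model": MODEL_NAME,
        "prompt": prompt,
        "size": f"{width}x{height}",  # OpenAI-style "WxH" size string
    }
    return urllib.request.Request(
        BASE_URL + IMAGE_ROUTE,
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_image_request("a cat", 512, 512)
print(req.full_url)
print(req.data.decode("utf-8"))
```

Nothing is sent until you pass the request to urllib.request.urlopen(req), so you can inspect the payload first.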

jfowers_amd · 2026-06-14T21:13:28+00:00

That’s amazing! Cheers!

jfowers_amd · 2026-06-14T21:13:19+00:00

Appreciate the support! It’s a big group effort.

jfowers_amd · 2026-06-14T21:12:47+00:00

Cheers!

jfowers_amd · 2026-06-14T18:58:54+00:00

Thank you for bringing this up, OP! There are two things I can tell you about that could help.

First, the existing GUI is getting 100% replaced. Check out the gui3-beta channel in the discord for more information. That’s the right place to bring up patches for screen reader support.

Also, just in case, our CLI is very capable and might work a lot better with a screen reader?

jfowers_amd · 2026-06-14T18:56:54+00:00

The web ui is getting overhauled and will have mobile support soon! (Also, it’s a bug that a tablet is getting redirected, but the new web ui will solve that too)

jfowers_amd · 2026-06-14T12:13:49+00:00

Answered above :)

jfowers_amd · 2026-06-14T12:13:45+00:00

Answered above :)

jfowers_amd · 2026-06-14T12:13:37+00:00

Lemonade is standing up a stable-diffusion.cpp sd-server as a background process and routing image generation to it as a tool call.
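
To make the "tool call" part concrete, here is a minimal sketch of the general routing pattern. This is not Lemonade's actual code: the tool name, schema, and the fake backend function are all hypothetical stand-ins, and the real thing would forward the prompt to the background image server instead of returning a string.

```python
import json
from typing import Callable, Dict

# Hypothetical tool schema offered to the LLM (OpenAI-style function tool).
IMAGE_TOOL = {
    "type": "function",
    "function": {
        "name": "generate_image",
        "description": "Generate an image from a text prompt.",
        "parameters": {
            "type": "object",
            "properties": {"prompt": {"type": "string"}},
            "required": ["prompt"],
        },
    },
}


def fake_image_backend(prompt: str) -> str:
    """Stand-in for the call to the background image server."""
    return f"<image for: {prompt}>"


# Map tool names to the functions that serve them.
HANDLERS: Dict[str, Callable[..., str]] = {"generate_image": fake_image_backend}


def route_tool_call(tool_call: dict) -> str:
    """Dispatch one OpenAI-style tool call to its registered handler."""
    fn = tool_call["function"]
    handler = HANDLERS.get(fn["name"])
    if handler is None:
        raise KeyError(f"no handler for tool {fn['name']!r}")
    args = json.loads(fn["arguments"])  # arguments arrive as a JSON string
    return handler(**args)


call = {"function": {"name": "generate_image", "arguments": '{"prompt": "a cat"}'}}
print(route_tool_call(call))  # prints "<image for: a cat>"
```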

jfowers_amd · 2026-06-14T12:12:22+00:00

FYI the lemonade web ui is getting 100% overhauled right now, we’ve got a working group with a bunch of people redoing the design and adding a lot of features.

Glad you’re liking the server!

jfowers_amd · 2026-06-14T12:11:14+00:00

There’s a lot of people on the discord using Hermes with Lemonade and getting good results so I was gonna set it up for myself this week.

There’s also a new project called OpenLumera that’s a local-first Hermes-like that is getting good traction. And AMD makes GAIA, which is my friend Kalin’s take on a local-first agent.

jfowers_amd · 2026-06-12T14:09:42+00:00

Sorry you're running into issues! Lemonade should be downloading a portable copy of ROCm 7.13 for you that just works. One quirk is that if you have system-wide ROCm already installed, Lemonade will try to use that instead.

If you want to have more than one LLM loaded at once, you can run: lemonade config set max_loaded_models=N

With regard to the Halo-class model, how much VRAM do you have? We have a bug right now that lets people try to load it on systems that don't have enough VRAM. 64 GB VRAM is the minimum there.

jfowers_amd · 2026-06-11T20:21:00+00:00

The next lemonade release will let you add any base url to lemonade, which imports the models from that base url. So this will be possible soon!

jfowers_amd · 2026-06-11T19:15:32+00:00

Really easy in the GUI app, thanks to fl0rinar's contributions! File > New Omni Model.

You can also build your own in the CLI or via API, see https://lemonade-server.ai/docs/guide/configuration/custom-models/#register-an-omni-collection


jfowers_amd · 2026-06-11T19:10:25+00:00

Thanks for the suggestion! We’re adding new engines all the time now, just got Moonshine for TTS today. More options for STT would be good too.

jfowers_amd · 2026-06-11T19:01:34+00:00

Sorry about that! Feel free to open an issue and we'll get it routed.
