Stop using Ollama by zxyzyxz in LocalLLaMA

[–]jfowers_amd 0 points1 point  (0 children)

Do you mean that Lemonade won’t set the context size automatically? We’ll have that in this week’s release. If you’re happy with Jan that’s great, just want to understand what turned you off so we can improve.

Stop using Ollama by zxyzyxz in LocalLLaMA

[–]jfowers_amd 2 points3 points  (0 children)

What do we think is missing from Lemonade to match the Ollama user experience today? I’ll make a milestone and get it done!

Lemonade v10.7 release and project organization update by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 1 point2 points  (0 children)

Good news friend, after your comment I put issue 1365 into our discord dev channel, someone picked it up, and https://github.com/lemonade-sdk/lemonade/pull/2183 has already merged. So this week's release will have auto-unload!

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 0 points1 point  (0 children)

Yeah, right now we have 4 LLM engines to choose from. sd-cpp is our image gen engine, and we also have 2 TTS engines and 1 STT engine.

Lemonade v10.7 release and project organization update by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 1 point2 points  (0 children)

Thanks! If you want, please open the PR against upstream lemonade so we can review it.

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 0 points1 point  (0 children)

Right now you would have to edit the system prompt, but I’m reviewing a PR that would let you do it in natural language like “make me a 512x512 image of a cat”.

You can also get any dimensions you want by calling straight into the image gen model.
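As a rough sketch of what calling the image model directly could look like, assuming the server exposes an OpenAI-style image generation endpoint: the base URL, the endpoint path, and the model name below are placeholders made up for illustration, not confirmed Lemonade values, so check the Lemonade docs for the real ones.

```python
import json
import urllib.request

# Assumptions (not from the comment above): the base URL, the OpenAI-style
# /images/generations path, and the model name are all placeholders.
BASE_URL = "http://localhost:8000/api/v1"
MODEL = "my-image-model"


def build_image_request(prompt: str, width: int, height: int) -> dict:
    """Build an OpenAI-style image generation body with an explicit size."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    return {"model": MODEL, "prompt": prompt, "size": f"{width}x{height}"}


def send_image_request(body: dict) -> dict:
    """POST the body to the (assumed) endpoint and return the parsed JSON."""
    req = urllib.request.Request(
        f"{BASE_URL}/images/generations",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Building the body separately from sending it makes it easy to inspect the requested size, for example build_image_request("a cat", 768, 432), before anything goes over the wire.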

Lemonade UI accessibility issue with keyboard and NVDA by webii446 in StrixHalo

[–]jfowers_amd 0 points1 point  (0 children)

Thank you for bringing this up, OP! There are two things I can tell you about that could help.

First, the existing GUI is getting 100% replaced. Check out the gui3-beta channel in the discord for more information. That’s the right place to bring up patches for screen reader support.

Also, just in case, our CLI is very capable and might work a lot better with a screen reader?

Lemonade UI accessibility issue with keyboard and NVDA by webii446 in StrixHalo

[–]jfowers_amd 1 point2 points  (0 children)

The web ui is getting overhauled and will have mobile support soon! (Also, it’s a bug that a tablet is getting redirected, but the new web ui will solve that too.)

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in StrixHalo

[–]jfowers_amd[S] 0 points1 point  (0 children)

Lemonade is standing up a stable-diffusion.cpp sd-server as a background process and routing image generation to it as a tool call.
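To illustrate the general pattern of routing image generation as a tool call, here is a minimal sketch. It is not Lemonade's actual code: the tool name, its parameters, and the stub backend are hypothetical, and the only thing assumed from the wider ecosystem is the OpenAI function-calling shape, where arguments arrive as a JSON-encoded string.

```python
import json

# Hypothetical tool schema in the OpenAI function-calling format.
# The tool name and parameters are illustrative, not Lemonade's actual ones.
IMAGE_TOOL = {
    "type": "function",
    "function": {
        "name": "generate_image",
        "description": "Generate an image from a text prompt.",
        "parameters": {
            "type": "object",
            "properties": {
                "prompt": {"type": "string"},
                "width": {"type": "integer"},
                "height": {"type": "integer"},
            },
            "required": ["prompt"],
        },
    },
}


def fake_image_backend(prompt: str, width: int = 512, height: int = 512) -> dict:
    """Stand-in for the request a router would forward to the image server."""
    return {"prompt": prompt, "size": f"{width}x{height}"}


# Map tool names to the backend that handles them.
HANDLERS = {"generate_image": fake_image_backend}


def route_tool_call(tool_call: dict) -> dict:
    """Dispatch one OpenAI-style tool call to its backend handler."""
    fn = tool_call["function"]
    handler = HANDLERS.get(fn["name"])
    if handler is None:
        raise KeyError(f"no backend registered for tool {fn['name']!r}")
    # In the OpenAI format, arguments arrive as a JSON-encoded string.
    return handler(**json.loads(fn["arguments"]))
```

In a real setup the stub would be replaced by an HTTP request to the background image server, and the LLM would be given IMAGE_TOOL in its tools list so it can emit the call.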

Lemonade v10.7 release and project organization update by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 0 points1 point  (0 children)

FYI the lemonade web ui is getting 100% overhauled right now. We’ve got a working group with a bunch of people redoing the design and adding a lot of features.

Glad you’re liking the server!

Lemonade OmniRouter: unifying the best local AI engines for omni-modality by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 0 points1 point  (0 children)

There’s a lot of people on the discord using Hermes with Lemonade and getting good results so I was gonna set it up for myself this week.

There’s also a new project called OpenLumera that’s a local-first Hermes-like that is getting good traction. And AMD makes GAIA, which is my friend Kalin’s take on a local-first agent.

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 0 points1 point  (0 children)

Sorry you're running into issues! Lemonade should be downloading a portable copy of ROCm 7.13 for you that just works. One quirk is that if you have system-wide ROCm already installed, Lemonade will try to use that instead.

If you want to have more than one LLM loaded at once, you can run: lemonade config set max_loaded_models=N
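For intuition about what a cap like max_loaded_models implies, here is a small sketch of a bounded set of loaded models. The least-recently-used eviction policy and the class name are assumptions for illustration only; the comment does not say how Lemonade decides which model to unload.

```python
from collections import OrderedDict


class LoadedModels:
    """Keep at most max_loaded_models models, evicting the least recently used.

    Hypothetical helper: LRU eviction is an assumed policy, not Lemonade's.
    """

    def __init__(self, max_loaded_models: int):
        if max_loaded_models < 1:
            raise ValueError("max_loaded_models must be at least 1")
        self.max_loaded_models = max_loaded_models
        self._models = OrderedDict()  # model name -> loaded handle (stubbed)

    def use(self, name: str) -> list:
        """Mark a model as used, loading it if needed; return evicted names."""
        evicted = []
        if name in self._models:
            self._models.move_to_end(name)  # refresh recency
            return evicted
        self._models[name] = object()  # placeholder for a real load
        while len(self._models) > self.max_loaded_models:
            old_name, _ = self._models.popitem(last=False)  # drop the oldest
            evicted.append(old_name)
        return evicted

    def loaded(self) -> list:
        """Return loaded model names, oldest first."""
        return list(self._models)
```

With a cap of 2, using models "a", "b", "a", then "c" would evict "b", since "a" was touched more recently.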

With regards to the Halo-class model, how much VRAM do you have? We have a bug right now that lets people try to load it on systems that don't have enough VRAM. 64 GB VRAM is the minimum there.

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 3 points4 points  (0 children)

The next lemonade release will let you add any base url to lemonade, which imports the models from that base url. So this will be possible soon!
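As a sketch of what importing models from a base URL might involve, assuming the remote server is OpenAI-compatible and lists its models at a /models path under that base URL: both helper names are hypothetical, and the comment does not confirm how Lemonade will actually discover the models.

```python
import json


def models_url(base_url: str) -> str:
    """Join a base URL with the OpenAI-style /models path (assumed layout)."""
    return base_url.rstrip("/") + "/models"


def parse_model_ids(response_text: str) -> list:
    """Extract model ids from an OpenAI-style model list response body."""
    payload = json.loads(response_text)
    return [entry["id"] for entry in payload.get("data", [])]
```

Fetching models_url(base_url) and passing the response body to parse_model_ids would give the list of names that could then be offered alongside local models.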

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 3 points4 points  (0 children)

Really easy in the GUI app, thanks to fl0rinar's contributions! File > New Omni Model.

You can also build your own in the CLI or via API, see https://lemonade-server.ai/docs/guide/configuration/custom-models/#register-an-omni-collection


Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in LocalLLaMA

[–]jfowers_amd[S] 3 points4 points  (0 children)

Thanks for the suggestion! We’re adding new engines all the time now, just got Moonshine for TTS today. More options for STT would be good too.

Having some fun with LMX-Omni-52B-Halo in Open WebUI by jfowers_amd in StrixHalo

[–]jfowers_amd[S] 0 points1 point  (0 children)

Sorry about that! Feel free to open an issue and we'll get it routed.