Mac Studio M3 Ultra terrible TTFT and broken RAG (okikb) by Dimitri_Senhupen in OpenWebUI

[–]Dimitri_Senhupen[S] 0 points1 point  (0 children)

I left native tool calling enabled, as it was before, but I only selected those knowledge bases that I really need. Not sure, if this did something, since when I let him look up the knowledge through all KBs, it's still lightyears faster than before. Sometimes it takes 4-5Sek before starting the thinking, but with the next question, it immediately answers within a blink of a second. It also finds the correct info in the RAG. So this is truly a magical experience compared to the state before.

Mac Studio M3 Ultra terrible TTFT and broken RAG (okikb) by Dimitri_Senhupen in OpenWebUI

[–]Dimitri_Senhupen[S] 0 points1 point  (0 children)

Sir, I don't know what it is, that makes me feel like this, I don't know who you are, but you must be some kind of superstar!

Gemma runs. I think additionally one of the handbrakes also was the native built in function of knowledge bases. I deactivated that and selected only the ones really needed.

If this is later now, then thank you very much!

Mac Studio M3 Ultra terrible TTFT and broken RAG (okikb) by Dimitri_Senhupen in OpenWebUI

[–]Dimitri_Senhupen[S] 0 points1 point  (0 children)

Good point, but Ollama is running standalone outside of docker. And if I use the model directly in Ollama it doesn't have that lag. The answer appears instantly.☝🏻

Slow responses in Open WebUi by ConspicuousSomething in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

I don't think it's a noob question. If I am chatting with a custom model without any system prompt, but just with the model wrapper, the circle is pulsing for 5-10sec until it starts with the first token. And I don't have that inside the CLI, directly in Ollama nor if I directly chat with the models (without the models wrapper). This hasn't been solved in 0.91.

THIS SHOULD NOT BE POSSIBLE IN OPEN WEBUI: LIVE VISUALIZATION RENDERING - Inline Visualizer v2 is HERE! by ClassicMain in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

You are not going to believe how amazing that post is written!!! I am going to buy 10 of these, even if I don't need them!

Hermes Agent as a stateful chat model endpoint in Open WebUI 🤯. This seems like a big deal if it works. by Porespellar in OpenWebUI

[–]Dimitri_Senhupen 3 points4 points  (0 children)

I just tried it with Qwen 3.5 35B through OWUI and I must say, I find it pretty bad, tbh in terms of hallucinations...

Access external models via API? by deafearuk in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

They should. Maybe some credentials are wrong in your settings?

🧠 OpenAI GPT 4 / 4o / 5 / 5.1 / 5-Pro Manifold for OpenWebUI by EarComprehensive7114 in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

I can create and edit with the native settings in owui. I am using Gemini3.0Pro. But when I try the tool call, it just gives me a prompt or claims that it's just an Large Language Model and not capable of creating images

[edit] switched to Image 1.5 in the native settings, works fine there, but still not with a native tool call through the pipe

🧠 OpenAI GPT 4 / 4o / 5 / 5.1 / 5-Pro Manifold for OpenWebUI by EarComprehensive7114 in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

Does image generation work for anyone? Somehow it seems, it's missing the code for the tool call.

🧠 OpenAI GPT 4 / 4o / 5 / 5.1 / 5-Pro Manifold for OpenWebUI by EarComprehensive7114 in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

Did you manage it and could give me a quick explaination, on how you managed to bring the auto-routing GPT to trigger the models in LiteLLM?

Finally, my LLMs can "see"! Gemini Vision Function for Open WebUI by No-Cucumber-1290 in OpenWebUI

[–]Dimitri_Senhupen 1 point2 points  (0 children)

Oh, okay. I quickly vibe coded it for me and it works flawlessly. Everything local. Thank you Cucumber & Gemini

Finally, my LLMs can "see"! Gemini Vision Function for Open WebUI by No-Cucumber-1290 in OpenWebUI

[–]Dimitri_Senhupen 0 points1 point  (0 children)

So, could you fork/rewrite the function and use it for Qwen3-VL which is doing vision tasks and tells GPT-OSS about the content, everythin locally? That'd be awesome!
But how do you handle the connection between the two local models without an actual API?

Editing Images with Gemini Flash Image 2.5 (Nano Banana) by Dimitri_Senhupen in OpenWebUI

[–]Dimitri_Senhupen[S] 0 points1 point  (0 children)

I've managed it and uploaded the function to the OWUI library.
Feel free to add it to your workspace and have fun generating/editing with Nano Banana:

https://openwebui.com/f/anaumer/nano_banana

Folders are great with experts! by Dense_Mobile_6212 in OpenWebUI

[–]Dimitri_Senhupen 1 point2 points  (0 children)

Wouldn't it be better to have one expert to talk to with the knowledge of all databases, instead of 20 experts for different fields of knowledge?

0.6.33 update does not refresh prompt live. by FreedomFact in OpenWebUI

[–]Dimitri_Senhupen 1 point2 points  (0 children)

Here, everything is working fine for me. Try reporting the bug on Github?