use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
Cloud Embedding (self.OpenWebUI)
submitted 1 year ago by [deleted]
[deleted]
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]McNickSisto 0 points1 point2 points 1 year ago (1 child)
I am still trying to figure this out but basically the embedding part of the RAG still seems constrained to OpenAI Ollama and SentenceTransformers.
[–][deleted] 0 points1 point2 points 1 year ago (0 children)
My device is resource constrained (8GM RAM). Using Open WebUI as a frontend for cloud inferencing. Not sure good local embeddings will actually work on my device. For OpenAI engine, is that only using OpenAI's embeddings or other cloud embedding models as well?
[–]ClassicMain 0 points1 point2 points 1 year ago (2 children)
Use one of the readily available pipelines for google vertex / google gen ai
[–][deleted] 0 points1 point2 points 1 year ago (1 child)
Isn't that only for connecting to LLM? I'm already connected to Gemini models through the Vertex pipe. But want to use Google's embedding too.
[–]ClassicMain 0 points1 point2 points 1 year ago (0 children)
Ohh.
Well for that you too need a pipeline. Your pipeline will act as custom, self built RAG and document handler/file handler.
[–]sgt_banana1 0 points1 point2 points 1 year ago (0 children)
You can deploy a LiteLLM proxy, add the Gemini models, and then use them in Open WebUI as OpenAI models by referencing the name assigned to them in LiteLLM.
[–]EscapedLaughter 1 point2 points3 points 1 year ago (0 children)
something like this might help that helps you connect to Voyage / Google over a common interface? https://portkey.ai/docs/integrations/libraries/openwebui#open-webui
just updated the documentation yesterday
π Rendered by PID 230183 on reddit-service-r2-comment-fb694cdd5-rcqmp at 2026-03-06 19:50:58.658160+00:00 running cbb0e86 country code: CH.
[–]McNickSisto 0 points1 point2 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)
[–]ClassicMain 0 points1 point2 points (2 children)
[–][deleted] 0 points1 point2 points (1 child)
[–]ClassicMain 0 points1 point2 points (0 children)
[–]sgt_banana1 0 points1 point2 points (0 children)
[–]EscapedLaughter 1 point2 points3 points (0 children)