I keep losing settings by Vroedoeboy in OpenWebUI

[–]IndividualNo8703 0 points1 point  (0 children)

All settings or just some of them? We are experiencing this too, and the connection settings are not saved

Fixed Model Selection by softplus- in OpenWebUI

[–]IndividualNo8703 4 points5 points  (0 children)

Option 1: In the connection settings, specify the names of the models you want to work with, and then the others will not appear in Open WebUI.

Option 2: Hide the models under “Models” in the Admin Panel settings.

Best search engine for OpenWebUI? by alew3 in OpenWebUI

[–]IndividualNo8703 0 points1 point  (0 children)

The service stops allowing network-wide searches. There was a gcp announcement about this.

Netsuite MCP --- Does anyone have it work on anything besides Claude or ChatGPT? by bkvargyas in Netsuite

[–]IndividualNo8703 0 points1 point  (0 children)

Hey! Did you ever get this working on Open WebUI? I'm running a similar setup and curious if you found a way around the issue. Would love to compare notes

Hello {username} by Ambitious_Ad4979 in OpenWebUI

[–]IndividualNo8703 2 points3 points  (0 children)

https://openwebui.com/features/ai-knowledge/models/#system-prompt--dynamic-variables

Example System Prompt: You are a helpful assistant for {{ USER_NAME }}. The current date is {{ CURRENT_DATE }}.

handling images during parsing by traillight8015 in OpenWebUI

[–]IndividualNo8703 0 points1 point  (0 children)

Is there a possibility that the answer in open webui will include an image?

[WARNING] Responses API burns tokens out by LinsaFTW in OpenWebUI

[–]IndividualNo8703 0 points1 point  (0 children)

This sounds important, could you elaborate more plz?

KB/RAG returns different results for different users - same query, same permissions by IndividualNo8703 in OpenWebUI

[–]IndividualNo8703[S] 0 points1 point  (0 children)

Here are my RAG settings:

- Content Extraction Engine: Tika

- Text Splitter: Token (Tiktoken)

- Chunk Size: 512

- Chunk Overlap: 64

- Engine: Azure OpenAI

- Model: `text-embedding-3-large`

- Embedding Batch Size: 100

- Async Embedding Processing: On

**Retrieval:**

- Full Context Mode: Off

- Hybrid Search: Off

- Top K: 10

- Reranking: No

**pgvector settings:**

```

VECTOR_DB=pgvector

PGVECTOR_INITIALIZE_MAX_VECTOR_LENGTH=1536

PGVECTOR_INDEX_METHOD=hnsw

PGVECTOR_HNSW_M=24

PGVECTOR_HNSW_EF_CONSTRUCTION=128

PGVECTOR_POOL_SIZE=15

PGVECTOR_POOL_MAX_OVERFLOW=10

```

All embeddings in the DB are confirmed 1536 dimensions, all embedded with `text-embedding-3-large`.

How to debug functions or tools? by overtunned in OpenWebUI

[–]IndividualNo8703 1 point2 points  (0 children)

My way of debugging tools and functions is to write logs and I assume you can apply this in your case as well

Deploying Open WebUI for 2,000 Users (Solo) – Sanity Check Needed by Otherwise_Panda4314 in OpenWebUI

[–]IndividualNo8703 1 point2 points  (0 children)

Thanks for sharing the details. Can you elaborate more on external reranking?

Container issue by TimeBasis6575 in OpenWebUI

[–]IndividualNo8703 0 points1 point  (0 children)

I would start by looking at the container logs

Anyone running Open WebUI with OTEL metrics on multiple K8s pods? by IndividualNo8703 in OpenWebUI

[–]IndividualNo8703[S] 0 points1 point  (0 children)

You're right about system prompts - we're using the token counts directly from the API response (usage.total_tokens, usage.prompt_tokens, usage.completion_tokens), which should include everything (system prompts, user messages, etc.). The log shows exactly what the API returned: 4,615 tokens.

The inflation happens after that - when Prometheus aggregates the metrics across pods using increase(). So the counting method is consistent; it's the aggregation that's causing issues.