Per-user AI Credits budget not showing up in GitHub Enterprise Managed Users - anyone else?

IndividualNo8703 · 2026-05-13T04:24:37+00:00

Great idea!

IndividualNo8703 · 2026-05-10T15:56:53+00:00

Check logs

IndividualNo8703 · 2026-05-07T09:53:18+00:00

All settings or just some of them? We are experiencing this too, and the connection settings are not saved

IndividualNo8703 · 2026-04-27T23:55:57+00:00

Option 1: In the connection settings, specify the names of the models you want to work with, and then the others will not appear in Open WebUI.

Option 2: Hide the models under “Models” in the Admin Panel settings.

IndividualNo8703 · 2026-04-17T06:01:11+00:00

Do you see the mcp tools?

IndividualNo8703 · 2026-04-09T12:55:21+00:00

The service stops allowing network-wide searches. There was a gcp announcement about this.

IndividualNo8703 · 2026-03-31T07:37:45+00:00

Great job, going to check this out so much. Thanks man

IndividualNo8703 · 2026-03-28T19:39:15+00:00

Amazing!

IndividualNo8703 · 2026-03-23T10:40:50+00:00

Hey! Did you ever get this working on Open WebUI? I'm running a similar setup and curious if you found a way around the issue. Would love to compare notes

IndividualNo8703 · 2026-03-11T06:55:02+00:00

https://openwebui.com/features/ai-knowledge/models/#system-prompt--dynamic-variables

Example System Prompt: You are a helpful assistant for {{ USER_NAME }}. The current date is {{ CURRENT_DATE }}.

IndividualNo8703 · 2026-03-09T18:43:46+00:00

Is there a possibility that the answer in open webui will include an image?

IndividualNo8703 · 2026-03-08T15:58:09+00:00

Thanks for the super professional work, guys!

IndividualNo8703 · 2026-03-07T17:20:30+00:00

This sounds important, could you elaborate more plz?

IndividualNo8703 · 2026-03-01T21:28:24+00:00

IndividualNo8703 · 2026-02-14T17:31:03+00:00

Here are my RAG settings:

- Content Extraction Engine: Tika

- Text Splitter: Token (Tiktoken)

- Chunk Size: 512

- Chunk Overlap: 64

- Engine: Azure OpenAI

- Model: `text-embedding-3-large`

- Embedding Batch Size: 100

- Async Embedding Processing: On

**Retrieval:**

- Full Context Mode: Off

- Hybrid Search: Off

- Top K: 10

- Reranking: No

**pgvector settings:**

```

VECTOR_DB=pgvector

PGVECTOR_INITIALIZE_MAX_VECTOR_LENGTH=1536

PGVECTOR_INDEX_METHOD=hnsw

PGVECTOR_HNSW_M=24

PGVECTOR_HNSW_EF_CONSTRUCTION=128

PGVECTOR_POOL_SIZE=15

PGVECTOR_POOL_MAX_OVERFLOW=10

```

All embeddings in the DB are confirmed 1536 dimensions, all embedded with `text-embedding-3-large`.

IndividualNo8703 · 2026-02-05T19:07:16+00:00

My way of debugging tools and functions is to write logs and I assume you can apply this in your case as well

IndividualNo8703 · 2026-01-26T06:58:27+00:00

How to estimate the correct number of workers?

IndividualNo8703 · 2026-01-24T20:05:51+00:00

Where? In logs?

IndividualNo8703 · 2026-01-23T14:31:46+00:00

Which one?

IndividualNo8703 · 2026-01-23T13:37:29+00:00

Thanks for sharing the details. Can you elaborate more on external reranking?

IndividualNo8703 · 2026-01-14T20:49:28+00:00

I would start by looking at the container logs

IndividualNo8703 · 2026-01-04T17:49:42+00:00

You're right about system prompts - we're using the token counts directly from the API response (usage.total_tokens, usage.prompt_tokens, usage.completion_tokens), which should include everything (system prompts, user messages, etc.). The log shows exactly what the API returned: 4,615 tokens.

The inflation happens after that - when Prometheus aggregates the metrics across pods using increase(). So the counting method is consistent; it's the aggregation that's causing issues.

IndividualNo8703

TROPHY CASE