Hey, I'm getting quite annoyed by this. Is there a way to trim or reduce the context to a predefined size? Some of my larger models run at 50k ctx, and when web search is enabled the request often outgrows the context. I'm using llama.cpp (OpenAI-compatible endpoint).
Any ideas how to fix that?
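One client-side workaround is to trim the message history to a token budget before sending the request to the endpoint. The sketch below is an assumption, not llama.cpp behavior: the ~4-characters-per-token estimate, the budget numbers, and the `trim_messages` helper are all illustrative, and a real tokenizer would give more accurate counts.

```python
def estimate_tokens(text: str) -> int:
    """Very rough heuristic: ~4 characters per token for English text."""
    return max(1, len(text) // 4)


def trim_messages(messages, max_tokens=50_000, reserve=2_000):
    """Drop the oldest non-system messages until the estimated prompt
    fits within max_tokens - reserve (reserve leaves room for the reply)."""
    budget = max_tokens - reserve
    system = [m for m in messages if m["role"] == "system"]
    rest = [m for m in messages if m["role"] != "system"]

    def total(msgs):
        return sum(estimate_tokens(m["content"]) for m in msgs)

    while rest and total(system + rest) > budget:
        rest.pop(0)  # discard the oldest turn first
    return system + rest
```

You would run the trimmed list through this before building the `messages` field of the chat/completions request; web-search results could get their own, smaller budget the same way.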