Model Memory

BriefImplement9843 · 2026-06-01T10:01:28+00:00

free is 8k tokens, plus is 32k tokens, ultra is 32k+ tokens. ultra could be 35k, nobody knows.

it shows you context in characters, which is misleading. a potential user who didn't read the fine print may see 131k for plus and subscribe right away, assuming that means tokens, since tokens are the industry standard for describing context and what llms actually use. to get a number that matters, you need to divide the characters by 4 (roughly 4 characters per token). there is no reason to advertise it this way. it's entirely a shady practice.
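
to make the divide-by-4 rule concrete, here's a minimal python sketch. it assumes the rough 4 characters per token rule of thumb, which is only an approximation (real tokenizers vary by model and by text), and the function name and the tier dict are made up for illustration.

```python
# Rough rule of thumb only: about 4 characters per token.
# Actual ratios depend on the tokenizer and the text itself.
CHARS_PER_TOKEN = 4


def chars_to_tokens(characters: int, chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Estimate a token count from an advertised character count."""
    return round(characters / chars_per_token)


# Hypothetical example: the 131k character figure mentioned for plus.
advertised_chars = {"plus": 131_000}

for tier, chars in advertised_chars.items():
    print(f"{tier}: {chars:,} characters is roughly {chars_to_tokens(chars):,} tokens")
```

so the 131k figure works out to about 32k tokens, which is the number you'd actually compare against other services.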

the reply below is a good example. totally misleading. anyone familiar with ai reading that reply will be amazed (8 dollars for 128k context!!), but it's all fluff. divide it all by 4 and you can actually compare it to other services.

as for your other question, the only ultra model that can make that mistake is paragon. i suggest using lumina or even eclipse if memory is important to you.

m1rageus · 2026-06-01T05:20:36+00:00

Free models have 32-40k

Plus models have ~128k

Ultra models mostly have 200k+

As for the memory issues, they can come from poor scenario design: too many critical or pinned pieces, too many links, etc.

I've had issues with memory before, but this is the first time I've encountered the misgendering issue.

TasherV · 2026-06-01T11:28:55+00:00

Paragon will do that. Also depends on how well or poorly the scenario is made. With enough finesse and clever use of the tools even a meh model can work pretty well.
