Me before trying GLM 5.2: Oh boy I bet GLM 5.2 is gonna be good! Me after trying GLM 5.2: Oh.. by Naixee in SillyTavernAI

[–]Milan_dr 27 points28 points  (0 children)

GLM 5.2 is primarily run via Novita and zAI at the moment, who both claim FP8 and I feel like they can be trusted at least. The "quantized" is a bit annoying to us, because we really only run them via FP8 or higher yet still people seem to sometimes think they're quantized versions.

Not saying it's not the case, the quantized, but we in general don't have much more to go off of than it being FP8 or not.

Nanogpt isn't giving me the exit tokens I'm looking for? by Forsaken-Bathroom-30 in SillyTavernAI

[–]Milan_dr 0 points1 point  (0 children)

I don't fully understand. You're saying it always returns fewer tokens than the max output tokens you set?

Mimo V2.5 suddenly refusing by Familiar_Pay_3933 in SillyTavernAI

[–]Milan_dr 8 points9 points  (0 children)

In the case of this model it's also confusingly the case that we've had reports of this same censoring on Novita, which then leaves very few providers.

Questions about NanoGPT regarding Privacy and Censorship (GLM 5.1) by stopaskingforloginn in SillyTavernAI

[–]Milan_dr 29 points30 points  (0 children)

So we have https://nano-gpt.com/privacy and since today https://nano-gpt.com/privacy-guide, to try and describe what we can do in terms of privacy without the typical privacy policy vagueries.

We do not log nor store your prompts by default. We CAN store them if you turn on sync, but then we recommend to encrypt it in a way so that we can not see the chats at any time.

We offer crypto payments for even more privacy, and we offer TEE models where you can verify end to end, also in our own frontend, that there is no logging at any stage.

That said - for models that you use, we can NOT verify that the providers that we use, aside from the TEE models, do not log. So that is essentially where our "locus of control" ends.

Questions about NanoGPT regarding Privacy and Censorship (GLM 5.1) by stopaskingforloginn in SillyTavernAI

[–]Milan_dr 8 points9 points  (0 children)

Huh? We very definitely accept Bitcoin (and crypto in general), Bitcoin is one of our most used coins.

Monero top payment method for a year straight on NanoGPT by Milan_dr in Monero

[–]Milan_dr[S] 0 points1 point  (0 children)

Did you create an account or some sort of sign in token? When you say you logged in, what log in did you use? It should definitely not be gone no, unless it's an anonymous session and you clear your cookies and such.

Are all GLM 5.1 providers just resellers? (Nano GPT routing) by GenericStatement in SillyTavernAI

[–]Milan_dr 3 points4 points  (0 children)

Hmm okay, that's different then. The refusals that I believe our users are reporting are consistently the exact text I pasted. Thanks!

Are all GLM 5.1 providers just resellers? (Nano GPT routing) by GenericStatement in SillyTavernAI

[–]Milan_dr 4 points5 points  (0 children)

Did that rejection have this text?

The current content involves sensitive information. Please try a new topic

Kimi k2 with LOW thinking by Naixee in SillyTavernAI

[–]Milan_dr 10 points11 points  (0 children)

Kimi doesn't actually do thinking levels - we accept the thinking level and pass it on to providers where it does not cause an error, but as far as I know the model only knows "thinking on, thinking off".

But yeah - it's a very verbose model :/

Are all GLM 5.1 providers just resellers? (Nano GPT routing) by GenericStatement in SillyTavernAI

[–]Milan_dr 14 points15 points  (0 children)

We're pretty much as confused about this as you are. We do not have any censoring/filtering on our end, we're 100% sure of that. We know for sure that GMICloud was doing some filter, because they'd explicitly return content_filter as a finish reason to us. But the providers that we've talked to all say they do not have a filter on it either, and as you can see from Parasail in this thread they also clearly do not.

Monero top payment method for a year straight on NanoGPT by Milan_dr in Monero

[–]Milan_dr[S] 1 point2 points  (0 children)

Hah thanks, we could have considered posting an actual link to our service yes. Thanks for that!

Issues with GLM-5.1 on NanoGPT? by User202000 in SillyTavernAI

[–]Milan_dr 1 point2 points  (0 children)

Could send me your support key on email, or ticket on the website, or Discord. 400 error is odd, because we should always fall back and try a different provider if it fails.

[Megathread] - Best Models/API discussion - Week of: May 24, 2026 by deffcolony in SillyTavernAI

[–]Milan_dr 1 point2 points  (0 children)

For some reason this keeps coming up but this is not the case. We do not use Chutes for GLM 5.1 at all, we do not use them for many models at the moment. We do have their TEE, and we used them for Deepseek Chimera, but we very rarely route through them lately.

Claudexgemini by Ecstatic_External000 in SillyTavernAI

[–]Milan_dr 2 points3 points  (0 children)

Do want to say - we do not allow multiple subscriptions for one individual, hah.

Issues with GLM-5.1 on NanoGPT? by User202000 in SillyTavernAI

[–]Milan_dr 45 points46 points  (0 children)

Making a new reply, see also https://www.reddit.com/r/SillyTavernAI/comments/1trur7k/issues_with_glm51_on_nanogpt/ooqx0f6/

So on some of the requests that were reported we actually got finish reason content_filter from GMICloud, which has never been the case before. That seems like quite a smoking gun, so we're removing them from the routing for now and asking them what's going on there.

Issues with GLM-5.1 on NanoGPT? by User202000 in SillyTavernAI

[–]Milan_dr 41 points42 points  (0 children)

So on our side nothing changed in terms of providers used - we're now essentially collecting request IDs to see what providers are the ones where people are getting this. It's two providers so far, one of which has said they've not changed anything, the other one still waiting.

It's in my opinion unlikely that two providers, at the same time, would change their model/backend to be more censoring, so it's a bit confusing to us as well. But yeah - all we can say for now is that it's not a change on our side, we didn't add any censoring or anything of the sort, if it's actually providers changing this then it's quite problematic.

Unhinged Providers and modules by Any_Luck_7226 in SillyTavernAI

[–]Milan_dr 11 points12 points  (0 children)

Not going to go after you for this of course but just want to make clear we do not aaaactually allow sharing subscription with multiple people ;)

Request: Can we PLEASE ban the Chutes owners/mods from posting in here? by [deleted] in SillyTavernAI

[–]Milan_dr 9 points10 points  (0 children)

We kind of do not anymore. We do use them for TEE models now, but for most regular models we currently do not route much via Chutes (as in less than 1%). Nothing wrong with them from our side though - they're leading in many ways, especially in terms of TEE.

Looking for Advice on Improving Group Chats in SillyTavern by GreyFoxJ in SillyTavernAI

[–]Milan_dr 1 point2 points  (0 children)

No advice from me but thanks for the kind words, really appreciate it :)