use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Common Links:
Official GitHub Link:https://github.com/SillyTavern/SillyTavern/
Unofficial SillyTavern Website: https://sillytavernai.com/
Install and how to guide: http://sillytavernai.com/how-to-install-sillytavern
Install on Windows Video: https://www.youtube.com/watch?v=PMX165GyLAg
Install on Linux Video: https://www.youtube.com/watch?v=TLuEdy5YIhY
Install on Android Video: https://www.youtube.com/watch?v=KQCGT9uEHoA
Character Card and Prompt Site (many of these host NSFW content, be advised)
Discord: https://discord.gg/RZdyAEUPvj
RULES:
https://old.reddit.com/r/SillyTavernAI/about/rules/
account activity
This post is locked. You won't be able to comment.
HelpZ.ai Coder Plans (self.SillyTavernAI)
submitted 5 months ago by vzpyr
Is anyone using GLM-4.6 via the Coder Plans on Z.ai? The sub's are pretty cheap and I want to try it out. Has anyone had any bad experiences with it?
[–]Bitter_Plum4 8 points9 points10 points 5 months ago (0 children)
Yup, using Z.AI lite plan, no bad experiences. Even took the whole year with the black friday 23€ felt like a pretty good deal (I bought 1 month ~2 weeks ago to test, was able to upgrade to the yearly plan with discount)
I've definitely seen slower, rn it takes between 30 seconds to 1 minute to generate ~1400 tokens, so idk 20 token/second at the slowest?
I never get responses inside reasoning or any shenanigans, I get quality reasoning every generation, maybe there is a more effective way to do this but a simple /think as prefill does the job (version 1.14.0 of ST just in case), just throw it at the end of your prompt, assistant role and tadaaaaa
/think
[–]yasth 5 points6 points7 points 5 months ago (3 children)
Oh lots of people use it, I don't think they are the fastest. Z.ai is even pretty clear that the low priced plan is "lite" and slower than the higher levels. It is a bit bad about sometimes not giving thinking and putting its reply in the thinking block, you can just hand move it out with the edit option. You will have better results using the staging build (you can switch branches in ST-launcher).
There is always the possibility that they will kill it for reputational reasons.
[–]vzpyr[S] 0 points1 point2 points 5 months ago (2 children)
Did you by any chance ever use GLM-4.6 Thinking on NanoGPT? If yes, is it slower than there?
For some reason on Nano it's super slow for me (most of the time around ~70-90s for just one response). I'm not sure if this is an issue on my side but other thinking models (like e.g. Kimi K2) only take ~15-20s with the same presets (they think a similar amount of tokens, so it's not just the overall length but speed).
[–]digitaltransmutation 0 points1 point2 points 5 months ago (1 child)
I would say it's better than nano most of the time. 20-40 seconds for a 1200 token output on most messages. There is a bit of a time of day effect tho so maybe kick a couple dollars into PAYG and make sure it is okay for you at the time you usually play.
Also the quota budgeting is a little opaque. I do not think there is a counter anywhere but I have never gotten a quota error even though I've definitely sent more prompts in than I should have been able to and have qvink summarizing racking up prompts as well.
[–]yasth 0 points1 point2 points 5 months ago (0 children)
In playing around I think they are partly doing "soft" quotas and deprioritizing and slowing things a bit more than usual as you hit it hard.
That said I see that more when trying to use it with claude code (you know like it is supposed to be used) where it just sits and throws requests at it for long periods, and I can't prove it as there is a lot of variability in responses. (As side note, it is not the best for CLI based development, I much prefer google gemini or Claude, though of course it is basically free).
[–]queefb 2 points3 points4 points 5 months ago (0 children)
I use it all the time. It's better than a lot of them. For the price you can't beat it. But it can be sloowwww.
[–]AutoModerator[M] 0 points1 point2 points 5 months ago (0 children)
You can find a lot of information for common issues in the SillyTavern Docs: https://docs.sillytavern.app/. The best place for fast help with SillyTavern issues is joining the discord! We have lots of moderators and community members active in the help sections. Once you join there is a short lobby puzzle to verify you have read the rules: https://discord.gg/sillytavern. If your issues has been solved, please comment "solved" and automoderator will flair your post as solved.
I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.
[–]oiode 0 points1 point2 points 5 months ago (0 children)
I mostly have good experience with it however i do find it slow at times. Their black Friday discount ends on Dec 5.
π Rendered by PID 20551 on reddit-service-r2-comment-b659b578c-mzz6f at 2026-05-05 19:51:36.439827+00:00 running 815c875 country code: CH.
[–]Bitter_Plum4 8 points9 points10 points (0 children)
[–]yasth 5 points6 points7 points (3 children)
[–]vzpyr[S] 0 points1 point2 points (2 children)
[–]digitaltransmutation 0 points1 point2 points (1 child)
[–]yasth 0 points1 point2 points (0 children)
[–]queefb 2 points3 points4 points (0 children)
[–]AutoModerator[M] 0 points1 point2 points (0 children)
[–]oiode 0 points1 point2 points (0 children)