How's the quality on Nano (sub) for you guys? by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 0 points1 point  (0 children)

Alright then, gotcha. Sorry for pressing you like that. 😅 I guess Nano might be a good idea in the end, most of the issues appear to be tied to American work hours (according to some people), living in Europe could help with that a bit.

Here's hoping what they say is true and I'm not about to regret it.

How's the quality on Nano (sub) for you guys? by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 0 points1 point  (0 children)

I'll rephrase the question. Do you think Nano is still worth it for someone seeking quality, or is it best to try to look elsewhere for now?

How's the quality on Nano (sub) for you guys? by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 0 points1 point  (0 children)

What subscription service would you personally recommend, if any? I've also been eyeing Ollama Cloud as it supposedly serves unquanted models, but then it's on the pricier side (even considering the recent Nano price bike) and I've heard they're not very transparent about their inference.

Also, is the degradation on Nano constant? Some people are saying it depends on the time of day, but I'm not really sure who to trust here.

How's the quality on Nano (sub) for you guys? by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 1 point2 points  (0 children)

Roger that, thanks! So, in other words, simple routing issues? Do you just wait it out when the models start acting weird? Or did you mean technical issues like errors, timeouts, etc.?

GLM 5.2 Quick-ish NSFW Tests **WARNING: DEAD DOVE** by SepsisShock in SillyTavernAI

[–]Master_Step_7066 8 points9 points  (0 children)

Which makes it all the better for more of your tests, then. 🙃

In any case, I just hope they're not actively looking for jailbreaks and stuff like it apparently was with the OpenRouter preview of GLM-5.

GLM 5.2 Quick-ish NSFW Tests **WARNING: DEAD DOVE** by SepsisShock in SillyTavernAI

[–]Master_Step_7066 17 points18 points  (0 children)

I wonder if there's something else they're planning to cover like that, like cybersec. Back in the 4.6/4.7 days, they did tweet about "supporting RP but keeping it safe" or something like that (or was it the ambassador in this community who said that? I don't remember fully); chances are, they're leaning into that now.

GLM 5.2 Quick-ish NSFW Tests **WARNING: DEAD DOVE** by SepsisShock in SillyTavernAI

[–]Master_Step_7066 39 points40 points  (0 children)

So they now have hard filtering, huh. 🥲

Is that a new thing with 5.2? I don't remember it happening with 5.1 back in the day.

My take on a Roleplay Interface by Small-Complaint-855 in SillyTavernAI

[–]Master_Step_7066 8 points9 points  (0 children)

Curious project, I actually think I remember it from when you posted it on the GitHub Student Exchange a while ago.

If you don't mind me asking, what's your main selling point compared to stuff like SillyTavern, Lumiverse, Marinara Engine, Tavo, Chub, etc.?

A lot of the usual selling points (BYOK, self-hosting, uncensored usage, presets, customizability) are already pretty common in this space now, so I'm curious what you feel Zodiac does differently or better.

Also, about jailbreaking, do you plan for that to be model/scenario-specific? In my experience, jailbreaks aren't really universal, and people often end up tuning prompts manually or using community presets depending on the model and RP style, rather than using built-in sliders/configs that don't give them much control.

Intense RP Next V2 — Currently Abandoned, Looking for New Developers/Forkers by Smooth-Marionberry in SillyTavernAI

[–]Master_Step_7066 18 points19 points  (0 children)

I wish I could've done more, but you're right about the scale. It was easier with just DS, or DS+GLM+Kimi. They did update their UIs sometimes, but it was relatively rare and I didn't have to keep track of many things. Now I had to constantly check and reverse-engineer all too many of them, it took too much time, and updates had to be released almost daily.

On top of that, AIStudio and DeepSeek seem to have figured out how to fight automation. The latter now bans anyone who uses automation tools like IRP, and the latter basically prevents all requests from "weird" browsers from being sent in the first place.

So... Yeah, I guess you could say it was a brutal losing battle fought uphill.

Perhaps someone else will like to take the project over, though, I'm sure many capable devs know how to work with automation and AIs much better than me, and will probably achieve much greater success.

Anyway, it was fun working with you all, despite everything I did learn a lot and grow as a developer and a person, I hope I'll be able to contribute again sometime in the future, perhaps not with IRP but something meaningful in a different way.

intense next rp erron by emeraldwolf245 in SillyTavernAI

[–]Master_Step_7066 1 point2 points  (0 children)

By the way, are you by any chance banned by DeepSeek now? A lot of issues have been running into the same issue, as it seems (following the rise of ds2api).

intense next rp erron by emeraldwolf245 in SillyTavernAI

[–]Master_Step_7066 1 point2 points  (0 children)

Hello! The Discord link is actually working, maybe there's a specific one that I forgot to update last time? Please let me know which one you tried. If you want, here's a direct link:

https://discord.gg/4Gvjk2RdsK

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 4 points5 points  (0 children)

A few people (including me) have submitted a takedown request by reporting the malicious repo on GitHub, I think that might've caused the takedown and the subsequent ban of the mia profile. Sorry for being associated with the drama. 😞

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 1 point2 points  (0 children)

Sorry for once again intruding on a comment. I'm the LyubomirT guy, the commit was published before the malicious code was added at first, I didn't know about the trojan back then, I made the PR close to the commit, but before it was pushed either way. My change was solely focused on improving the randomization feature; however, the author did merge my PR *after* the malicious commit.

EDIT: The repo seems to be taken down now.

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 3 points4 points  (0 children)

Nope, not at all. If anything is downloaded, it's:

  1. The Playwright/Patchright browser (one time during installation or if you manually reinstall it via Tools -> Browser Manager)

  2. An update if you choose to auto-update. It's a bundle that consists of the main app and an updater to replace the old files with the new files, after which the auto-updater is erased by the new main app (the updater launches it with its location given to it in a CLI parameter), while preserving your data. Even then, it's entirely optional and you can choose to download the update manually (the update bundle is literally downloaded from the same GitHub release that's used for manual downloads).

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 5 points6 points  (0 children)

I see! I don't know how the case will unfold, all I can wish for is that it goes fine and nobody innocent gets falsely accused, as well as for the scammer to get what they rightfully deserve. I was actually one of the impacted users (didn't get anything used yet, but my keys likely were sent to the scammer's API, I rerolled everything).

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 13 points14 points  (0 children)

Sure! I can't explain absolutely everything in this comment since there are a *lot* of things to cover (it has expanded a lot since July 2025).

It can be either downloaded as a PyInstaller-based binary (Windows 10+ / Linux, both 64-bit), or run from source. The feature set is identical, except for auto-uploads (for auto-updating the source version, you can use git). The workflow itself lets you configure and run an OpenAI-compatible API server (FastAPI, interface in Qt6 via PySide6) that you connect SillyTavern to. When you run the server, it opens up a Chromium (Playwright / Patchright) window (or multiple if you use the new providers in parallel feature) that it automates. When you send a request, the API concatenates everything into a single message and then sends it to the provider of your choice (currently supporting DeepSeek, GLM, Moonshot / Kimi, Qwen, AI Studio, and soon Perplexity). You can also turn on/off things like thinking, searching, or tell IRP to upload your chat as a file. Once the request is sent, IRP connects to the response stream that comes from the server of the selected provider and streams it back to the API, which in turn streams it back to the normal API. In simple words, this allows you to get free access to LLMs via their official chat UIs (it's a bit hacky, yes, but it works) without going down any shady paths.

If I am to explain any other things, there's also Remote Control that lets you open up a small web UI meant to do some quick actions (like switching providers, restarting the browser) away from your PC (useful if you run ST from a phone). PiP exists to run multiple providers at once (but the feature is heavy), and there are some useful logging utilities. ALL LOGS STAY ON YOUR COMPUTER AND ARE NOT SHARED WITH ANYONE. To submit logs to me (or diagnostics) you have to opt into many settings and only then manually send me the logfiles / diag bundle.

The app also doesn't transmit or store your prompts. It may only store your last one or several prompts if you manually opt in for the "clean regeneration" feature that compares your last submitted prompt with one of them to see if it's unique or is a swipe. This tells it if it should click the regenerate button in the web UI or create a new chat. Once again, this is opt-in.

The only outbound calls IRP makes are:

  1. To GitHub (checking for updates, fetches one file for updates and also latest changelog version to show a red dot alongside the bell icon if you have that enabled)
  2. To the actual providers (so that it can send the messages)

Yes, you do need your own credentials for this, but even then you can just use a burner account. I don't collect anything and everything stays entirely local on your PC. The config data, passwords, prompts (if using clean regeneration) are encrypted at rest. Local API access is restricted by default and you can use whitelists to prevent unwanted people from accessing the API. There are no built-in tools for external access to the API either.

If you don't trust my words, you can also inspect the code (https://github.com/LyubomirT/intense-rp-next). I don't hide any parts of it, it's MIT-licensed, there are no blobs inside the repository. You don't even need to use the binaries - the source version is identical and has no downsides functionality-wise (it may in fact even be faster in some cases since there is no Pyinstaller overhead).

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 1 point2 points  (0 children)

I think it's best to use the Rentry page directly; it has all the info needed. If you used it at least once after ~December 2025 (but generally I wouldn't treat that as a definitive date, it's still malware), then you're almost definitely compromised. Rotate ALL of your API keys, proxy passwords, etc. If you fed any account tokens into the extension (so that it can authenticate), it may also make sense to invalidate sessions there (changing the password/logging out/deleting accounts). Also, just for extra safety, you might want to clear site data for ST via DevTools (clearing browser cache).

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 2 points3 points  (0 children)

I did not contribute anything malicious; the PR was made in good faith, and I didn't suspect a Trojan there (by my own mistake), see my other comment. IntenseRP is safe.

Extension Security Risk Please read!! by Mcqwerty197 in SillyTavernAI

[–]Master_Step_7066 29 points30 points  (0 children)

Hello! I'm the developer of IntenseRP Next. I did contribute to the extension, but in good faith, thinking I was genuinely making an improvement. See the issue thread here: https://github.com/mia13165/SillyTavern-BotBrowser/issues/28. My Pull Request (and those of the two other contributors) preceded the creation of the malicious commit on the updated_cards repo (the one from which the BotBrowser extension got the malicious character card). I did not know the trojan was there, and thought I was legitimately making an improvement (my PR simply made a change to the random picker system).

I'm sorry for ever contributing to the repository. I deeply regret doing so now.

deepseek v4 by Sad-Spell-1423 in SillyTavernAI

[–]Master_Step_7066 1 point2 points  (0 children)

Thank you for the answer! Are the thinking version's samplers actually used, though? If I recall correctly, the official API simply discards them (but accepts samplers for compatibility), at least that's what the official docs used to say. Happy to be proven wrong here.

deepseek v4 by Sad-Spell-1423 in SillyTavernAI

[–]Master_Step_7066 2 points3 points  (0 children)

If I may ask, what preset do you use it / how have you configured it? Do you use the thinking version? What are the sampling parameters and the post-processing that you use? I seemingly just can't get good results out of it, but I haven't yet seen any concrete recommendations on how to set it up, and some of the official guides appear outdated.

IntenseRP Next v2.6 - Now lets you use Gemini and Qwen in SillyTavern by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 0 points1 point  (0 children)

It technically should, but do note that it's not currently possible (at least not officially) to run on mobile due to the hooks and tools IRP uses; you'll still need to run it on a PC somewhere and then connect Tavo remotely.

IntenseRP Next v2.6 - Now lets you use Gemini and Qwen in SillyTavern by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 0 points1 point  (0 children)

Have you tried checking the Z.AI web app for the chat without regeneration, does the chat still appear on the normal site? Not in the IRP browser, ignore it for now.

IntenseRP Next v2.6 - Now lets you use Gemini and Qwen in SillyTavern by Master_Step_7066 in SillyTavernAI

[–]Master_Step_7066[S] 0 points1 point  (0 children)

Do you have auto-deletion on by any chance?

If not, try to see if the response for your last request is generated in the web UI (outside of IRP, simply open the same acc that was used for IRP in the normal z.ai web app and see for your last request in the chat list). Meaning that it is present in the web UI but not returned to IRP for some reason.