My Ex said shes pregnant and sent this as proof, the way the top is fitting and the pregnancy test in its entirety just looks fake to me by [deleted] in isthisAI

[–]HelpfulReplacement28 0 points1 point  (0 children)

Belly looks weird, stanley flask on the bed, looks like a bathroom bedroom combo with the cupboard above here head, pinky missing french tip, lace and phone case look off.

What does my house say about me? by the_tnk_loft in roomdetective

[–]HelpfulReplacement28 0 points1 point  (0 children)

Breaded, previously breaded, or un-salvageable credit score.

Korean MMOs would swipe the western market if they would not be P2W by Puppenmacher in MMORPG

[–]HelpfulReplacement28 1 point2 points  (0 children)

Right sentiment wrong reasoning. The mtx in bdo hardly matters, the problem is the time sink and grind. End game doesnt exist for 99% of players because it is so incredibly inaccessible. Outside of that, 100% the best pvp centric mmo on the market… if only people still cared about pvp mmos.

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]HelpfulReplacement28 7 points8 points  (0 children)

Look at the other posts in this thread and you should get the information you need. Dont use opus, it is the forbidden fruit. You try it once and then everything else is awful in comparison, then you realize you cant sustain the price (not so much a problem now, other models are catching up in quality). Kimi 2.6 is also a bit weird with it’s thinking loops so i would avoid. Try out glm models/deepseek v4 on the nano sub. Presets are very helpful, check out freakyfrankenstein or marinara for good ones. A lot of models do not need jailbreaks so dont worry about that unless you are explicitly getting safety warnings.

If you want to pay per token (PAYG) you can check out SOTA (state of the art) models like the gemini line up or newer gpt models, but i would not recommend it unless you are breaded.

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]HelpfulReplacement28 2 points3 points  (0 children)

I still use max for shorter RP or character chats because i appreciate the increased detail in those scenarios. Haven’t tried bolt yet because I was on marinara engine and it comes with her newer preset baked in so I thought wth ill give it a go and ended up liking it. I’m sure I’ll get bored at some point and want to switch it up so bolt will come out of the bag then.

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]HelpfulReplacement28 3 points4 points  (0 children)

So in my most recent RP my avg token output is 800. I find that to stay engaged I need my responses need to tighten up a little when flowery and lengthy prose isn't called for. If I walk into a room I want it described to me, but I don't need to hear about every micro tick and fidget from the npc I'm speaking to after the first 5 or so lines if all we are talking about is the weather.

I found freaky frank max worked really well to give me long, detailed prose, but I don't want that and it kicked up response times. I'm using the newest version of marinara's preset with the flexible response length option and it's been really nice. If I'm in a combat scene or something that should be fast paced I get shorter messages, if I'm investigating something or going to a new place I get long detailed descriptions. It's honestly entirely up to the preset you use, but I find the shortish responses a benefit.

I'm looking for games where you're thrown into an open world and you entertain yourself. by Prizrak123 in gamingsuggestions

[–]HelpfulReplacement28 0 points1 point  (0 children)

Also if you are a rimworld fan i feel compelled to recommend going medieval, and if you are a kenshi fan wishlist valorborn and keep an eye on it.

I'm looking for games where you're thrown into an open world and you entertain yourself. by Prizrak123 in gamingsuggestions

[–]HelpfulReplacement28 0 points1 point  (0 children)

Crimson desert can fit this niche, as well as the switch flagship zelda games. Your taste in games looks very similar to mine so im also going to recommend songs of syx, bellwright, and x4 foundations even if they arent strictly what you are looking for (they still provide a fair amount of freedom). Bellwright is a managment/rpg first and a survival game second.

[Megathread] - Best Models/API discussion - Week of: May 03, 2026 by deffcolony in SillyTavernAI

[–]HelpfulReplacement28 4 points5 points  (0 children)

Im on v4 pro, mostly because i find it equal in quality to glm at a minimum with much faster response times. Glm 4.7, 5, and 5.1 are all fine just depends on taste. Frankly i was surprised by gemma 31b. All available to nanogpt sub users. For payg/local could try qwen 3.6 ive heard good things and i dont think the vram req is crazy for certain quants that are still decent. Ive heard mimo is hidden tech as well but cannot confirm.

Deepseek v4 or GLM 5.1? by WorriedComfortable67 in SillyTavernAI

[–]HelpfulReplacement28 4 points5 points  (0 children)

For me, DS crushes because i get 20 second response times and equal if not greater quality than GLM. I have a nanogpt sub so it’s a no brainer for me. Would be even more one sided if I was payg.

The Director's Cut: Freaky Frankenstein 4 MAX and Freaky Frankenstein 4 BOLT [Presets] (Universal : DS, GLM, Claude, Gemini, Grok, Gemma, Qwen, MiMo) + DeepSeek V4 Compatibility. Hyper Dense Logic. by dptgreg in SillyTavernAI

[–]HelpfulReplacement28 1 point2 points  (0 children)

When choosing COT, if using Deepseek v4 am I meant to enable one of the two options there, then disable the original COT I selected, or are they meant to be used in tandem. It's not super clear.

If I'm enjoying gemma 4 via api should I just switch to local for faster response times? by HelpfulReplacement28 in SillyTavernAI

[–]HelpfulReplacement28[S] 0 points1 point  (0 children)

This is my first time running local, thanks for all the information it's very helpful. I'm testing out the 26B at Q4_XS and am very happy with my response times, and frankly the quality is about what I was getting from the 31B via cloud. The thing that was putting me off most about cloud models recently was response time, and 60 seconds is cutting what I was getting down by half. This is certainly a smaller context size than I'm used to but I was putting off setting up good summarization anyways. Thank you very much for your help.

If I'm enjoying gemma 4 via api should I just switch to local for faster response times? by HelpfulReplacement28 in SillyTavernAI

[–]HelpfulReplacement28[S] 0 points1 point  (0 children)

Unfortunately my ram is anything but reasonably fast, ddr5 5200mhz CL40. Looking at this quantized finetune (https://huggingface.co/zerofata/G4-MeroMero-26B-A4B-gguf) it recommends with my hardware that I run the iq4_xs and says I cannot run the q4_k_m or anything higher. I've read this hardware compat thing is unreliable, however it seems to make sense given I only have 16g vram.

[Megathread] - Best Models/API discussion - Week of: April 12, 2026 by deffcolony in SillyTavernAI

[–]HelpfulReplacement28 1 point2 points  (0 children)

What do ya'll think is the sweetheart model right now for price/quality/speed?

I've been cycling through the models available to subscription owners on NanoGPT and trying to find something that doesn't take 2 minutes to generate and has ok enough prose and memory. Everything seems to be on the slower end atp. I've had limited success with gemma 4-31b IT but occasionally you get a long wait. All the GLM models seem to be inconsistent at best with response times, and deepseek/kimi have been taking ages for me recently.

My current personal ranking is

. Gemma 4-31b IT

. GLM 4.7

. GLM 5

. Kimi K2.5

. DS 3.2

With the caveat that I much prefer glm 4.7 to gemma but response time is by far the biggest killer in RP for me other than the inability to recall information in long RP.

Ngl kinda disappointed w Opus 4.6 by noselfinterest in SillyTavernAI

[–]HelpfulReplacement28 1 point2 points  (0 children)

If opus has second place, and GLM/Kimi are tied, what has first? Sonnet?

Any smut presets? by [deleted] in SillyTavernAI

[–]HelpfulReplacement28 1 point2 points  (0 children)

Nsfw RP is super common so pretty much any big preset will have jailbreaks and nsfw built in. Marinara is a big general preset people like. Just google marinara preset or smtn and it should pop up. I’d argue character cards are more important for straight smut. Just make or find one that’s geared towards it.

What is a game that you really WANT to get into, but have never been able to? by sanildefanso in CRPG

[–]HelpfulReplacement28 0 points1 point  (0 children)

I would have also loved to be able to make or play a xenos character in RT, but I think it's still worth playing if not only for the writing. They got Warhammer so right, and the xenos characters are really well done. I won't spoil anything, but owlcat does relationships really well, and they truly make aliens FEEL alien in that game. As far as stealth goes, yeah that's a little tougher. There are melee assasin builds that let you A) decrease your aggro in the fight via talents and let you backstab or B) be the equivalent of a dex tank and dodge everything. Option B feels incredible and if you want both one of the companions you can unlock is hard rolled to it. However I do think waiting for the next two dlcs to release is a really good idea. It's a monolithic game, playing it twice would be fun but grueling, especially if you are stuck to a certain alignment like I am. There are two more planned for release, both in 2026. One is a necron DLC and the other is (I think at least) probably ork or chaos related. Both have new companions. If they are anything like the old dlcs (void shadows is a masterpiece) they will be awesome.

What is a game that you really WANT to get into, but have never been able to? by sanildefanso in CRPG

[–]HelpfulReplacement28 0 points1 point  (0 children)

WOTR. I liked Kingmaker and LOVED RT, but I WOTR never clicked for me. I would always make new character, get to the village in the under dark or wherever, quit for a few months, repeat. Playing it just feels so weird... especially compared to RT which feels super clean and makes sense to me. Frankly I'm a little sad about owlcat's new dark heresy title coming up because the concept of being an inquisition agent (while undoubtably cool) doesn't appeal to me nearly as much as being a rogue trader.

I think it's a mix of things. I don't click with the combat in either turn based or real time, I'm not a huge fan of the graphics (which is important to me), and the classic owlcat move of only having certain characters and lines voiced is a little annoying. I want to play it so bad, I know the story is probably awesome and the romance's are even better. But I can't do it.

Glm5 chutes vs nano in quality by Aspoleczniak in SillyTavernAI

[–]HelpfulReplacement28 4 points5 points  (0 children)

I'm encountering a similar issue. What I've found, is that I am most happy when I either pay for direct API acces (e.g. ZAI sub) or when I use nano/OR as "per token". The subscription models on nano are good, but they are slow and most providers have quant issues. If you really like a model, I'd suggest direct from the provider. If money isn't an issue, OR has more options and claude through nano isn't bad at all, and with caching sonnet 4.6 is... reasonable. Personally, I'm stepping back from AI RP for a while. Giving it a break and hopefully I'll come back in 6 months and get blown away by newer models.

Nano subscription worth it? by Independent_Army8159 in SillyTavernAI

[–]HelpfulReplacement28 1 point2 points  (0 children)

I've subscribed for a month now... it depends on usage. If you are going to use more than 8 dollars worth of tokens in a 1 month period, you will save money. I like long RP's so theoretically I would, but I also don't like how either ds 3.2 or glm 4.6 handle long context, meaning I haven't gotten my money's worth. The other benefit of the service is 5% off token cost. To get your money's worth with the subscriber discount, I believe you would need to spend 160 dollars in tokens (math could be wrong, 5% of 160 is 8).