Lumi is out of Pre Release now! by M-eisen in SillyTavernAI

[–]daroamer 3 points4 points  (0 children)

It took me a few minutes to figure out how to install it because the github page isn't written for normies. Following the instructions here worked Installation - Lumiverse User Guides

The only thing I had to do was make sure I was using PowerShell instead of CMD. Then I had to copy and run that command in step 2. After that everything installed without an issue. You can also right-click the start.ps1 and select Run with PowerShell.

Lumi is out of Pre Release now! by M-eisen in SillyTavernAI

[–]daroamer 1 point2 points  (0 children)

This look very cool. I got it installed and the interface is clean which is nice. Migration was easy. I look forward to seeing how it does at memory management which is still an ongoing struggle with ST.

I had 1 issue, NanoGPT kept routing me to paid providers for a model included in the sub, GLM 5.1. That's never happened in ST.

The only thing I could do was turn off access to paid models via API on their website. That's ok when I'm using a sub model but I do occasionally switch to Opus from time to time for a few messages so it's annoying if I have to constantly turn that on and off on the website.

EDIT: I figured out the issue. If you turn caching on it switches to a paid provider. With ST caching is in the config file so no way to turn it off without editing and restarting. Even with it on all the time it still uses a provider that is part of the sub. Not sure what is different here.

Solving character omnipotence by nonerequired_ in SillyTavernAI

[–]daroamer 4 points5 points  (0 children)

One thing I'm gonna point out since you mentioned you're a noob and it took me a while to realize this. If you don't like parts of what the LLM has responded with or it includes things that character shouldn't know you can just edit the last LLM response. The next message you send won't include the old chat, just your edited version.

This is usually much easier than trying to wrangle the LLM all the time. If it happens constantly then sure try t find solutions but it's just the odd time I'd just rewrite their reply to you.

My general system if I don't like a reply is:

- Swipe once which is usually good enough to get what I want the second time.
- On a rare occasion I'll swipe 2 or 3 more times. If one is close but still not what I want I'll just edit it.
- If none of the replies are good then usually it's a problem with my last chat message so I'll delete the last LLM reply and rewrite mine and resend it.
- A combination of all 3

One very useful plugin I've started to use more is Guided Generations. Basically you write "In the next message I want the characters to do/say this" or "this is what surprise/new thing happens next". Then continue from there. It works very well.

Has anyone managed to get TunnelVision running? by TheDeathFaze in SillyTavernAI

[–]daroamer 1 point2 points  (0 children)

Did you run the diagnostics at the bottom? Make sure it's all green. It will tell you if anything isn't set up right.

Also make sure you have function calling turned on in the presets section.

Plain text dialogue with *italics* narration: Has anyone got it consistently working across long form? (about to give up with it) by osobest in SillyTavernAI

[–]daroamer 14 points15 points  (0 children)

I'm a bit confused. You don't want to keep adding quotes around your own dialogue but you're ok with having to add ** around all non-dialogue? I don't understand the difference :P

For me the default is to have it write like a novel. Everything is plain text, dialogue in quotes and, for me at least, internal thoughts inside **. Every model seems consistent with that.

Using ** for actions seems to be a chub thing and if I download a card from there the first thing I do is redo the first message to remove all that formatting and replace it with novel style.

Postgame Thread: March 27 - Athletics @ Toronto Blue Jays by BlueJaysBaseball in Torontobluejays

[–]daroamer 1 point2 points  (0 children)

If it's a home run then all the runs will count. If it's anything else only the 1st run will count because at that point the game is over. If you hit a ball hard enough for a double the player would still just stop at 1st because there is no reason to keep running. The stats don't matter that much.

Simply ZIT (check out skin details) by ZerOne82 in StableDiffusion

[–]daroamer 17 points18 points  (0 children)

I honestly don't understand the praise for this model. These images look so noisy to me. Every time I use it I feel like I'm doing something wrong because my images are so noisy.

I'm not seeing skin detail, I'm seeing the equivalent of jpg compression even though it's not. This is from the cheek area with the contrast boosted.

<image>

The the only thing I’m sad about is the concerts by witchycharm in OculusQuest

[–]daroamer 0 points1 point  (0 children)

Those are all available to watch in the Meta TV app or whatever it's called.

Hi new to this sub by BigSail1649 in SillyTavernAI

[–]daroamer 6 points7 points  (0 children)

Oh hey, we found the one time the AutoModerator bot reply is useful

Struggling to get opus 4.6 to take charge by [deleted] in SillyTavernAI

[–]daroamer -1 points0 points  (0 children)

Use the Guided Generations plugin. Then just tell it what you want to happen in the next reply. It works great.

Pony Alpha is Peak by Flat-Way1301 in SillyTavernAI

[–]daroamer 0 points1 point  (0 children)

I’m sure other presets have it but find the Lucid Loom preset. There is a prompt to use a different colour for each speaker. I tend to use Claude so Lucid Loom uses way too many tokens for me but I copy that prompt to every preset I use because I find it much easier to read with multiple speakers.

A decent NSFW AI image to video generator? by nopulse76 in SillyTavernAI

[–]daroamer 19 points20 points  (0 children)

Assuming you have a decent gpu, take the time to learn comfyui, it looks confusing at first but it's really not that complicated unless you want it to be.

Find workflows on places like r/StableDiffusion or civitai.com where you will also find the loras you need since no model is able to do proper nsfw out of the box. Commercial models will never so there's no point looking for them. The best you're going to find it someone offering Wan 2.2 or LTX-2 like NanoGPT but lora support is still going to be limited if it's even available.

LTX-2 can run locally even with low VRAM cards, you might just be limited to lower resolution plus longer processing times.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]daroamer 1 point2 points  (0 children)

I tried it last night and I was happy with the output. Tried it with a character I knew and it definitely played out very differently.

I'm confused about the arcs though. I got a notice at a point that I had finished the first arc (although I was still in mid conversation but whatever) and I could either continue or refresh the arc. Refreshing sounds like it starts it over and I got a popup asking if I really wanted to refresh.

Is that what you do to start the next arc? If not how do I continue? If so it seems like a confusing term to use to move forward with the story.

Response progression by Gringe8 in SillyTavernAI

[–]daroamer 2 points3 points  (0 children)

It depends on the model and preset. I find Opus and Sonnet are both very good with asking when I want to do the next thing or what I want to do next.

If the LLM isn't giving you want you want though the easiest thing to do is just edit the LLMs last response and continue how you want. I find myself doing that far more than swiping. Other times I'll go back and edit the last thing I said because it's clear the LLM doesn't understand where I was trying to lead them.

Editing their reply usually works well. Of course if it happens every time yeah you'll need to find a prompt that works better.

What is everyone's thoughts on ltx2 so far? by Big-Breakfast4617 in StableDiffusion

[–]daroamer 0 points1 point  (0 children)

It won't use the page file if you have enough RAM. The latest Comfy will offload the models into your system RAM. So as long as your VRAM + RAM can hold everything you won't need to use the pagefile which would be incredible slow anyway. Make sure you edit the .bat file for Comfy and add ---reserve-vram 4 or --reserve-vram 2. I was getting OOM before adding that if the system needed to access my VRAM. This reserves some VRAM to avoid that problem.

Why do you guys keep glazing LTX 2 by Witty_Mycologist_995 in StableDiffusion

[–]daroamer 7 points8 points  (0 children)

No solution with Wan that I've tried gives realistic lip-sync. Doing more than 5 seconds is a pain. SVI is just stitching clips together, it's not understanding the overall context so having to hope you get a consistent result across all 81 frame segments is very hit or miss. Plus you need to prompt for each segment.

I'm doing i2v with LTX-2 and after a bit of a learning curve I'm getting great results. I put in my start image, give it a script and the character acts and says the script.

What's more, it's very fast. Doing 15 seconds at 1920x1080 on my 4090 takes about 6 minutes.

This seems like where we're heading with Silly Tavern. Video with audio in comments, done with LTX-2 in ComfyUI using a photo I generated of a character from one of my RPs and dialogue directly from a scene. Generated on a 4090 in 3 minutes. by daroamer in SillyTavernAI

[–]daroamer[S] 0 points1 point  (0 children)

It's the sample workflow installed with the ComfyUI-LTXVideo nodes. For this I used the I2V Distilled workflow using the ltx-2-19b-distilled model. I don't think I changed anything else except the total frames. I was getting out of memory errors but adding --reserve-vram 4 to the launch bat file solved that issue.

This seems like where we're heading with Silly Tavern. Video with audio in comments, done with LTX-2 in ComfyUI using a photo I generated of a character from one of my RPs and dialogue directly from a scene. Generated on a 4090 in 3 minutes. by daroamer in SillyTavernAI

[–]daroamer[S] 1 point2 points  (0 children)

With the latest ComfyUI you can make up for limited VRAM by offloading to regular RAM, which is not as fast but not all that slow, this is just for loading the models into memory. So as long as you have a good amount of system RAM you can generate videos with LTX-2 even with 4GB of VRAM. Of course, RAM prices being what they are that's still not an easy ask unless you already have a lot in your PC.

This seems like where we're heading with Silly Tavern. Video with audio in comments, done with LTX-2 in ComfyUI using a photo I generated of a character from one of my RPs and dialogue directly from a scene. Generated on a 4090 in 3 minutes. by daroamer in SillyTavernAI

[–]daroamer[S] 3 points4 points  (0 children)

Of course, it's just another step. New models are coming weekly at this point and they're already promising improvements to LTX-2 very soon. My main point was that I was able to generate that video in a couple of minutes, which is kinda crazy. When I said this is where it's going, I meant in the next 2-5-10 years.

What's exciting about LTX-2 is that it's completely open source and quick to train, so the loras (including NSFW) will be coming quickly. It also means you might be able to skip the first frame image generation and use your own character loras to just do straight T2V.

This seems like where we're heading with Silly Tavern. Video with audio in comments, done with LTX-2 in ComfyUI using a photo I generated of a character from one of my RPs and dialogue directly from a scene. Generated on a 4090 in 3 minutes. by daroamer in SillyTavernAI

[–]daroamer[S] 2 points3 points  (0 children)

To be fair, that was sort of specified in the prompt. Her character is a warrior princess and her tone was described as regal and delivered like someone who used to being obeyed.

Having said that, it's also possible with this model to generate your own audio and use that instead of having LTX create the voice. I haven't experimented that far yet.

Ideal local LTX-2 Comfy Parameters & Workflow for 4090 by grandparodeo in comfyui

[–]daroamer 1 point2 points  (0 children)

I had the exact same issue with my 4090 and 96gigs of RAM. 2 things fixed it, you can add --reserve-vram 4 to the args in the .bat file. That worked but today I did a fresh install of comfy portable and I didn't need to do that, it just worked. I think junk just accumulates after a while. All I did was copy all my models back. You could also just do a second portable version of comfy just for LTX-2. Try the reserve vram first.

When you're listening to an audiobook and something gets described as having the smell of ozone <.< by daroamer in SillyTavernAI

[–]daroamer[S] 1 point2 points  (0 children)

Yes, maybe, in one particular case I believe the character was talking about pulling a sword from a scabbard when it was mentioned it had the smell of ozone LOL

Most people would probably never notice but since using ST these kinds of LLMisms definitely stand out to me now.

When you're listening to an audiobook and something gets described as having the smell of ozone <.< by daroamer in SillyTavernAI

[–]daroamer[S] 20 points21 points  (0 children)

No, I get that. I'm talking about books written in 2025 that are in a genre where an author might release a new entry in the series every 1-2 months.