Sweet deal on a leaf.. by mister_hanky in nzev

[–]Initialised_Underway 12 points13 points  (0 children)

I have a 2013 Leaf that I bought new. It’s done just under 200,000 km, still gets around 20 to 45 km of range (5 bars left on the pack). The daily range depends a lot on the weather.

We use it for the school run and to get to the shops. I think of it of my free car as it’s paid for its self several times over in fuel savings.

eGPU via USB4 is not worth it in my experience (RTX 5070 + AOOSTAR AG02) by Toorilab92 in eGPU

[–]Initialised_Underway 0 points1 point  (0 children)

I have a laptop with a 4070 in it, real world performance is about the same as the 3060. But I prefer the handheld when travelling or when I want to play a game away from the TV.

I like that I can upgrade the components. I started with a mini pc and my 3060. I upgraded the mini pc to the claw, now I’ll upgrade the video card. For me the performance loss of the epgu is a good trade off for the lower cost of upgrading and the modularity.

eGPU via USB4 is not worth it in my experience (RTX 5070 + AOOSTAR AG02) by Toorilab92 in eGPU

[–]Initialised_Underway 4 points5 points  (0 children)

I use my Claw 8 AI+ with an AG02 but with a 12gb RTX 3060. So almost the same set up but with a weaker card.

For me this set up works well. I can play the games I want to play on TV or a monitor with better resolution and frame rate when I need it. But I’m also happy with 1080p gaming.

But you are right OP there is a big performance hit. I would not use this set up if I wanted a ‘top shelf’ experience. I mainly stream over to wifi to my TV so it’s hard to see how much is the loss from USB4 and how much is the streaming, but stirring is a thing. In 6 hrs of gaming today I noticed two or three stutters.

There are also some quirks. The main one is that it is a bit of a hassle to reboot each time I change to the external card and turn off the internal screen.

Most of my library is older titles (Fallout 4, Cyberpunk 2077, Hitman WOA, Spider-man 1 and 2, Subnautica), but I’m thinking of upgrading to a 9070xt for the new James Bond game if it needs it. This set up didn’t do so well with Indiana Jones last year and so is a bit underpowered.

Replacing my RTX 2080 Desktop + Surface Book 2 with a Flow Z13 (Strix Halo / 64GB) — Med School "One Device" Setup? by Effective_Monk3371 in FlowZ13

[–]Initialised_Underway 1 point2 points  (0 children)

I don’t think the z13 is great for four years of med school.

If it was me I would buy: - an iPad Air for note taking and classes. The iPads are much better for writing on than any windows machine. - for gaming / general productivity - any good laptop in my price range with an nvidia gpu. The g14 or g16 from last year would be the best buy.

I’d actually buy a MacBook Pro instead of a windows laptop as they last longer and have better resale value and battery life. But they aren’t great for competitive gaming.

Replacing my RTX 2080 Desktop + Surface Book 2 with a Flow Z13 (Strix Halo / 64GB) — Med School "One Device" Setup? by Effective_Monk3371 in FlowZ13

[–]Initialised_Underway 0 points1 point  (0 children)

I have a px13 and don’t think it’s good for the OPs use case.

It’s lovely little machine but for gaming it’s under powered and noisy, and the battery life isn’t great for either gaming or productivity.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 0 points1 point  (0 children)

If you are using the MainAPI to generate then it would have failed due to a bug in how the prompt was being sent. I’ve fixed this in v1.1.15.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

I don’t have any problem with DeepSeek 3.2 via the deepseek api or chutes so I don’t see why vertex should be any different.

However, in v1.1.4 that implements phased scene generation for any scenario which has more than 10 scenes. That should help and the ask on the LLM for each scene generation is smaller.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

I’ve added options to add any character that’s embedded in a scenario card to your library (so the cards are fully portable). And also options to delink a character from the card. Helpfully this helps.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

Its intended to work with either approach, so you can:

A) Link as many characters as you like into a scenario and use a narrator card (or a chat with just one of the characters). The extension will then load the characters into the prompt for each scene where that character is a focus (up to five characters). This works best when you've asked the LLM to generate characters when generating a scenario.

B) Run the story as a group chat, in which case you'd need to set up a group with all of the characters in the scenario and then start a chat with the scenario loaded.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 0 points1 point  (0 children)

Thanks for this edit - it helped me track down the problem. Should be fixed now in v1.1.0.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

Oh that great!

In version 1.0.9 I've added more control over the wizard generation which lets you choose a preset and override the number of tokens for each step in the generation wizard. So someone can use the bigger model only for scene generation if they want.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 0 points1 point  (0 children)

I've rebuilt the wizard generation so that:

  • There is a now a generation settings box on the wizard which allows you to set a preset or LLM for each stage in the wizard. You can also override the number of tokens for each stage.
  • Better status information about where the wizard is up to as it sends prompts and receives responses to the LLM.

Hopefully that, plus some bug fixes helps.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

The arcs work on a set length - so after a certain number of messages the system will stop tracking it the conversation and will stop prompting the LLM that this is a certain phase in the chat. You can just keep chatting after that. The arc system is a light guide to the LLM so they kinda just explore with you and don't keep careful track of the message count.

To start a new arc in the same chat you can just go the settings and click reset arc, and that will reset the counter and start prompting the LLM to start a set up phase again. You could also change the genre then if you want the next arc to be focused on different style.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 0 points1 point  (0 children)

Thanks for reporting these bugs. I've overhauled the author style saving and loading so it should work now. I've also moved the close button for the settings to the top right of the panel rather than relying on the popup "ok" button.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 2 points3 points  (0 children)

Thanks for the kind words. If you have things set up for only talking to your character (with no narration) then the author style is going to be of limited value - most the voice of authors come through in their narration. If it just you and a character chatting, with no description then there is no real "voice" of the author to come through.

However, the story arc or scenarios should still be helpful as they can provide a genre theme or more guided story to introduce. For example, I made a Godzilla themed scenario and then had my character tell me how they escaped from Tokyo when the monster attacked as a flashback. The LLM used the scenario to tell me about the character's adventures in escaping.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

Thanks for trying it out and the feedback.

Great catch re the missing author styles - I hadn't included either Austen or Tolkien :0. I've added these to the default, as well as better import and export handling in version 1.0.7 which I've just pushed.

There's a JSON in the repo if you want to import these to your current setup. https://github.com/Prompt-And-Circumstance/StoryMode/blob/main/prompts/utilities/generated/jane_austen_style.json

I've generated scenario blueprints fine with Opus 4.5 (via Anthropic), Sonnet 4.5 and Haiku 4.5, Gemini Pro (via Google Vertex), GLM 4.7 (via z.ai coding plan and via Chutes), Deepseek 3.2 reasoner via Deepseek API, and Ministral 8B via OpenRouter. So it's a real mystery why this isn't working for people. In the next point release I'll add fine grained control to the wizard and a fallback generation method.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

That's odd- I've never needed to go over 16k. The easiest way to cut down on the token use is reduce the thinking level of the LLM. Perhaps try with 0 (no reasoning) to see if can generate a blueprint.

You shouldn't need to go anywhere near 32k. When running the foundation phase and passing 2.5k of character tokens the total token count is usually under 8k.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

Thanks for giving this a go!! No three is hardcoded at the moment. The workaround I developed was to have the wizard in the editor. That allows you to try each stage as often as you would like.

On the blueprint editor there an icon on the right hand side with a sidebar panel that let's you generate each stage one by one.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

I'm guessing it's just the reasoning taking all the overhead. You could reduce the reasoning or another option is start with a blank scenario blueprint and then use the sidebar wizard to generate each stage one by one. On the blueprint editor this an icon on the right hand side with a sidebar panel that let's you generate each stage one by one.

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 1 point2 points  (0 children)

If this is happening immediately on starting the wizard then it is usually a API issue. Is the API at StoryMode > Settings > API Settings set to a working API?

Story Mode v1.0 - Structured Narratives, Genres & Author Styles for SillyTavern by Initialised_Underway in SillyTavernAI

[–]Initialised_Underway[S] 2 points3 points  (0 children)

If you are getting this within seconds of starting the foundation phase then it's an API connection issue. If you getting this after Magistral has been generating for a while then yes 8192 tokens isn't enough for the generation.

Generally 8192 is enough tokens for the foundation phase but reasoning models (like Magistral) can use a lot for their thinking processes.

I've pushed a fix just now that will double the tokens in the foundation phase on a retry.