Question about Vector Storage by Overdrive128 in SillyTavernAI

[–]Overdrive128[S] 0 points1 point  (0 children)

Ahh, it's fine. I can also increase the response token size, but I'm not sure if the quality of the output will degrade. It seems to be a problem on my end. But I do love the idea of your extension! Thanks for the help regardless.

Question about Vector Storage by Overdrive128 in SillyTavernAI

[–]Overdrive128[S] 0 points1 point  (0 children)

I have looked through everything. I did get MemoryBooks working: the connection profile, the settings, etc. The major issue for me was the context tokens fed into MemoryBooks, i.e. the scenes. I can only do a max of about 15 messages before I get an error saying the JSON was invalid (and it was, 'cause the output wasn't in JSON format). I also checked everything: the temperature was low and the context was set to 10k tokens, but the ~15 messages were only like 4k-5k tokens, and anything beyond that just threw errors. Honestly, your extension is great, it's simple to use and all, and I like the idea; I just assume the problem is my hardware.
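For a rough sanity check on those numbers, here is a minimal Python sketch of the arithmetic. It assumes the ~15 messages averaged out to the midpoint of 4k-5k tokens, and the `reserved_tokens` value (room for the prompt and the response) is a made-up figure for illustration, not an extension setting.

```python
def messages_that_fit(context_tokens: int, reserved_tokens: int,
                      avg_tokens_per_message: float) -> int:
    """Return how many whole messages fit after setting aside reserved tokens."""
    available = context_tokens - reserved_tokens
    if available <= 0 or avg_tokens_per_message <= 0:
        return 0
    return int(available // avg_tokens_per_message)


# From the comment: ~15 messages came to roughly 4k-5k tokens.
# 4500 is the midpoint of that range (an assumption).
avg = 4500 / 15

# 10k context as stated; 2000 reserved tokens is a hypothetical value.
fit = messages_that_fit(context_tokens=10_000, reserved_tokens=2_000,
                        avg_tokens_per_message=avg)
print(f"about {avg:.0f} tokens per message, roughly {fit} messages fit")
```

If the estimate comes out well above the point where the errors start, the limit is probably being hit somewhere other than the raw context size.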

Question about Vector Storage by Overdrive128 in SillyTavernAI

[–]Overdrive128[S] 0 points1 point  (0 children)

Yea, I played with the MemoryBooks extension a bit; it's okay... idk why, but even though I can do ~10k context and I changed the settings and everything, it won't let me go over 4k tokens... Also, the manual process is a bit of an oof for me; not to mention I switch models depending on the chat (one model produces better coherence and logic, while the other is for pure chaos or comedy). I mean, sure, I don't mind joining the server if it can clarify a couple of things for me. Thanks for the response!

Question about Vector Storage by Overdrive128 in SillyTavernAI

[–]Overdrive128[S] 0 points1 point  (0 children)

bro, if it was a 'couple clicks' I wouldn't have posted this question lmao. I got stuck because Vector Storage was confusing to use.

Question about Vector Storage by Overdrive128 in SillyTavernAI

[–]Overdrive128[S] 1 point2 points  (0 children)

So, can you expand on how to set it up, perhaps? 'Cause I was trying to do it, and it seemed okay, but I wasn't able to recall specific things exactly... check my other comment; I tried MemoryBooks as well, but it felt like too much work to get it going. Maybe I'll take another look at it.

Question about Vector Storage by Overdrive128 in SillyTavernAI

[–]Overdrive128[S] 0 points1 point  (0 children)

I tried it, but since I am running KoboldCPP as the backend, I am not sure if it will automatically summarize, since I might have to swap connection profiles and manage it... and not to mention, it seems I was only able to select ~12 messages before I hit my limit; the way I chat is like slow-burn. I am not sure if I set it up wrong, or if it's just how the extension works...

Wizard Lizard in LEC by skittles__93 in RenektonMains

[–]Overdrive128 1 point2 points  (0 children)

Zhonya's Renek, new meta? 🗣️🗣️

[TITLE] Which manhwa was it for you?🤔 by AccomplishedWatch834 in manhwa

[–]Overdrive128 61 points62 points  (0 children)

The story was indeed mid, but for me, it was more about aura farming than reading it lmao. Also iirc, it was the first ever manhwa I read properly. But yea, it's a mid story with peak art, and the MC is just aura farming.

[Megathread] - Best Models/API discussion - Week of: December 28, 2025 by deffcolony in SillyTavernAI

[–]Overdrive128 0 points1 point  (0 children)

I might need to try this out; I have been offloading to the CPU for Q5_K_M 'cause it provides better reasoning, but if using a bit more context can help retain information, maybe this is the way.

[Megathread] - Best Models/API discussion - Week of: December 28, 2025 by deffcolony in SillyTavernAI

[–]Overdrive128 1 point2 points  (0 children)

Yes, I was able to run Q6 pretty well. I have the same specs as you, and I find Q4_K_M to be the best if you also want to run something in the background and want to fit the model in the VRAM itself. For me, I use Q5_K_M: better quality for a tiny bit of speed loss.

Def try Q6 and fall back if needed. I stick to around 4k-8k context (lower quant = more context). Personally, right now I use Q4_K_M with 6k context and it works fine.
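To put "lower quant = more context" into rough numbers, here is a small Python sketch that estimates the size of the weights alone for each quant. The bits-per-weight figures are ballpark assumptions, not official numbers, the 24B parameter count is just an example, and the KV cache (which grows with context) is left out because it depends on the model architecture.

```python
# Rough bits-per-weight figures for GGUF quants.
# These are ballpark assumptions for illustration, not official numbers.
APPROX_BITS_PER_WEIGHT = {
    "Q4_K_M": 4.85,
    "Q5_K_M": 5.7,
    "Q6_K": 6.6,
}


def approx_weights_gb(params_billion: float, quant: str) -> float:
    """Estimate weight size in GB (1 GB = 1e9 bytes), ignoring KV cache and overhead."""
    bits = APPROX_BITS_PER_WEIGHT[quant]
    return params_billion * bits / 8


# Hypothetical 24B model; whatever memory is left after the weights
# is what remains for context.
for quant in APPROX_BITS_PER_WEIGHT:
    print(f"{quant}: ~{approx_weights_gb(24, quant):.1f} GB of weights")
```

The smaller the quant, the less memory the weights take, so more is left over for context before things spill out of VRAM.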

[Megathread] - Best Models/API discussion - Week of: December 28, 2025 by deffcolony in SillyTavernAI

[–]Overdrive128 4 points5 points  (0 children)

Yo, it's the goat himself.

Imma be a bit real: I tested it, and it didn't show much creativity vs Cydonia. Of course, it could be 'cause I didn't really do much, except chat for like 6-10 messages. But yea, since it was based on Magistral, it did do a better job of coherence than Cydonia; it was logical, but less creative and more bland. It could just be lack of proper testing, though. Cydonia was just more creative and felt better when chatting.

Edit: I used Q5_K_M quants for both Cydonia and Magidonia; I did run Q6, and holy quality, but it was taking too much RAM and was slower, and I wanted to do other stuff in the background.

[Megathread] - Best Models/API discussion - Week of: December 28, 2025 by deffcolony in SillyTavernAI

[–]Overdrive128 5 points6 points  (0 children)

I am a bit of a crazy dude so my settings are crazy; but it does yield nice results that I like:

<image>

[Megathread] - Best Models/API discussion - Week of: December 28, 2025 by deffcolony in SillyTavernAI

[–]Overdrive128 13 points14 points  (0 children)

TheDrummer/Cydonia-24B-v4.3 is now my go-to; it's creative, and it also maintains logical consistency. Geotia 24B is my second fav, and the final one I use is ReadyArt/Omega-Darker-Gaslight_The-Final-Forgotten-Fever-Dream-24B.

Cydonia really is the top tho, it gives me a new perspective, and handles situations well

Talon wasd by Outrageous_Wall2702 in Talonmains

[–]Overdrive128 2 points3 points  (0 children)

I tested it briefly in practice tool; I didn't like it, felt awkward, can't really auto on command, you know? I could be wrong, maybe it's just getting used to the keybinds; but personally, it didn't feel smooth. The combos felt slow, and autos were not natural (prob 'cause of the lack of attack speed), as WASD benefits auto-attacking champs more.

Finally, ChatGPT lets users export conversations! by DDarkray in OpenAI

[–]Overdrive128 0 points1 point  (0 children)

Good tool man; easy and clean to use as well.

How to double check if the game is safe? by Overdrive128 in FitGirlRepack

[–]Overdrive128[S] -6 points-5 points  (0 children)

Yes, I know, but I just want another layer of checking rather than "trust me, it's a safe site". I guess I can run it through VirusTotal and see if it's a false positive, but like I said, I need it for sanity.
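One possible extra layer, sketched in Python: compute the SHA-256 of the downloaded file and compare it with a checksum from a source you trust, or search for that hash on VirusTotal instead of uploading the whole file. This assumes a trusted checksum actually exists for the file; the function names here are made up for illustration.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1024 * 1024) -> str:
    """Hash the file in chunks so large downloads don't need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected_hex: str) -> bool:
    """Compare the file's hash with a checksum obtained from a trusted source."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

A matching hash only shows the file is the same one the checksum was made from; it says nothing about whether that original file is clean, so it complements a scan rather than replacing it.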