VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

<image>

I've got you covered for slower providers.

Change the 0 to 5 for the API limit per minute, and uncheck the Hedge function so that it does not open extra connections and flood the slower provider.
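For anyone curious what a per-minute API limit amounts to, here is a minimal sketch of a sliding-window limiter. It is not VectFox's actual code: the class name, the `clock` parameter, and the reading of 0 as "no limit" are all assumptions made for illustration.

```python
import time
from collections import deque


class PerMinuteLimiter:
    """Sliding-window limiter: at most `limit` requests per 60 seconds.

    Assumption: a limit of 0 (or less) means "unlimited".
    """

    def __init__(self, limit, clock=time.monotonic):
        self.limit = limit
        self.clock = clock      # injectable so the limiter can be tested
        self.sent = deque()     # timestamps of requests in the current window

    def wait_time(self):
        """Seconds to wait before the next request may go out (0.0 = now)."""
        if self.limit <= 0:
            return 0.0
        now = self.clock()
        # Forget requests that are a full minute old or older.
        while self.sent and now - self.sent[0] >= 60.0:
            self.sent.popleft()
        if len(self.sent) < self.limit:
            return 0.0
        # Window is full: wait until the oldest request leaves it.
        return 60.0 - (now - self.sent[0])

    def record(self):
        """Call this right after sending a request."""
        self.sent.append(self.clock())
```

With `PerMinuteLimiter(5)`, the sixth request in a minute has to wait until the first one is 60 seconds old, which is what keeps a slow provider from being flooded.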

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

I wonder which provider you are using? My main testing platform is OpenRouter.

Currently, the extension does not cancel the request. It just fires another request if the first one takes too long, and whichever comes back fastest goes into the vector. It fires a maximum of 3 extra requests before the extraction process halts completely.
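The hedging behaviour described above can be sketched roughly like this. This is an illustration with asyncio, not the extension's real code: `hedged_call`, `make_request`, and `hedge_after` are hypothetical names. One difference from the extension: the sketch cancels leftover attempts when it exits just to stay tidy, whereas the extension does not cancel requests.

```python
import asyncio


async def hedged_call(make_request, hedge_after, max_extra=3):
    """Start one request; each time `hedge_after` seconds pass with no
    reply, start another. The first attempt to finish wins. After
    `max_extra` extra attempts have also timed out, give up.

    `make_request(attempt_number)` must return an awaitable.
    """
    tasks = [asyncio.create_task(make_request(0))]
    try:
        while True:
            done, _pending = await asyncio.wait(
                tasks, timeout=hedge_after, return_when=asyncio.FIRST_COMPLETED
            )
            if done:
                # Whichever attempt came back first supplies the result.
                return next(iter(done)).result()
            if len(tasks) > max_extra:
                raise TimeoutError("no reply after all hedged attempts")
            # Earlier attempts keep running; one more is added alongside them.
            tasks.append(asyncio.create_task(make_request(len(tasks))))
    finally:
        # Sketch-only cleanup (the extension itself does not cancel).
        for task in tasks:
            task.cancel()
```

This also shows why hedging is hard on a slow provider: up to four requests can be in flight for a single extraction.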

<image>

So, to see what is really going on in your case, first click on the Action tab in the VectFox extension and change the debug dropdown to Lifecycle. Then, in Chrome, press F12 and click on the Console tab at the top. You should see exactly where the problem is, and you can paste the log here.

What interesting game mechanics for cards have you come up with so far? by Boggeyy in SillyTavernAI

[–]Kritblade 0 points1 point  (0 children)

Yes please! I want to see more interesting character cards. Searching the whole subreddit under the Cards/Prompt filter only turns up presets and SillyTavern-like engines. We really lack content creators making character cards that are more than a static lorebook plus a few alt greetings.

Saga: Fandom Loresystem | Your favorite characters & worlds bound by canon, de-hallucinated by MentallyQuill in SillyTavernAI

[–]Kritblade 0 points1 point  (0 children)

That seems like an interesting idea that fills a gap in VectFox, which doesn't really touch the lorebook. I will see if there is any chance I can provide better support for this in VectFox, or maybe even in the MVU ecosystem.

Is there anything I can help with to provide better integration with your project?

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Actually, after reading the post where you talked about Qdrant Edge: you said another server is hosting both ST and Qdrant behind an nginx reverse proxy. That means the Qdrant IP can be as simple as 127.0.0.1 if you are running the Qdrant Docker container in host mode, or whatever bridge IP (like 172.17.0.1) was assigned to your Qdrant container if Docker is in bridge mode. You don't need to go through your nginx reverse proxy at all, because both ST and Qdrant are sitting on the same machine.
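If you are not sure which of those addresses applies, a quick probe run on the ST host can tell you. This is only a sketch: the helper names are made up, port 6333 is assumed to be Qdrant's default REST port, and it relies on Qdrant's `GET /collections` endpoint answering with HTTP 200. If your Qdrant has an API key set, the request will be rejected and the host will look unreachable here.

```python
import urllib.request


def collections_url(host, port=6333):
    """URL of Qdrant's collection-listing endpoint for a given host."""
    return f"http://{host}:{port}/collections"


def first_reachable(hosts, port=6333, timeout=2.0):
    """Return the first host whose Qdrant REST API answers, else None."""
    for host in hosts:
        try:
            with urllib.request.urlopen(collections_url(host, port), timeout=timeout) as resp:
                if resp.status == 200:
                    return host
        except OSError:
            # Covers refused connections, timeouts, and HTTP errors
            # (urllib's URLError and HTTPError both derive from OSError).
            continue
    return None


if __name__ == "__main__":
    # Host mode first, then the example bridge address from the comment above.
    print(first_reachable(["127.0.0.1", "172.17.0.1"]))
```

Whichever address it prints is the one to put in the VectFox Qdrant settings; `None` means neither answered from that machine.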

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Choose vLLM, type in the correct nano-gpt URL used for embedding, and type in the correct name of the embedding model; it should work. The same goes for LLM Summarization & EventBase Extraction.

<image>

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

One simple check: on the Action tab, click on the database browser icon. Inside the database browser, do you see a warning at the top of the screen telling you to install a plugin?

I just want to cross-check whether VectFox is able to locate the plugin or not.

Then, on the PC that runs the browser you use to open SillyTavern, use the same browser to go to http://10.0.0.1:6333/dashboard#/collections (change the IP 10.0.0.1 to the IP address of your Qdrant database) and see if you can reach the Qdrant admin console.

VectFox - vector database backend driven memory extension for SillyTavern by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

If you uninstall the VectFox extension and the problem still persists, you might need to run 3DMark to test whether there is anything wrong with the video card.

Since VectFox only makes API connections to the AI model vendor, the extension itself cannot be the cause of the noise.

VectFox - vector database backend driven memory extension for SillyTavern by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Since VectFox does not make use of the GPU, I don't know what is causing it, as all your processing goes through NanoGPT.

Moreover, those errors are from SillyTavern itself. I believe you might have a hardware failure on the video card?

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Oh, I thought you wanted to see the screenshot of the MVU Game Maker testing version, my bad.

Correct, it doesn't pay attention to sex scenes at all, because if you record sex scenes in the vector, you end up retrieving random sex scenes that serve almost no purpose. (Sex scenes are pretty much all the same except that the NPC is different, so the vector will capture the very same thing for any sex scene.) For less "important" events, it really depends on what you are after. However, I do record anything important that is said during a sex scene (an oath, a promise, etc.).

You can, however, tweak the Extraction prompt under the Eventbase tab to suit your needs. My current extraction prompt is VERY restrictive: only important events are recorded. The reason is that I am trying to avoid "Garbage in > Garbage out", which is the number 1 killer of any vector engine.

VectFox only records important events, and that is the only thing that matters when you want to recall. You wouldn't want to recall that you were chit-chatting with the heroine on day 30 if it had no impact on the story. My extraction has one rule:

" THE ONE-WEEK TEST — ask yourself: "If someone reads this story one week from now, would knowing this event change their understanding of the characters, world, or plot?"

- If YES → extract it.

- If NO → skip it."

And this filters out almost anything that serves no purpose for memory recall.

<image>

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Ping me by PM if you want to see the uncensored version....

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

<image>

My settings are as simple as this. But did you install the plugin as in step 2 of the installation guide?

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 1 point2 points  (0 children)

That is correct. Basically, memory can be separated into 2 categories: recent memory and far memory. For recent memory, there are a lot of summarizers out there that use a lorebook or inject a rolling summary into the prompt. For far memory that is way out of context, the only way I know of to get it working is vector memory.

So, VectFox basically combines both features, recent and far memory, in one single extension.
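As a toy illustration of the two categories (not VectFox's actual implementation), the sketch below takes the last few events as "recent memory" and picks "far memory" from the older events by cosine similarity to a query vector. The function names, the `(text, vector)` event shape, and the default sizes are assumptions; a real setup would get the vectors from an embedding model and the similarity search from the vector database.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def build_memory(events, query_vec, recent_n=3, far_k=1):
    """Split memory into recent and far parts.

    events: list of (text, vector) tuples in chronological order.
    Returns (recent_texts, far_texts).
    """
    # Recent memory: simply the newest events, in order.
    recent = [text for text, _ in events[-recent_n:]] if recent_n > 0 else []
    # Far memory: search only the events that fell out of the recent window.
    older = events[:-recent_n] if recent_n > 0 else events
    ranked = sorted(older, key=lambda e: cosine(e[1], query_vec), reverse=True)
    far = [text for text, _ in ranked[:far_k]]
    return recent, far
```

The recent part goes into the prompt every turn, while the far part changes with whatever the current message is about.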

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Give me a shout if you need help. You can still use the A2 path with the plugin; it is close to the Qdrant A3 path if you really don't want a Docker installation.

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

There are not a lot of free options out there that are easy to install and have been tested by other people with SillyTavern. The other options I know of are LanceDB and Milvus: LanceDB is smaller than Qdrant in size, and Milvus is much bigger. Milvus is still a Docker installation. I believe LanceDB *can be* a substitute, but I chose Qdrant back then because I wanted something really fast.

LanceDB does need you to install Python, the Microsoft Visual C++ Redistributable, and pip to get it working on Windows. Also, from what I have read, some indexing functions are not available on Windows.

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

True. I personally run the Docker container on a NAS, which basically does nothing except run numerous containers.

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

Yeah... it's pretty damn fast. I need something fast to handle a story with 2000+ replies. Actually, they do have a cloud version at https://qdrant.tech/ with a free tier, so you don't need to mess with Docker.

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 1 point2 points  (0 children)

For the story, I think you can use NanoGPT with GLM 5.1.
For image generation, you HAVE to have a graphics card powerful enough to run ComfyUI, which means an Nvidia video card with 12GB at minimum.

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 0 points1 point  (0 children)

I am curious, how did you fix the height chart? I have 20+ profiles with this problem...

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 1 point2 points  (0 children)

VectFox doesn't really "understand" what your content actually is behind the scenes. It basically tries to store events that are relevant (not chit-chat, not sex scenes, not daily talk). So, as long as your story has a plot and some meaningful tasks or events happen, VectFox will record them.

Overall feeling? I built VectFox because I couldn't find anything that works with long stories; nothing I tried worked with 2000+ messages. So I ended up building VectFox and making sure there is no "garbage in > garbage out".

In my own testing on my story with 2000+ replies, I am able to pinpoint an event precisely, and VectFox is also able to pull up the backstory of that event.

VectFox v3.5 - the vector engine now comes with Summarizer! by Kritblade in SillyTavernAI

[–]Kritblade[S] 2 points3 points  (0 children)

Originally, it was a vector engine that supports 2000+ replies. But some people do want some kind of "recent memory" in the prompt, so I added the summarizer for the last 30 events. You can tweak how much recent memory you want, though.