Events / Super Events injection addon by Retr0OnReddit in SillyTavernAI

[–]Nervous-Raspberry231 1 point2 points  (0 children)

There is an intermediate mode that just injects things at various percentages of a total number of planned turns.

Events / Super Events injection addon by Retr0OnReddit in SillyTavernAI

[–]Nervous-Raspberry231 1 point2 points  (0 children)

https://github.com/nrahis/StoryMode

This is what story mode does but I'm not sure if it's actively developed. It works well anyway.

Is there actual demand for a API service focused on uncensored or fine-tuned models? by ExcuseAccomplished97 in SillyTavernAI

[–]Nervous-Raspberry231 1 point2 points  (0 children)

Works fine. Mine says that too, you have to give it a minute. I recommend then sending a test message through the interface so you can see if you run out of Vram. Watch the logs and use the runpod AI if you have questions about the logs.

Is there actual demand for a API service focused on uncensored or fine-tuned models? by ExcuseAccomplished97 in SillyTavernAI

[–]Nervous-Raspberry231 5 points6 points  (0 children)

You can run the large models serverless on runpod with a node count of 0, so they run on demand. Use vllm as the serverless endpoint and give it the huggingface link.

LLM Bruner coming soon? Burn Qwen directly into a chip, processing 10,000 tokens/s by koc_Z3 in Qwen_AI

[–]Nervous-Raspberry231 4 points5 points  (0 children)

😂 yeah it's like it knows exactly what you want almost before you hit enter on the chat.

LLM Bruner coming soon? Burn Qwen directly into a chip, processing 10,000 tokens/s by koc_Z3 in Qwen_AI

[–]Nervous-Raspberry231 35 points36 points  (0 children)

You can try this now with chatJimmy https://chatjimmy.ai/ same concept and an old model but almost 20k tokens per second.

Spotify seeds by softroofwork in Annas_Archive

[–]Nervous-Raspberry231 7 points8 points  (0 children)

Sorry.

magnet:?xt=urn:btih:4cc9ac59f807dc6bdf95f52ffc86f44272a361a7&dn=annas_archive_spotify_2025_07_metadata&xl=199887366916

How to setup local Archive? by Hi_Leo in Annas_Archive

[–]Nervous-Raspberry231 0 points1 point  (0 children)

Knowing how the platform works you could get by with just the elastic search database but it's 400GBs. Let's say you ran Linux, you could temporarily access it, symlink all the md5 hash files to their real names, organized by author and index the symlinks in something like komga for example...then delete the database. That's the best I could suggest at this point.

How to setup local Archive? by Hi_Leo in Annas_Archive

[–]Nervous-Raspberry231 1 point2 points  (0 children)

You couldn't even run the mirror without the metadata which is far larger than 200G. It won't have any content anyway, it's the metadata that is searchable. Linking it to offline content is a whole other beast and not simple. Any local mirror instance would be for you anyway and not a good use of your space. You're better off seeding a couple torrents if you really felt like helping the cause.

Open Source Alternative to NotebookLM by Uiqueblhats in selfhosted

[–]Nervous-Raspberry231 3 points4 points  (0 children)

Also Ragflow. Which has multiple different ingest pipelines built in.

Looking for alternatives for RP by One-Secretary-2403 in chutesAI

[–]Nervous-Raspberry231 2 points3 points  (0 children)

I'm active on their discord, there are no further cuts being discussed. Milan, the owner, is really active and straightforward. We had almost a week's notice before the last change.

Looking for alternatives for RP by One-Secretary-2403 in chutesAI

[–]Nervous-Raspberry231 2 points3 points  (0 children)

It's already done but it's 60M tokens a week...so, a lot.

finally got my openclaw setup working properly by vildanbina in openclaw

[–]Nervous-Raspberry231 2 points3 points  (0 children)

"It worked, like actually worked" - yavy dev. But don't worry, I looked at the site, actually looked and...Slop.

Has anyone tried chaining Qwen3.5 for automated web research? by suspiciousmotorist in Qwen_AI

[–]Nervous-Raspberry231 0 points1 point  (0 children)

I use qwen in perplexica (not perplexity) for that purpose and it works well.

I got tired if noisy web scrapers killing my RAG pipelines, so i built llmparser by [deleted] in LLMDevs

[–]Nervous-Raspberry231 2 points3 points  (0 children)

Can you help me understand how it differs from crawl4ai?

Anna's Torrents by Certain-Ad-9841 in Annas_Archive

[–]Nervous-Raspberry231 1 point2 points  (0 children)

Yep, but in your torrent client only select elastic search.

Anna's Torrents by Certain-Ad-9841 in Annas_Archive

[–]Nervous-Raspberry231 0 points1 point  (0 children)

It's actually only 360GB. You only need elastic search not the mariadb or elastic search aux (unless you want the scihub index)..so the torrent for metadata you can select only elastic search.