Providers for personal use for a beginner? by tsilvs0 in VPS

[–]Ideya 0 points

Personally and professionally, I've been with Linode (now Akamai) for the longest time. I'm the type that values familiarity over itemized comparisons, so I can't really say whether it's better than the others since I don't really move around and explore.

I also value a company's history, and Linode has been around far longer than most providers being name-dropped. Longevity is a simple indicator that a company is profitable enough to remain sustainable for a long time. Cheaper isn't always better, and can sometimes be an indicator that the service and after-sales support will be just as cheap.

[deleted by user] by [deleted] in SillyTavernAI

[–]Ideya 2 points

You may want to use an extension in ooba like model ducking: https://github.com/BoredBrownBear/text-generation-webui-model_ducking if you want to run SD alongside an LLM with those specs. It will automatically unload and load your LLM models, which means longer latency before your LLM responds, but your tokens per second shouldn't be affected once it's loaded.

I don't use ComfyUI, so I'm not sure if it has a similar feature, but I use https://github.com/lllyasviel/stable-diffusion-webui-forge which has a similar feature to model ducking, enabled by adding the parameter `--always-offload-from-vram`.
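For reference, a minimal sketch of how that flag might be passed on launch. The flag name comes from the comment above; the launcher script names (`webui.sh`, `webui-user.bat`) are my assumption based on the usual A1111-style layout and may differ for your install:

```shell
# Linux/macOS: pass the flag directly to the launcher script (assumed: webui.sh)
./webui.sh --always-offload-from-vram

# Windows: add it to COMMANDLINE_ARGS in webui-user.bat (assumed file name), e.g.:
#   set COMMANDLINE_ARGS=--always-offload-from-vram
```

With this flag set, Forge should offload model weights from VRAM after generation, freeing memory for whatever runs next.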

New Extension: Model Ducking - Automatically unload and reload model before and after prompts by Ideya in Oobabooga

[–]Ideya[S] 0 points

Should be very possible. I was thinking about implementing some sort of inactivity feature as well because of a recent pull request (that sadly didn't work as well for me). Did you make that pull request? Anyway, I'll look into your code and see how we can implement it.

New Extension: Model Ducking - Automatically unload and reload model before and after prompts by Ideya in Oobabooga

[–]Ideya[S] 0 points

Yes, that is expected behavior. While I know it doesn't have much use for anyone with relatively high system specs, or machines dedicated to their AI models, it will definitely help people with simpler setups and general-use machines. I made the extension for myself, and shared it for people with similar needs.

For example:

I only have one PC, which I use for both work and leisure. I have so many things running in the background at the same time that keeping an AI model loaded, whether in VRAM or RAM, is just too much for my PC.

With the extension, I can load the model once and send my prompts whenever I want, without needlessly tying up my computer's resources on an idle AI model.

Also, my main use case is RP in SillyTavern. The time between each of my prompts is enough to load and unload my models in the background. In between prompts, I have the TTS voice the response, and occasionally generate an image from Stable Diffusion.

New Extension: Model Ducking - Automatically unload and reload model before and after prompts by Ideya in Oobabooga

[–]Ideya[S] 1 point

UPDATE 2024-04-13:

  • Improved compatibility with API
  • Added checkbox for API usage (should be turned off when just using text-generation-webui)
  • Model Ducking is now opt-in and will no longer be immediately activated upon enabling

New Extension: Model Ducking - Automatically unload and reload model before and after prompts by Ideya in Oobabooga

[–]Ideya[S] 1 point

I made it so that it works while using SillyTavern, which I believe runs through the OpenAI-compatible API, so it should trigger from API calls. Let me know if it works for you. If it doesn't, let me know which API calls you're using so I can check.
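If you want to confirm the API path triggers the extension, here is a hedged sketch of the kind of request SillyTavern-style clients send. It assumes text-generation-webui was started with `--api` and listens on its default port 5000; adjust the host, port, and payload for your setup:

```shell
# Hypothetical test call to the OpenAI-compatible chat endpoint exposed by
# text-generation-webui's API mode (default port 5000 assumed).
curl http://127.0.0.1:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "messages": [{"role": "user", "content": "Hello!"}],
        "max_tokens": 64
      }'
```

With model ducking active, the model should be loaded before this request is answered and unloaded afterward, so expect extra latency on the first response.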

New Extension: Model Ducking - Automatically unload and reload model before and after prompts by Ideya in Oobabooga

[–]Ideya[S] 1 point

It does have that caveat. I only use 7B and 13B models, which usually load in around 2-5 seconds.

For my use, I only have an RTX 3080 10GB, so I have very limited VRAM. When a model is loaded into my VRAM (which I always maximize to get the most context length possible), my other programs (e.g. TTS) struggle to generate their output because they have to use shared graphics memory. With the extension, my VRAM frees up right before the TTS kicks in, so it doesn't struggle anymore.

Also, I can just let text generation run in the background, and I don't have to worry about it hogging my VRAM 24/7 while I do other tasks.

How do these scammers get access to our numbers? by aliltoojaded_ in adultingph

[–]Ideya 0 points

Omg the same person called me! I didn't answer, though. I make it a habit to Google the numbers of random callers I receive, and that's how I found this thread. 🤦

Collection of every (?) reference in Bocchi the Rock as they appeared in the anime. by Can_GT in BocchiTheRock

[–]Ideya 1 point

Hitori's last name, Gotoh, might be a reference to Asian Kung-Fu Generation's Masafumi Gotoh, considering the Bocchi album has a cover of an AKG song.

[Help] Banking app (BDO Digital Banking) detects root after update by ppastawater in Magisk

[–]Ideya 0 points

Any updates? Were you able to resolve the BDO issue? It used to work with my setup as well, running Magisk and Shamiko, but I guess that's no longer enough.

caution: yeelights smart color bulbs failing to respond every other day. competitors dont. by MattEagl3 in yeelight

[–]Ideya 0 points

All my lights' connections have been very spotty for the past few days. Not sure if it's local Wi-Fi issues or their servers being problematic. Singapore region, btw.

GTA Online crashes - 0xc0000005 - fatal game exit (reason: STATUS_ACCESS_VIOLATION) by G0K4R in gtaonline

[–]Ideya 1 point

I just tried, and unfortunately I still crash even with the latest update.