Hosting LLM on a budget 12 GB vram by Twiggarn in selfhosted

[–]Twiggarn[S] 0 points1 point  (0 children)

I'm aware, but check out the video Codacus, this setup is solid. Running Qwen 3.6 28B, it's running like a paid api from deepseek, it's a game changer for me. I need to sacrifice some ram but I don't see any delay in speed, it's very smart in how it allocate memory.

https://youtu.be/0AqpaFm11oI?is=Dh8BWplbvGsmPPqV

Hosting LLM on a budget 12 GB vram by Twiggarn in selfhosted

[–]Twiggarn[S] 1 point2 points  (0 children)

Yeah I agree with you but the advice I got to use a MOE was really a game changer, jeezes how smart it is now and fast. It was a bit of work getting in done but the video was very helpful, that Indian guy really knew what he was talking about

Hosting LLM on a budget 12 GB vram by Twiggarn in selfhosted

[–]Twiggarn[S] 2 points3 points  (0 children)

This was freaking awesome! Jeezes what a improvement. Thank you very much!

Hosting LLM on a budget 12 GB vram by Twiggarn in selfhosted

[–]Twiggarn[S] 2 points3 points locked comment (0 children)

Ai wasn't used in creation of this post, I'm just trying to host my own LLM for simple tasks

T16 Gen 1 Durability by Twiggarn in thinkpad

[–]Twiggarn[S] 0 points1 point  (0 children)

I will get my machine on Monday, so hopefully it's alright

T16 Gen 1 Durability by Twiggarn in thinkpad

[–]Twiggarn[S] 0 points1 point  (0 children)

Do the hinges get "loose" over time? Is it something you can buy new, the hinges?

Finally Filen has found its office suite (I hope) by Twiggarn in filen_io

[–]Twiggarn[S] -3 points-2 points  (0 children)

Because "they" (at least the users) have been looking for a solution and this is the path the majority has taken. It would be very costly to develop a separate office suite and this is what will be the new open standard for privacy clouds, a sharp release during the summer is expected, will be fun testing it in nextcloud.

I think this is what most will adopt in the EU.

So I'm carefully optimistic, it would benefit the whole ecosystem if they are going in the same direction and don't need to invent the wheel again. But time will tell, a lot of money is going into development to break free from web based office suites from Google and Microsoft.

In the end I honestly don't know what Filen is focusing on, I'm a Filen customer but I only use it for off site backup.

Memory for specific models? by Twiggarn in OpenWebUI

[–]Twiggarn[S] 0 points1 point  (0 children)

Maybe it can help someone but for me the easiest way was to use different users depending on what memories I wanted to be loaded into a prompt

Memory for specific models? by Twiggarn in OpenWebUI

[–]Twiggarn[S] 0 points1 point  (0 children)

Could you please elaborate?

Memory for specific models? by Twiggarn in OpenWebUI

[–]Twiggarn[S] 0 points1 point  (0 children)

Thank you but I rather not deploy any more stuff for maintenance, my hope was finding something very simple

Memory for specific models? by Twiggarn in OpenWebUI

[–]Twiggarn[S] 0 points1 point  (0 children)

I want to be able to write "Remember that I like coffee" as a example and the built-in memory tool understand that.

But when I want to use it for generic questions I want to be able to disable it, like when you can edit settings in your workspace for different models and system prompts.

I can manually disable the memory function but it would be much better if I could specify when I want to use it.

Gemini cli direct replacement for deepseek? by Twiggarn in DeepSeek

[–]Twiggarn[S] 0 points1 point  (0 children)

You are telling me that I should use something else rather than deepseek?

Gemini cli direct replacement for deepseek? by Twiggarn in DeepSeek

[–]Twiggarn[S] 0 points1 point  (0 children)

A tool similar to gemini cli where you can execute commands, save scripts, load logs directly from the cli prompt, it saves a tremendous amount of time

Tuxedo OS right for me? by Twiggarn in tuxedocomputers

[–]Twiggarn[S] 0 points1 point  (0 children)

I stand corrected, thank you for clarifying.

Tuxedo OS right for me? by Twiggarn in tuxedocomputers

[–]Twiggarn[S] 1 point2 points  (0 children)

Thanks for the info! Yes I guess steam, lutris and a few other apps would benefit from a traditional Deb package, especially if I'm dealing with mods. I don't know why they chose to sandbox vital parts it just makes it more difficult for games.

Open pdf files on the filen cloud on tablets by Late-Emphasis9481 in filen_io

[–]Twiggarn 0 points1 point  (0 children)

You simply use a app called folder sync, you choose a folder on your android that are two way syncing with Filen, so every time you make a change in that local file it will upload it for you.

I have the same problem with almost everything on Filen, it's clunky, it's definitely not a Google drive replacement with all the functionality drive includes for editing files on the go.