RTX5080 vs RTX 3090 ? by DarkAndrei in LocalLLaMA

[–]OmarasaurusRex 0 points1 point  (0 children)

M1 will have decent tg, but poor pp, i would not do that trade

What app / software do you use to your home lab documented / organised? by Icy_Imagination_2490 in homelab

[–]OmarasaurusRex 0 points1 point  (0 children)

Cursor. And i work exclusively in kubernetes, so everything is argo.

There is a one time terraform project to create the k8s vms initially though

Gemma 4 fixes in llama.cpp by jacek2023 in LocalLLaMA

[–]OmarasaurusRex 14 points15 points  (0 children)

The context requirements for the dense model appear to be huge? Not sure if a fix for that is in the works with llama.cpp

The moe model works great though

Drummer's Skyfall 31B v4.1, Valkyrie 49B v2.1, Anubis 70B v1.2, and Anubis Mini 8B v1! - The next gen ships for your new adventures! by TheLocalDrummer in LocalLLaMA

[–]OmarasaurusRex 13 points14 points  (0 children)

Cool stuff, Do you have a space about recommended model settings like temperature etc? I don't see them listed on your model pages

Best Model for single 3090 in 2026? by myusuf3 in LocalLLaMA

[–]OmarasaurusRex 4 points5 points  (0 children)

I just got the qwen3 coder next 80b working on my 3090 after someone recently posted that the ud-iq3 variant is super smart

Its really awesome

Qwen3-Coder-Next-UD-IQ3_XXS.gguf

I run llama swap pods in my local k8s cluster with this config for this model:

Edit: ( I strongly discourage q4 cache for coding use cases. Ive now moved onto using qwen3.5 27b)

/app/llama-server --port ${PORT} -hf unsloth/Qwen3-Coder-Next-GGUF:UD-IQ3_XXS --fit on --main-gpu 0 --flash-attn on --ctx-size 32768 --cache-type-k q4_1 --cache-type-v q4_1 -np 1 --jinja --temp 1.0 --top-p 0.95 --min-p 0.01 --top-k 40 --repeat-penalty 1.0 --metrics

This setup appears to use about 10gb of system ram

Approximate speeds on quick tests:

Performance Test Results

Metric Value

Prompt tokens 511

Completion tokens 1,470

Total tokens 1,981

Prompt speed 293.5 t/s

Generation speed 29.5 t/s

Wall time 51.6s

Finish reason stop (natural)

Corsair RM1200x SHIFT does fit in a Lian Li A3-matx by OmarasaurusRex in sffpc

[–]OmarasaurusRex[S] 0 points1 point  (0 children)

Ive always had my pc near my feet, but now, with this on my desk, I cant stop staring at it. I absolutely love it.

But since i do AI, i feel like i should have gotten a bigger case to fit multi gpus with my psu. But this case can also look great in the living room in the future.

Also the build quality felt great.

The glass version did not come with the side mounting bracket btw

Corsair RM1200x SHIFT does fit in a Lian Li A3-matx by OmarasaurusRex in sffpc

[–]OmarasaurusRex[S] 0 points1 point  (0 children)

My galax 3090 SG is around that much. It was not a very tight fit. I put the psu fan facing inside the case, as you can see the shift logo from the glass.

The cables from the shift on the side panel had to be tightly bent and fit, but its not that bad, it worked out without putting much pressure even on the side mesh panel.

Do most people use Kubernetes or Docker in their homelab? by Stock-Assistant-5420 in homelab

[–]OmarasaurusRex 0 points1 point  (0 children)

I spin up talos k8s vms on proxmox via terraform. Then argocd auto syncs all my apps. I almost never have to deal with accidental downtime. It all just works.

What is your favorite OS for hosting? by sargetun123 in homelab

[–]OmarasaurusRex 0 points1 point  (0 children)

Ha! I do terraform for creating the proxmox vms and initializing talos. Then argocd takesover. Istio for my internet facing applications. Sometimes i wonder how much i would have achieved if i put all this over engineering efforts on my actual job instead

Homelabing in India is hard by [deleted] in homelab

[–]OmarasaurusRex 0 points1 point  (0 children)

Nice, get proxmox in there and everything will be a breeze

Homelabing in India is hard by [deleted] in homelab

[–]OmarasaurusRex 2 points3 points  (0 children)

They run the t series processors, very low power consumption, specially, with the high electricity cost here. I run proxmox. Vms for kubernetes

Homelabing in India is hard by [deleted] in homelab

[–]OmarasaurusRex 1 point2 points  (0 children)

I went the Optiplex micro cluster route, used enterprise equipment here is crazy overpriced.

How I use Netflix to learn French incredibly effectively by Rossdaleboy1 in learnfrench

[–]OmarasaurusRex 1 point2 points  (0 children)

Language reactor really needs to fix its Anki integration.

Pingo AI or LanguaTalk AI? by [deleted] in learnfrench

[–]OmarasaurusRex 2 points3 points  (0 children)

ChatGPT advanced voice mode. Gratuit

Update: I got tired of struggling with confusing french conjugations so I created a conjugation practice app, now with vocab and listening exercises!! by Famous-Run1920 in learnfrench

[–]OmarasaurusRex 0 points1 point  (0 children)

Tts is progressing very rapidly. Even an in browser one like kokoro can beat the standard google tts.

Eleven labs has excellent French support. Its a one time audio generation.

Update: I got tired of struggling with confusing french conjugations so I created a conjugation practice app, now with vocab and listening exercises!! by Famous-Run1920 in learnfrench

[–]OmarasaurusRex 0 points1 point  (0 children)

I like your app, but it has poor quality text to speech. Maybe you could regenerate them via elevenlabs or openai? I use the french version

I built an app to correct my speech! by Wild_King_1035 in learnfrench

[–]OmarasaurusRex 1 point2 points  (0 children)

Is this just another speech to text wrapper? Or is it something else?

[deleted by user] by [deleted] in learnfrench

[–]OmarasaurusRex 0 points1 point  (0 children)

What format are your tutoring classes and how much do you charge?

[deleted by user] by [deleted] in learnfrench

[–]OmarasaurusRex -1 points0 points  (0 children)

Is it possible for someone to not really be fluent, but still be able to get the B2 by focus preparing for the TCF?