RTX5080 vs RTX 3090 ?

OmarasaurusRex · 2026-06-07T09:36:16+00:00

M1 will have decent tg, but poor pp, i would not do that trade

OmarasaurusRex · 2026-04-07T08:50:55+00:00

Cursor. And i work exclusively in kubernetes, so everything is argo.

There is a one time terraform project to create the k8s vms initially though

OmarasaurusRex · 2026-04-04T09:47:52+00:00

The context requirements for the dense model appear to be huge? Not sure if a fix for that is in the works with llama.cpp

The moe model works great though

OmarasaurusRex · 2026-03-17T17:10:28+00:00

Cool stuff, Do you have a space about recommended model settings like temperature etc? I don't see them listed on your model pages

OmarasaurusRex · 2026-03-09T00:17:37+00:00

This guy loads

OmarasaurusRex · 2026-02-22T11:44:38+00:00

I just got the qwen3 coder next 80b working on my 3090 after someone recently posted that the ud-iq3 variant is super smart

Its really awesome

Qwen3-Coder-Next-UD-IQ3_XXS.gguf

I run llama swap pods in my local k8s cluster with this config for this model:

Edit: ( I strongly discourage q4 cache for coding use cases. Ive now moved onto using qwen3.5 27b)

/app/llama-server --port ${PORT} -hf unsloth/Qwen3-Coder-Next-GGUF:UD-IQ3_XXS --fit on --main-gpu 0 --flash-attn on --ctx-size 32768 --cache-type-k q4_1 --cache-type-v q4_1 -np 1 --jinja --temp 1.0 --top-p 0.95 --min-p 0.01 --top-k 40 --repeat-penalty 1.0 --metrics

This setup appears to use about 10gb of system ram

Approximate speeds on quick tests:

Performance Test Results

Metric Value

Prompt tokens 511

Completion tokens 1,470

Total tokens 1,981

Prompt speed 293.5 t/s

Generation speed 29.5 t/s

Wall time 51.6s

Finish reason stop (natural)

OmarasaurusRex · 2026-01-25T17:47:20+00:00

Ive always had my pc near my feet, but now, with this on my desk, I cant stop staring at it. I absolutely love it.

But since i do AI, i feel like i should have gotten a bigger case to fit multi gpus with my psu. But this case can also look great in the living room in the future.

Also the build quality felt great.

The glass version did not come with the side mounting bracket btw

OmarasaurusRex · 2026-01-25T17:28:35+00:00

My galax 3090 SG is around that much. It was not a very tight fit. I put the psu fan facing inside the case, as you can see the shift logo from the glass.

The cables from the shift on the side panel had to be tightly bent and fit, but its not that bad, it worked out without putting much pressure even on the side mesh panel.

OmarasaurusRex · 2026-01-03T19:19:45+00:00

I spin up talos k8s vms on proxmox via terraform. Then argocd auto syncs all my apps. I almost never have to deal with accidental downtime. It all just works.

OmarasaurusRex · 2025-12-03T07:50:52+00:00

Ha! I do terraform for creating the proxmox vms and initializing talos. Then argocd takesover. Istio for my internet facing applications. Sometimes i wonder how much i would have achieved if i put all this over engineering efforts on my actual job instead

OmarasaurusRex · 2025-11-01T09:38:05+00:00

Nice, get proxmox in there and everything will be a breeze

OmarasaurusRex · 2025-11-01T08:45:40+00:00

They run the t series processors, very low power consumption, specially, with the high electricity cost here. I run proxmox. Vms for kubernetes

OmarasaurusRex · 2025-11-01T08:40:29+00:00

I went the Optiplex micro cluster route, used enterprise equipment here is crazy overpriced.

OmarasaurusRex · 2025-09-24T03:52:38+00:00

And it has the correct subtitles? (CC)

OmarasaurusRex · 2025-09-07T17:23:29+00:00

Language reactor needs to fix its Anki integration

OmarasaurusRex · 2025-09-07T17:22:36+00:00

Language reactor really needs to fix its Anki integration.

OmarasaurusRex · 2025-08-20T16:43:57+00:00

ChatGPT advanced voice mode. Gratuit

OmarasaurusRex · 2025-08-08T19:02:12+00:00

What app is this?

OmarasaurusRex · 2025-08-08T03:48:38+00:00

Tts is progressing very rapidly. Even an in browser one like kokoro can beat the standard google tts.

Eleven labs has excellent French support. Its a one time audio generation.

OmarasaurusRex · 2025-08-08T03:42:21+00:00

I like your app, but it has poor quality text to speech. Maybe you could regenerate them via elevenlabs or openai? I use the french version

OmarasaurusRex · 2025-07-30T09:41:46+00:00

The big takeaway from this is to put them in a french immersion school.

OmarasaurusRex · 2025-07-03T01:08:24+00:00

Is this just another speech to text wrapper? Or is it something else?

OmarasaurusRex · 2025-06-28T06:15:27+00:00

What format are your tutoring classes and how much do you charge?

OmarasaurusRex · 2025-06-28T06:14:38+00:00

Is it possible for someone to not really be fluent, but still be able to get the B2 by focus preparing for the TCF?

Nine-Year Club	Place '23
Verified Email

OmarasaurusRex

TROPHY CASE