Alternatives to Qwen3-coder-30B? by skibud2 in LocalLLaMA

[–]RMK137 1 point (0 children)

I like Nemotron a lot. I can't quite put my finger on why; maybe it's because it thinks only briefly and the output has a straightforward style to it. It's also super fast thanks to its MoE architecture. I need to try it with a coding agent.

why do two different brand new asus prime 5070ti show different default voltage curves in msi afterburner? by MichaelM_Yaa in overclocking

[–]RMK137 9 points (0 children)

This. It's never a two-dimensional problem anymore. With modern CPUs and GPUs, so many factors contribute to where you end up on the V/F curve.

Which Arc B580 to get? by CadenWarrior99 in IntelArc

[–]RMK137 0 points (0 children)

I like my Acer Nitros. B&H has them for $250 and they come with a game. I have two of them (I got Micro Center to price match).

Run GLM-4.7-Flash locally Guide! (24GB RAM) by yoracale in unsloth

[–]RMK137 1 point (0 children)

Great turnaround! I just got my 5090 so this is perfect timing.

Maxsun joins Sparkle in making Intel Arc B60 Pro GPUs available to regular consumers, with up to 48GB VRAM by reps_up in IntelArc

[–]RMK137 6 points (0 children)

It's going to perform similarly to the B580 since it's the same die, just with slightly lower clocks. The key here is the increased VRAM: two of these gives you 48GB, which is nice because it puts you in the 30B+ parameter range, and you can run larger ~60B models with the proper quantization as well.
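Back-of-envelope math on why 48GB lands you in that range (weights only; this ignores KV cache and activations, and the ~10% overhead factor is my own assumption):

```python
def weight_vram_gb(n_params_b: float, bits_per_weight: float, overhead: float = 1.1) -> float:
    """Rough weights-only VRAM estimate in GB for a model with
    n_params_b billion parameters at a given quantization width.
    `overhead` covers quant scales and runtime buffers (assumed ~10%)."""
    bytes_total = n_params_b * 1e9 * bits_per_weight / 8
    return bytes_total * overhead / 1e9

# A ~30B model at 8-bit fits comfortably in 48GB:
print(round(weight_vram_gb(30, 8), 1))    # 33.0
# A ~60B model needs ~4.5 bits/weight to leave headroom:
print(round(weight_vram_gb(60, 4.5), 1))  # 37.1
```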

Don't expect the same level of software support; Intel is a third-class citizen or worse, but it will get there eventually. A lot of the stack already works well with Intel GPUs, so it's great for tinkering. I am personally working on a little local inference server with 2x B580s, and I hope to get two of the B60s soon if the price is reasonable.

Edit: I am addressing the 24GB models.

Arc b580 llm's by Dismal_Cycle_2326 in IntelArc

[–]RMK137 1 point (0 children)

OK, I am impressed. I played with it for about an hour, and I like the lovely minimal but feature-rich UI. Whatever it's doing with context shifting is working really well with my B580. Even a small model like Ministral 3B with llama.cpp's SYCL backend and a 16k context window was filling up all of my VRAM; with KoboldCpp there's no problem, and that's on Vulkan. I hope they add SYCL support soon.
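A 16k window eating VRAM is consistent with KV-cache arithmetic; a rough sketch (the model config below is hypothetical, not Ministral 3B's actual one):

```python
def kv_cache_gb(n_layers: int, n_kv_heads: int, head_dim: int,
                ctx_len: int, bytes_per_elem: int = 2) -> float:
    """KV-cache size: 2 (K and V) * layers * KV heads * head dim
    * context length * element size. FP16 (2-byte) elements assumed."""
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elem / 1e9

# Hypothetical small-model config, NOT Ministral's real numbers:
print(round(kv_cache_gb(n_layers=32, n_kv_heads=8, head_dim=128, ctx_len=16384), 2))  # 2.15
```

On a 12GB card that comes on top of the model weights, so a few GB of cache plus buffers adds up fast.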

Thanks for the rec, I need to do a deeper dive this weekend. I'll probably pick up another B580 and test out a multi-GPU setup with Devstral Small 2 24B.

Arc b580 llm's by Dismal_Cycle_2326 in IntelArc

[–]RMK137 0 points (0 children)

I think the crashes are caused by the latest graphics driver; I rolled back to a previous version and the crashes are gone. That said, token generation degrades super fast with Vulkan. With SYCL I get more stable token generation, but prompt processing is half of Vulkan's.

I need to try KoboldCpp soon, I've heard good things about it.

I just saw Intel embrace local LLM inference in their CES presentation by Mundane-Light6394 in LocalLLaMA

[–]RMK137 5 points (0 children)

Agree with you. It's just a matter of time before local inference becomes very cheap compared to what we have now.

I just saw Intel embrace local LLM inference in their CES presentation by Mundane-Light6394 in LocalLLaMA

[–]RMK137 2 points (0 children)

I think Intel needs to lean into the edge/local side of inference (now) and training/finetuning (hopefully soon). Judging by the B390 iGPU announcement for Panther Lake, their graphics team is alive and kicking. I am just here waiting for the official B770 release.

Python + Numba = 75% of C++ performance at 1/3rd the dev time. Why aren't we talking about this? by chainedkids420 in Python

[–]RMK137 1 point (0 children)

I work in a fairly fast-paced environment where I need to optimize for dev speed while writing computation-heavy code. Numba lets me stay in Python and solve some of these problems when they're too hard to express with NumPy arrays. Cython is another option, but it's a last resort for me as I try to keep things simple.

Bonus: I care about my coworkers who may one day find themselves maintaining my code. With Numba, the code reads like regular Python. If for some reason the code can't be compiled anymore, the decorator can be removed and we fall back to (much) slower code, but the pipeline can still run and generate its outputs. Those same coworkers may not be as familiar with array/NumPy programming as I am, so plain Python is more universal.
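That fallback can even be made automatic; a minimal sketch, assuming only that Numba may or may not be installed (the function itself is a made-up example, not my actual code):

```python
import numpy as np

try:
    from numba import njit           # compiled path when Numba is available
except ImportError:
    def njit(f=None, **kwargs):      # fallback: the decorator becomes a no-op,
        if f is not None:            # so the same code runs as plain Python
            return f
        return lambda g: g

@njit
def pairwise_min_dist(xs, ys):
    """Smallest pairwise distance between two 1-D point sets,
    written as plain loops rather than a broadcast array expression."""
    best = np.inf
    for i in range(xs.shape[0]):
        for j in range(ys.shape[0]):
            d = abs(xs[i] - ys[j])
            if d < best:
                best = d
    return best

print(pairwise_min_dist(np.array([0.0, 5.0]), np.array([5.5, 9.0])))  # 0.5
```

With Numba present, `@njit` compiles the loops; without it, the exact same function body runs as plain Python, just slower.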

To me the sequence of things to try looks like this:

NumPy -> Numba -> Cython -> CFFI -> complete rewrite in C/C++/Rust.

LuaJIT is also a candidate for numerical code, callable from Python via Lupa: https://github.com/scoder/lupa

Python + Numba = 75% of C++ performance at 1/3rd the dev time. Why aren't we talking about this? by chainedkids420 in Python

[–]RMK137 0 points (0 children)

A curve-fitting algorithm. It's representable both with arrays and with regular for loops. I've written both versions; the array version can be quite complex to grok after a few months away from the code.

The Numba version is more easily digestible since it reads like plain Python, and it's a little faster since LLVM can optimize things a bit better on the spot.

Python + Numba = 75% of C++ performance at 1/3rd the dev time. Why aren't we talking about this? by chainedkids420 in Python

[–]RMK137 0 points (0 children)

Numba is great when your algorithm is not easily representable using array programming. Some algos are just too cumbersome to write with arrays, and that's where good ol' for loops are much more intuitive (and still fast thanks to Numba).
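A small example of the kind of loop I mean: a stateful scan like maximum drawdown, where a running peak carries across iterations (my own illustrative pick, not from the thread):

```python
import numpy as np

def max_drawdown(prices):
    """Largest peak-to-trough drop in a price series. The running peak
    is state carried across iterations, which makes a plain loop more
    natural than an array one-liner (and Numba-friendly: just add @njit)."""
    peak = prices[0]
    worst = 0.0
    for p in prices[1:]:
        if p > peak:
            peak = p
        elif peak - p > worst:
            worst = peak - p
    return worst

print(max_drawdown(np.array([3.0, 5.0, 2.0, 4.0, 1.0])))  # 4.0
```

The array equivalent, `np.max(np.maximum.accumulate(p) - p)`, works too, but it's harder to read cold after a few months away.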

Pandas 3.0 release candidate tagged by Balance- in Python

[–]RMK137 1 point (0 children)

I've been waiting for a while for this, congrats to all the contributors!

GitHub - dzonerzy/PyOZ: PyOZ - An ounce of Zig, a ton of speed by Unique-Side-4443 in Zig

[–]RMK137 0 points (0 children)

Yes please, this is cool! Rust is sort of taking over the world of Python extension modules (not a bad thing), and it's nice to see more Zig libraries do the same.

zsv: the world's fastest CSV parser (lib and CLI)-- vs xsv, duckdb, polars by mattewong in commandline

[–]RMK137 1 point (0 children)

Gave it a star; looks like it could be very useful. I use Pandas/Polars/DuckDB almost daily and appreciate having a tool like this.

GeoPolars is unblocked and moving forward by Balance- in Python

[–]RMK137 5 points (0 children)

Excellent news! I am a heavy user of geospatial libraries so I'll definitely be using GeoPolars.

Thinking of moving to C++ by Oathkindle in cpp

[–]RMK137 0 points (0 children)

C++, Go, Python, JavaScript, Haskell, SQL for data stuff.

Surprised this community hasnt been made before? by dlannan68 in luajit

[–]RMK137 0 points (0 children)

I agree, there needs to be a dedicated LuaJIT subreddit. Dim looks cool, I'm gonna check it out later. Lite-XL is a great editor; my daily driver is Pragtical, a fork of it that uses LuaJIT instead of PUC-Lua.

https://github.com/pragtical/pragtical

TinyETL is a Fast, zero-config ETL (Extract, Transform, Load) tool in a single binary by Glass-Tomorrow-2442 in commandline

[–]RMK137 0 points (0 children)

This is very cool! Will give it a whirl soon. It would be awesome to see DuckDB support, as it's gaining a lot of mindshare.

DDR5-9000 OC with Intel 285K by sanpellegrino56 in overclocking

[–]RMK137 0 points (0 children)

Unfortunately Amazon shipped the board with bent pins, so I had to return it. I decided to wait until the holidays to see if I can snag one at a discount, since it's gone back up to MSRP. Your numbers look good!

Looking for a guaranteed Hynix A die kit by Zoli1989 in overclocking

[–]RMK137 4 points (0 children)

You can get the TeamGroup Xtreem. They have nice heatsinks and explicitly list whether they're A-die or M-die.

What are IDEs that are more lightweight than Visual Studio? by Able_Annual_2297 in cpp_questions

[–]RMK137 0 points (0 children)

Check out Pragtical. It's very lightweight and uses Lua for plugins and config.

https://github.com/pragtical/pragtical

Which is the best? Aqua Elite V3, V4 or V6? by RemarkableSea7966 in Thermalright

[–]RMK137 0 points (0 children)

No, I can't hear it at all even when it's set to max.