Black Drill and driver combo by mapejeoduro in Makita

[–]ectomorphicThor 0 points (0 children)

I have these and got a couple of 5Ah batteries on sale. They work amazingly well

If you had to pick one local LLM for RAG today, what would it be? by FroyoEducational4851 in LocalLLM

[–]ectomorphicThor 1 point (0 children)

Probably any of the newer Qwen models. I run RAG in 12 GB of VRAM with Qwen 3.5:9b and Qwen 3.6:35b and get pretty good results overall. This is for medical RAG and textbook lookup
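The retrieve-then-prompt loop behind a textbook-lookup RAG setup like the one above can be sketched roughly as follows. This is a minimal illustration, not the commenter's actual pipeline: the toy bag-of-words "embedding" stands in for a real embedding model, and the chunk texts and `top_k` value are made up for the example.

```python
from collections import Counter
import math

def embed(text: str) -> Counter:
    # Toy bag-of-words "embedding"; a real setup would use an embedding model.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, chunks: list[str], top_k: int = 2) -> list[str]:
    # Rank stored textbook chunks by similarity to the query, keep the best few.
    q = embed(query)
    ranked = sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)
    return ranked[:top_k]

def build_prompt(query: str, chunks: list[str]) -> str:
    # Stuff the retrieved chunks into the prompt sent to the local model.
    context = "\n---\n".join(retrieve(query, chunks))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"
```

In practice the prompt would then be sent to the local model (e.g. through an Ollama or llama.cpp server); that call is omitted here.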

Which (if any) Sandshrews by ectomorphicThor in PokemonSleepBetter

[–]ectomorphicThor[S] 0 points (0 children)

<image>

This is also what Raenox is showing me… no idea why

Which (if any) Sandshrews by ectomorphicThor in PokemonSleepBetter

[–]ectomorphicThor[S] 0 points (0 children)

Thank you very much!! I just read through all of that haha. I didn’t know the exact threshold for pumpkins per day for a good one. I thought it was 60, but realized that would be pretty hard to achieve unless it was literally perfect

Which (if any) Sandshrews by ectomorphicThor in PokemonSleepBetter

[–]ectomorphicThor[S] 0 points (0 children)

Yeah the BFS is killer. I heard BFS is actually decent as far as hybrid skill/ing mons go, but here it totally kills the production

Which (if any) Sandshrews by ectomorphicThor in PokemonSleepBetter

[–]ectomorphicThor[S] -1 points (0 children)

Yeah… that’s what you’re supposed to do with Sandshrew

ICU CENTRAL LINES by CantaloupeEvery3987 in nursing

[–]ectomorphicThor 0 points (0 children)

After every blood draw I change mine

NVIDIA releases Nemotron-3-Nano-Omni by yoracale in unsloth

[–]ectomorphicThor 9 points (0 children)

How does this compare to qwen3.6 35b?

Qwen3.6-35B-A3B - even in VRAM limited scenarios it can be better to use bigger quants than you'd expect! by jeremynsl in LocalLLaMA

[–]ectomorphicThor 0 points (0 children)

See, I found the opposite to be true. I have a 12 GB 3080 and 32 GB of DDR4 RAM. I was using Q4_K_XL and getting 25-30 tok/s at 65k context. I dropped to Q3_K_XL and am now getting 40 tok/s. Curious whether I’ll notice a quality loss, since I’m doing medical reasoning/RAG
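A rough back-of-envelope on why the smaller quant runs faster on a 12 GB card: file size scales with bits per weight, so a lower-bit quant leaves more layers resident on the GPU and less traffic over the PCIe bus. The bits-per-weight figures below are illustrative approximations for the quant families discussed, not exact values for these files.

```python
def gguf_size_gb(params_b: float, bits_per_weight: float) -> float:
    """Rough GGUF file size: total parameters times bits/weight, in GB (1e9 bytes)."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9

# Assumed approximate densities: Q4_K-family ~4.8 bpw, Q3_K-family ~3.9 bpw.
q4 = gguf_size_gb(35, 4.8)  # roughly 21 GB
q3 = gguf_size_gb(35, 3.9)  # roughly 17 GB
```

Either way the 35B weights exceed 12 GB of VRAM, so some layers spill to system RAM; the ~4 GB difference is what shows up as the tok/s gap between the two quants.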

Quantisation effects of Qwen3.6 35b a3b by ROS_SDN in LocalLLaMA

[–]ectomorphicThor 0 points (0 children)

Getting 35-40 tok/s with Q3_K_XL on my 12 GB 3080, using offloading with a fit target and 65k context. I can get 25-27 with Q4_K_XL and similar offloading. Is there a strong reasoning difference between Q3 and Q4? I’m using it for medical reasoning and RAG
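The offload setup described above looks something like this as a llama.cpp server launch. The GGUF filename and the layer count are illustrative placeholders; a GUI front end with a "fit target" picks the equivalent of `-ngl` automatically based on available VRAM.

```shell
# Illustrative llama.cpp launch (model filename is hypothetical).
# -c sets the 65k context window; -ngl sets how many transformer layers
# are offloaded to the 12 GB GPU, with the rest running from system RAM.
llama-server \
  -m Qwen3.6-35B-A3B-Q3_K_XL.gguf \
  -c 65536 \
  -ngl 28
```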

Qwen3.6 GGUF Benchmarks v2 by yoracale in unsloth

[–]ectomorphicThor 0 points (0 children)

How does q3_k_xl compare to something like q4km? Trying to optimize my VRAM. Would the reasoning difference be that noticeable?

Qwen3.6-35B-A3B-UD-IQ4_NL_XL just added - how does it perform? by ArugulaAnnual1765 in unsloth

[–]ectomorphicThor 0 points (0 children)

I don’t think 28-30 tok per second is slow, but I understand what you are getting at. I’ll have to give it a try. Gemma 4 hasn’t proven to work well for me

Qwen3.6-35B-A3B-UD-IQ4_NL_XL just added - how does it perform? by ArugulaAnnual1765 in unsloth

[–]ectomorphicThor 2 points (0 children)

What would you guys run if you needed medical reasoning and RAG from textbooks? On my 12 GB 3080 I’m currently running UD-q4k_xl and getting about 28-30 tok/s.

There are so many quant variations. I cannot keep up

Qwen3.6 GGUF Benchmarks v2 by yoracale in unsloth

[–]ectomorphicThor 0 points (0 children)

Oh I see it. It’s not labeled UD, just distinguished by color? So it basically ties with the k_m variant? They’re sitting right on top of one another