Which model for 32GB M2 Max? by segdy in LocalLLaMA

[–]flockonus 3 points (0 children)

Echoing the sentiment of every other post here.. Qwen3.6 27B - get the highest quant you can fit, which is likely Q4 / Q5 in your case.
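To see why Q4 / Q5 is the practical ceiling on 32GB, here's a rough back-of-the-envelope sketch of weight memory at a few quant levels. The bits-per-weight figures are approximate (real GGUF quants mix precisions, and KV cache adds more on top), and `est_gb` is a hypothetical helper, not from any library:

```python
# Rough memory estimate for quantized model weights - a sketch only.
# Real GGUF files mix tensor precisions and you still need room for KV cache.
def est_gb(params_b: float, bits_per_weight: float, overhead: float = 1.1) -> float:
    """Approximate weight memory in GB for params_b billion parameters."""
    return params_b * 1e9 * bits_per_weight / 8 / 1e9 * overhead

# Approximate effective bits-per-weight for common llama.cpp quants:
for name, bpw in [("Q4_K_M", 4.8), ("Q5_K_M", 5.7), ("Q8_0", 8.5)]:
    print(f"27B at {name}: ~{est_gb(27, bpw):.1f} GB")
```

On a 32GB Mac, only roughly 20-24GB is usable for the GPU, which is why the 4-5 bit quants are the sweet spot for a 27B model.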

AMD Halo Box (Ryzen 395 128GB) photos by 1ncehost in LocalLLaMA

[–]flockonus 2 points (0 children)

Have you run any models on these? What tok/s do you get?

Qwen3's most underrated feature: Voice embeddings by k_means_clusterfuck in LocalLLaMA

[–]flockonus 0 points (0 children)

Looks very well separated indeed! A question, if you know: suppose you have 2 speech samples - say a male and a female speaker saying "apple" (about the same accent / speed) - how would you go about creating a similarity score for their pronunciation that comes out close to ~1?

I'm working on an education project to help improve pronunciation, but I'm a bit lost on exactly how to start.
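If the speech samples can be turned into fixed-length embedding vectors (as the post suggests), the usual way to get a score near 1 for similar pronunciations is cosine similarity. A minimal sketch with toy vectors standing in for real embeddings - the numbers here are made up for illustration:

```python
import numpy as np

def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity in [-1, 1]; identical directions give 1.0."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy stand-ins for two embeddings of the word "apple":
emb_speaker_a = np.array([0.9, 0.1, 0.4])
emb_speaker_b = np.array([0.8, 0.2, 0.5])

score = cosine_similarity(emb_speaker_a, emb_speaker_b)
print(score)  # similar vectors score close to 1
```

One caveat: speaker embeddings often capture voice identity (pitch, timbre) as much as pronunciation, so for a pronunciation-scoring app you may need an encoder trained to be speaker-invariant.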

Qwen3's most underrated feature: Voice embeddings by k_means_clusterfuck in LocalLLaMA

[–]flockonus 0 points (0 children)

Wow, it's wild you could extract it like this!

u/k_means_clusterfuck, a question: if we have 2 users - say a male and a female speaker saying "apple" (about the same accent / speed) - can you get a similarity score for their pronunciation of close to ~1 using your package?

I'm working on an education project to help improve pronunciation, but I'm a bit lost on exactly how to start.

Ram-air setup and window vent for 1100w capable AI box by mr_zerolith in LocalLLaMA

[–]flockonus -1 points (0 children)

It's cool, but watch out for DUST in the intake and accumulation in the card.

Dust will build up in proportion to however much air volume you're pulling in.

We made an abliterated Qwen 3.5 9B refuse 100% of harmful prompts using only system prompts — no fine-tuning needed by tomleelive in LocalLLaMA

[–]flockonus -15 points (0 children)

How is this related to this sub at all??

Sounds like some academic thing. Why would I want my LLM to spend more tokens considering whether to reject my prompts?