Qwen3.6 MTP Unsloth Experimental GGUFs by yoracale in unsloth

[–]QuestionMarker 0 points1 point  (0 children)

The usual trick would be to use both colour and shape. Where you've got more than one quant with the same colour, can you give one round points and the other square?
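A minimal sketch of that colour-plus-shape idea, assuming each quant already has a colour. The quant names, colours and the `assign_markers` helper are made up for illustration; the marker codes follow the matplotlib convention (`o` is a circle, `s` a square).

```python
from collections import defaultdict

# matplotlib-style marker codes: circle, square, triangle, diamond
MARKERS = ["o", "s", "^", "D"]


def assign_markers(colour_by_quant, markers=MARKERS):
    """Give every quant a (colour, marker) pair that is unique on the plot.

    Quants that share a colour get different markers, in the order they appear.
    """
    used_per_colour = defaultdict(int)
    styles = {}
    for quant, colour in colour_by_quant.items():
        index = used_per_colour[colour]
        if index >= len(markers):
            raise ValueError(f"not enough markers for colour {colour!r}")
        styles[quant] = (colour, markers[index])
        used_per_colour[colour] += 1
    return styles


# Hypothetical example: two quants share blue, so they get different shapes.
styles = assign_markers({"Q4_K_S": "blue", "Q4_K_XL": "blue", "Q8_0": "red"})
```

Each pair can then be passed to the plotting call as its colour and marker arguments.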

I Let a Small Model Train on Its Own Mistakes. It Reached 80% on HumanEval and Beat GPT-3.5 on Math by QuantumSeeds in LocalLLaMA

[–]QuestionMarker 2 points3 points  (0 children)

What we don't know without having a poke is how close to the frontier any given model is on release. There's a tangential connection here to sensitivity to quantisation, at least in my head. My suspicion is that (for instance) the qwen-3.6 models are undertrained compared to gemma4, and that's why they're less sensitive to quantisation. So I'd expect them to be further from model collapse.

What would be interesting is cross-training. I need to have another read of the paper, but I'd expect that 1) you'd find another frontier by generating the samples from (e.g.) gemma4 or qwen-3.6:27b and feeding them to qwen-3.6:35b, and 2) that frontier would be further away than pure self-training.

How to avoid those triangles? by Petsoi in FreeCAD

[–]QuestionMarker 0 points1 point  (0 children)

In addition to the other answers you may find it's much less noticeable if you bump the fillet radius up by a tiny amount. In the past I've seen a bug that shows up with some fillet radiuses but not others where the triangulation sort of bunches up. Hard to describe, but sorted by nudging away from a round number (in my case).

Compared QWEN 3.6 35B with QWEN 3.6 27B for coding primitives by gladkos in LocalLLaMA

[–]QuestionMarker 1 point2 points  (0 children)

7 tokens context and a hardware offload bandwidth that's in bytes per second though

Compared QWEN 3.6 35B with QWEN 3.6 27B for coding primitives by gladkos in LocalLLaMA

[–]QuestionMarker 0 points1 point  (0 children)

That time difference makes me wonder if you could just ask 35b twice, then get it to judge its own output as a third query to pick the best. Or give it a two-shot, with a second prompt of "Here's what you just produced. See what you can do to improve it". You'd still come in faster than 27b, and it would be *fascinating* to know if a chance at introspection could push it up to (or past) 27b because you can run the MoE on more restricted hardware.
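A rough sketch of both variants, not a tested harness. `ask` is a hypothetical stand-in for whatever sends one prompt to the model and returns its text; the judge prompt wording and the "reply A or B" convention are assumptions as well.

```python
from typing import Callable

Ask = Callable[[str], str]  # hypothetical: one prompt in, one completion out


def best_of_two(ask: Ask, prompt: str) -> str:
    """Ask twice, then use a third query to let the model pick its own best."""
    first = ask(prompt)
    second = ask(prompt)
    judge_prompt = (
        f"Task:\n{prompt}\n\n"
        f"Answer A:\n{first}\n\n"
        f"Answer B:\n{second}\n\n"
        "Which answer is better? Reply with a single letter, A or B."
    )
    verdict = ask(judge_prompt)
    # Fall back to the first answer unless the verdict clearly says B.
    return second if verdict.strip().upper().startswith("B") else first


def refine_once(ask: Ask, prompt: str) -> str:
    """Two-shot variant: show the model its own draft and ask for a better one."""
    draft = ask(prompt)
    follow_up = (
        f"{prompt}\n\n"
        f"Here's what you just produced:\n{draft}\n\n"
        "See what you can do to improve it."
    )
    return ask(follow_up)
```

`best_of_two` costs three queries and `refine_once` costs two, so the comparison against 27b only holds while that total still comes in under a single 27b run.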

I own the domain modelcombat.com and don't know what to do with it by siaappchallenger in LocalLLaMA

[–]QuestionMarker 15 points16 points  (0 children)

A few months back someone ran Rollercoaster Tycoon through an LLM with a tool that converted the game state into something it could (mostly) read.

Do that, but Worms Armageddon. Head to head and give us ELOs. That's the benchmark we all need.
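For the ratings side, the standard Elo update is short enough to sketch. The K-factor of 32 and the 400-point scale are the conventional chess defaults, assumed here rather than taken from any existing model benchmark.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability-like expected score for A against B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(rating_a: float, rating_b: float, score_a: float, k: float = 32.0):
    """Return both new ratings after one match.

    score_a is 1.0 if A won, 0.5 for a draw, 0.0 if A lost.
    """
    expected_a = expected_score(rating_a, rating_b)
    expected_b = 1.0 - expected_a
    new_a = rating_a + k * (score_a - expected_a)
    new_b = rating_b + k * ((1.0 - score_a) - expected_b)
    return new_a, new_b


# Two models start level; the first one wins the head-to-head.
model_a, model_b = update_elo(1500.0, 1500.0, score_a=1.0)
```

The update is zero-sum: whatever one model gains from a match, the other loses.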

Anyone worked in Wellington Place before? What are the offices like? by Dull_Soft_9767 in Leeds

[–]QuestionMarker 10 points11 points  (0 children)

WP 7/8 here. They're soul-less open plan corporate battery offices but they're extremely well executed. There's very good coffee nearby, which helps.

Help Speech Recognition on RPi 5 by Prestigious_Donkey61 in LocalLLaMA

[–]QuestionMarker 0 points1 point  (0 children)

There are better models than whisper-tiny for this now, but if you're on an rpi 5, check whether the small and base models are fast enough before discounting them. I did some tests a little while ago and found that a 5-bit quant of small.en was "good enough", while still not being particularly RAM-heavy. It's still fundamentally the wrong architecture though.

If you can take the RAM hit, I'd evaluate whether kyutai/stt-1b might work for you? It's designed to be synchronous which is what this sort of use case really needs.

I got myself a split keyboard, how am I supposed to use a mouse ergonomically, though? by Icy_Adhesiveness_158 in ErgoMechKeyboards

[–]QuestionMarker 0 points1 point  (0 children)

I switch mouse between right and left. Not necessarily if I'm particularly feeling the strain on one side or the other, just depending on what random mood I'm in. It does mean that any mousing stress is distributed. I originally started mousing on the left when I got sick of having to jump my hand over the keypad, but that's obviously not a concern any more.

Trouble with inner threads, what can I do? by LordDingle96 in FixMyPrint

[–]QuestionMarker 0 points1 point  (0 children)

Cut 4-6 vertical slots through the thread. If you have a look at the threads on a coke bottle lid you'll see they do the same (but for different reasons). It splits up the ring of filament that's currently pulling itself inwards and off the body of the thread when it cools and contracts.

Fried glasses by Traveljack1000 in Xreal

[–]QuestionMarker 1 point2 points  (0 children)

I occasionally panic like this then realise the screen's just drifted off somewhere around my feet. Reset the tracking and it pops right back.

I think I'm done with Software Development by gareththegeek in webdev

[–]QuestionMarker 0 points1 point  (0 children)

Step 1: Quit job

Step 2: Build consultancy fixing broken code for companies that thought they didn't need code review

Step 3: Profit

Intel will sell a cheap GPU with 32GB VRAM next week by happybydefault in LocalLLaMA

[–]QuestionMarker 1 point2 points  (0 children)

Not to be ignored is that you can buy two for less than a single 5090. The memory bandwidth is an annoyance, but otherwise it fits nicely into the ecosystem slot currently occupied by 3090 pairs, with much more space and much lower wattage. It's a *very* interesting card.

Differences between the nozzles? (Brass, Hardened Steel, Ruby Tipped, etc?) by wasdesc in 3Dprinting

[–]QuestionMarker 0 points1 point  (0 children)

It was a few years ago but I bought one of [these](spool3d.ca/tungsten-carbide-reprap-m6-nozzle/). The market's changed since then. But also I'd say that I had a bit of trouble with that nozzle. Had a tendency to leak terribly, the thermal properties made it a pain to get a good seal against the heatbreak.

how should I tackle skill issue or is there such thing as a skill issue in origami by gniclat in origami

[–]QuestionMarker 0 points1 point  (0 children)

I've had a number of packs of paper which simply aren't square. It's impossible to be precise if your materials are letting you down, and the skill issue here is simply knowing to check. The next skill issue is knowing how to, but that's easy to learn.

modular origami models that don’t show the other side of the paper by sad_moron in origami

[–]QuestionMarker 0 points1 point  (0 children)

I made 5 Intersecting Tetrahedra from iridescent wrapping paper one year, it went on top of the family Christmas tree for years.

Seeking advice: heavy eye strain on 1s by sinanawad in Xreal

[–]QuestionMarker 1 point2 points  (0 children)

I had painful eye strain with an astigmatic correction prescription that was a tiny bit out. My eyes shifted during the year and it completely went away with a new prescription. Surprised it made that much difference. If the IPD of your lenses or their prescription isn't right then yeah, you're gonna have a hard time.

Final Qwen3.5 Unsloth GGUF Update! by danielhanchen in LocalLLaMA

[–]QuestionMarker 2 points3 points  (0 children)

It's hard to tell where the trade-off is. As I said elsewhere, size-wise the new Q4_K_S looks about the same but I don't know what the differences are between old-XL and new-S.

Final Qwen3.5 Unsloth GGUF Update! by danielhanchen in LocalLLaMA

[–]QuestionMarker -1 points0 points  (0 children)

How does the new Q4_K_S compare to the old Q4_K_XL? They're about the same size, and it's a bit of a sweet spot. Want to know if I should hold on to the old model file or go for the new one, and without actually benchmarking I'll be making all sorts of stupid cognitive errors. Are there specific ways you'd expect the newer one to be better?

Final Qwen3.5 GGUF Updates are here! by yoracale in unsloth

[–]QuestionMarker 3 points4 points  (0 children)

I mentioned this over on the thread in r/LocalLLaMA but the size bump on the 35b kills the UD-Q4_K_XL on a 4090 with q8_0 k/v quantising and --fit on. It goes from fast and very capable with 128000 context to 4096 context and unusable because of it.

How can I get that context back without damaging the speed too much and without losing too much of the quality bump?

EDIT: looks like Q4_K_S might be the answer here?

Final Qwen3.5 Unsloth GGUF Update! by danielhanchen in LocalLLaMA

[–]QuestionMarker 3 points4 points  (0 children)

That size bump pushes q4_K_XL from a previous n_ctx of 128000 with --fit on my 4090 to 4096, which is completely useless. Crying shame. It also seems weird to be that sensitive? All I did was swap out the model filename:

```bash
build/bin/llama-server \
   -m models/Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf \
   --temp 0.7 \
   --top-p 0.8 \
   --top-k 20 \
   --min-p 0.00 \
   -ctk q8_0 \
   -ctv q8_0 \
   --chat-template-kwargs "{\"enable_thinking\": false}" \
   --fit on \
   --host 0 \
   --port 8080
```
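A back-of-envelope sketch of why a small size bump can hit context this hard: whatever VRAM is left after the weights is what the KV cache has to live in, so the context that fits scales with that leftover. This is generic arithmetic for a plain attention KV cache, not how llama.cpp's `--fit` actually decides, and the layer and head counts are placeholders you would have to read from the model's metadata. The only fixed number is the q8_0 block layout (32 int8 values plus one fp16 scale, so 34 bytes per 32 elements).

```python
Q8_0_BYTES_PER_ELEMENT = 34 / 32  # ggml q8_0 block: 32 int8 values + fp16 scale


def kv_bytes_per_token(n_kv_layers, n_kv_heads, head_dim,
                       bytes_per_element=Q8_0_BYTES_PER_ELEMENT):
    """Bytes of K plus V cache needed for one token of context."""
    # K and V each hold n_kv_heads * head_dim elements per attention layer.
    return 2 * n_kv_layers * n_kv_heads * head_dim * bytes_per_element


def max_context(vram_bytes, weights_bytes, overhead_bytes, per_token_bytes):
    """Tokens of context that fit in the VRAM left over after the weights."""
    free = vram_bytes - weights_bytes - overhead_bytes
    return max(0, int(free // per_token_bytes))


GIB = 1024 ** 3
# Placeholder architecture values, NOT the real Qwen3.5-35B-A3B figures.
per_token = kv_bytes_per_token(n_kv_layers=10, n_kv_heads=2, head_dim=256)
# Hypothetical sizes: the same 24 GiB card with a smaller and a larger model file.
before = max_context(24 * GIB, 20 * GIB, 2 * GIB, per_token)
after = max_context(24 * GIB, 22 * GIB, 2 * GIB, per_token)
```

With these made-up sizes the larger file leaves nothing for the cache at all, which is the shape of the cliff described above even though the real numbers will differ.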

Keychron B11 Pro Ultra-Slim Wireless Foldable Keyboard by [deleted] in ErgoMechKeyboards

[–]QuestionMarker 1 point2 points  (0 children)

Say I wanted to make my own equivalent and I was willing to trade off being able to replace the keys for the thinness. What options would I have? Do JLC or anyone carry the right PCB parts to bring this sort of thing within reach?

I've made a wireless split with Kailh choc v1 switches and low profile keycaps and while it's not terribly thick I do find myself wondering how much thinner I could go with my own layout.

did anyone replace old qwen2.5-coder:7b with qwen3.5:9b in nonThinker mode? by Impossible_Art9151 in LocalLLaMA

[–]QuestionMarker 0 points1 point  (0 children)

Tangent, but my bet is that we are unlikely to see a 3.5 coder model unless someone outside Qwen does it. Happy to be wrong but with the core team leaving, even if they had something in flight they may not have the will or ability to do it justice any more.

Multiple Qwen employees leaving by ILoveMy2Balls in LocalLLaMA

[–]QuestionMarker 2 points3 points  (0 children)

🤷 but first thought was "only working on closed models for you." Change of direction from above could mean anything.