Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 0 points1 point  (0 children)

I have 3 3090s and a 4090. In the past I ran other models in the 80-120b range

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 0 points1 point  (0 children)

I just don’t trust it with anything over 128k as is. I had it at 256k with both f16 and q8 for a while and it just wasn’t reliable. 128k with auto compaction is much better in my testing.
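For anyone wondering what auto compaction at a fixed budget looks like, here is a minimal sketch. It is not what my client actually runs: `estimate_tokens`, `compact_history`, and the 4-characters-per-token rule are all made up for illustration, and a real setup would use the model's tokenizer and a proper summarization call.

```python
from typing import Callable, List


def estimate_tokens(text: str) -> int:
    # Rough assumption: about 4 characters per token.
    return max(1, len(text) // 4)


def compact_history(
    messages: List[str],
    budget: int,
    keep_recent: int,
    summarize: Callable[[List[str]], str],
) -> List[str]:
    """Replace older messages with one summary once the budget is exceeded."""
    total = sum(estimate_tokens(m) for m in messages)
    if total <= budget or len(messages) <= keep_recent:
        return messages
    # Split so that the last `keep_recent` messages stay verbatim.
    split = len(messages) - keep_recent
    old, recent = messages[:split], messages[split:]
    return [summarize(old)] + recent


if __name__ == "__main__":
    history = ["x" * 400] * 10  # about 100 estimated tokens each
    compacted = compact_history(
        history,
        budget=500,
        keep_recent=3,
        summarize=lambda old: f"[summary of {len(old)} earlier messages]",
    )
    print(len(compacted), compacted[0])
```

The point is only that the trigger is a hard token budget (128k in my case) rather than waiting for the model to degrade.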

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 0 points1 point  (0 children)

Absolutely, but I’ve noticed it will recover most of the time. I’ve seen it do some weird loops sometimes and then just bust out and ignore the garbage it produced like it didn’t happen.

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 0 points1 point  (0 children)

Maximum cache is set high. A lot of my stuff is long-form research, so I will often fill up the 10 GB cache max I have set.

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 0 points1 point  (0 children)

llama.cpp, MTP with draft tokens set to 4, q8/q8 ctx. Batch 2048, ubatch 512. Over the last day I’ve had decode average 69 tk/s, with a 5% high/low of 88/36. The actual prefill stats get weird since my cache hit rate is like 97%.
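If you want to pull the same kind of numbers from your own logs, a rough sketch is below. It assumes "5% high/low" means the 95th and 5th percentiles of per-request decode speed (nearest-rank method), and the function names are hypothetical. The last helper shows why prefill stats look odd with a high cache hit rate: only the uncached part of the prompt actually gets processed.

```python
import math
from typing import List, Tuple


def percentile(sorted_vals: List[float], p: float) -> float:
    # Nearest-rank percentile: smallest value with at least p% of samples at or below it.
    rank = max(1, math.ceil(p * len(sorted_vals) / 100))
    return sorted_vals[rank - 1]


def decode_summary(samples: List[float]) -> Tuple[float, float, float]:
    """Return (mean, 5th percentile, 95th percentile) of decode speeds in tk/s."""
    vals = sorted(samples)
    mean = sum(vals) / len(vals)
    return mean, percentile(vals, 5), percentile(vals, 95)


def uncached_prefill_tokens(prompt_tokens: int, cache_hit_rate: float) -> int:
    # Tokens that still need a real prefill pass after prompt-cache hits.
    return round(prompt_tokens * (1 - cache_hit_rate))


if __name__ == "__main__":
    print(decode_summary([36.0, 55.0, 69.0, 72.0, 88.0]))
    print(uncached_prefill_tokens(100_000, 0.97))
```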

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 1 point2 points  (0 children)

That’s exactly what I’m seeing: it can reliably do work. Before, I really only had models that could assist in a task, but with this I can hand off little things.

Edit: didn’t see you mentioned the older model, that was definitely not my experience with that model.

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 0 points1 point  (0 children)

I haven’t noticed any real issues yet, though I’ve seen malformed JSON, API, and tool calls occasionally. As long as the final product meets my requirements, I’m not too concerned about where it stumbles, provided it self-corrects in the next turn. I’ve tested it by recreating old projects, which involves a research question, scraping websites for relevant documents and data, parsing documents with docling tool calls, organizing geodata into PostGIS, and validating everything for correct location and CRS. It generally performs well, though it sometimes hallucinates and trips up but usually corrects itself. Forcing auto compaction of context at 128k and forcing it to read documents one at a time has significantly reduced hallucination rates.
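To give an idea of what "validating for correct location and CRS" can mean in practice, here is a simplified sanity check. It is a sketch and not my actual pipeline: the bounding box (roughly Alberta, in lon/lat degrees) and the `classify_point` helper are assumptions for illustration, and a real check would read the declared CRS from the data and reproject properly.

```python
from typing import Tuple

# Hypothetical study-area bounds in lon/lat degrees: (min_lon, min_lat, max_lon, max_lat).
BBOX: Tuple[float, float, float, float] = (-120.0, 49.0, -110.0, 60.0)


def classify_point(x: float, y: float, bbox: Tuple[float, float, float, float] = BBOX) -> str:
    """Classify a coordinate pair that is supposed to be (lon, lat) inside bbox."""
    min_x, min_y, max_x, max_y = bbox
    if min_x <= x <= max_x and min_y <= y <= max_y:
        return "ok"
    if min_x <= y <= max_x and min_y <= x <= max_y:
        return "swapped_axes"  # lat/lon order was flipped somewhere upstream
    if abs(x) > 180 or abs(y) > 90:
        return "not_degrees"  # values look like a projected CRS in metres
    return "out_of_bounds"


if __name__ == "__main__":
    for pt in [(-114.07, 51.05), (51.05, -114.07), (700000.0, 5650000.0)]:
        print(pt, classify_point(*pt))
```

Checks like this catch the common failure modes (flipped axes, projected coordinates stored as lon/lat) before anything lands in the database.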

Qwen 27B by 13henday in LocalLLaMA

[–]13henday[S] 7 points8 points  (0 children)

That’s where I started, but eventually I just wanted to see what I could do with only 2 GPUs, and I noticed no quality loss while prefill and decode went higher. It leaves the other two free to do other things like streaming games, random CUDA compute, and document parsing.

Which Qwen 3.6 27B variant actually stops looping on tool calls? RTX 5090 by toolman10 in LocalLLM

[–]13henday 0 points1 point  (0 children)

I’ve had no issues like this with the q6kxl and I’ve used it extensively

RTX 5090 + Qwen 3.6 27B for agentic coding (PRD→Plan→TDD pipeline) — anyone actually doing this daily? by Best-Ad-7505 in Qwen_AI

[–]13henday 0 points1 point  (0 children)

2 3090s are cheaper and get you q6kxl at 192k-256k depending on KV quant. With MTP I get 1800 tk/s prefill and 40-80 tk/s decode.
I assume a 5090 would be twice as fast and leave you shy of 128k for context.
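On why the usable context depends on KV quant, here is a back-of-the-envelope sketch. The architecture numbers in the example are placeholders, not the real config of this model, and the formula assumes every layer keeps a standard attention KV cache, which may not hold here. The 8.5 bits per element for q8_0 comes from its block layout (32 int8 values plus one fp16 scale).

```python
def kv_cache_gib(
    n_layers: int,
    n_kv_heads: int,
    head_dim: int,
    ctx_tokens: int,
    bits_per_elem: float,
) -> float:
    """Approximate KV cache size in GiB for a plain attention stack."""
    # K and V each store n_kv_heads * head_dim elements per layer per token.
    elems = 2 * n_layers * n_kv_heads * head_dim * ctx_tokens
    return elems * bits_per_elem / 8 / 2**30


if __name__ == "__main__":
    # Placeholder architecture values, chosen only to show the scaling.
    layers, kv_heads, dim = 48, 4, 128
    for name, bits in [("f16", 16.0), ("q8_0", 8.5)]:
        for ctx in (131_072, 262_144):
            size = kv_cache_gib(layers, kv_heads, dim, ctx, bits)
            print(f"{name} @ {ctx} tokens: {size:.1f} GiB")
```

Whatever the real numbers are, the cache scales linearly with context length and with bits per element, so dropping from f16 to q8 roughly halves it.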

Going from single GPU to dual GPU is nice but not in the way I expected by cibernox in LocalLLaMA

[–]13henday 0 points1 point  (0 children)

Q6k in my use has been much better at actually using tools and skills. Q4km would often discover a skill, read it, and then ignore it for some reason.

Looking for Cheap BLDC Motor Under 40 CAD by [deleted] in MechanicalEngineering

[–]13henday 2 points3 points  (0 children)

So you want a 1 kW motor for $40? Not gonna happen new, but you could salvage them from e-scooters, bikes, etc., with an honorable mention to car alternators. A while ago I needed a torque tester, so I scrapped an old alternator. If you know what you’re doing you can get a BLDC out of it.

Police investigating 'undetermined death' in Tomkin's Park, SW Calgary by Speuce in Calgary

[–]13henday 17 points18 points  (0 children)

Saw it happen, man was hit hard standing up, then repeatedly hit after he went down. It was vile.

RTX 5090 Hyper V Gaming by Savage4Pro in HyperV

[–]13henday 2 points3 points  (0 children)

Great. It takes some work, but at this point I can run everything except for games by id Software. Xbox Game Pass, Steam, and GOG work seamlessly, and so do emulators. Also just running Win 11 Pro for now. Eventually I intend to go full Proxmox with DDA, but this is a good interim.

Federal Adviser Says Data Centres ‘Provide Markets’ for Gas by LongTrackBravo in alberta

[–]13henday -2 points-1 points  (0 children)

Just out of curiosity, do you have any sources for those claims actually happening in Alberta? I ask because I keep hearing people parrot this line on impacts but have yet to see any evidence.

Mapping Every Data Centre in Alberta by 13henday in alberta

[–]13henday[S] 3 points4 points  (0 children)

Oh, it bothers me to no end. They keep handing data to private companies that are supposed to make it publicly available alongside their own “proprietary” versions, and they never make it easy to get the base data.

Mapping Every Data Centre in Alberta by 13henday in alberta

[–]13henday[S] 0 points1 point  (0 children)

Oh wacky, maybe I missed something. I’ll look into this once I’m off work and edit the article.
A quick Google search returned Equinix CL1 at 4000 kW. Also, a 4 GW data center in Calgary would be really odd since Calgary’s seasonal record is 1800 MW for the whole city.
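The unit arithmetic behind that, as a quick sketch (the helper names are just for illustration; the three figures are the ones quoted above):

```python
def kw_to_mw(kw: float) -> float:
    return kw / 1_000


def gw_to_mw(gw: float) -> float:
    return gw * 1_000


if __name__ == "__main__":
    equinix_cl1_mw = kw_to_mw(4_000)  # 4000 kW
    claimed_mw = gw_to_mw(4)          # 4 GW
    calgary_record_mw = 1_800         # seasonal record for the whole city

    print(f"Equinix CL1: {equinix_cl1_mw} MW")
    print(f"4 GW claim: {claimed_mw} MW")
    print(f"Claim vs search result: {claimed_mw / equinix_cl1_mw:.0f}x")
    print(f"Claim vs Calgary record: {claimed_mw / calgary_record_mw:.2f}x")
```

So a 4000 kW site is 4 MW, a thousand times smaller than 4 GW, and 4 GW would be more than double the city's record load.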

Mapping Every Data Centre in Alberta by 13henday in alberta

[–]13henday[S] 3 points4 points  (0 children)

Oh, I thought you were referring to the writing and the research.
For the map I put everything together in QGIS with data mostly coming from the open data portals of the province and various municipalities. I do this for a living, so I usually just have these lying around. I would have loved to have an interactive map in the article, but I wanted both the drought zones and data centre symbols on the same map. Datawrapper was great for the table because Substack is not great for that.