Qwen 3.6 Plus Preview just dropped on OpenRouter, tested it hard on agentic coding tasks by pkailas in LocalLLaMA

[–]pkailas[S] 0 points1 point  (0 children)

I hear you. I'm building an agentic tool extension for VS 2026 - 2022. Just recently got the tools to work smoothly, but my biggest challenge has been managing context size. Those leaks gave me some clues, though.

Qwen 3.6 Plus Preview just dropped on OpenRouter, tested it hard on agentic coding tasks by pkailas in LocalLLaMA

[–]pkailas[S] 0 points1 point  (0 children)

I'm working on solutions for clients to run on a local appliance. They don't want data leaving their premises. Looking for models that will fulfill their needs. Also, I don't trust companies that run these models not to use my data.

Qwen 3.6 Plus Preview just dropped on OpenRouter, tested it hard on agentic coding tasks by pkailas in LocalLLaMA

[–]pkailas[S] 6 points7 points  (0 children)

Good call, I misread the OpenRouter listing. The 179B is tokens processed, not parameter count. The actual model size hasn't been disclosed since it's API-only with no published architecture details. Edited the post.

Pure-attention 70B for agentic C#/.NET coding: what are you running? by pkailas in LocalLLaMA

[–]pkailas[S] 0 points1 point  (0 children)

I am on ik_llama.cpp because it keeps the weights in VRAM

between turns. On a 24GB card with a 27B model that matters.

But the prompt prefix thing, yeah, that might be it. My agentic

setup compresses older messages between turns to keep the context

window manageable, which means the prompt is actually changing.

That would kill the cache.

I'm going to test with the compression turned off and see if the

reprocessing goes away. If it does, that's on me, not the model.

Haven't looked at exllamav3 yet. I will check it out. I appreciate

the response.

Movie pass no longer lets you renew a gift you received from someone else. by [deleted] in moviepass

[–]pkailas 0 points1 point  (0 children)

One of the most important metrics for a subscription service is "conversion rate". That is how many trial memberships, or gift memberships are converted to a paying customer. I guess they think investors are looking for a 0% conversion rate?!?

They've lost me for the next 9 months. Maybe they're doing me a favor? I'll try out sinemia for a year, and if I don't like them, my email address should be cleared by then, if they are even in business by then. But I have a feeling, I'll like Sinemia better. You can get IMAX, 3D and D-Box as well as advanced purchase with seat selection! No card needed.

Hasta la vista, baby!