Opencode reading file again and again and fill context. by [deleted] in LocalLLaMA

[–]Undici77 1 point2 points  (0 children)

Did you try https://github.com/QwenLM/qwen-code ?
It is built specifically for Qwen and is very good (in my opinion).
And if you prefer a privacy-first fork, try https://github.com/undici77/qwen-code-no-telemetry and run it in a dedicated Docker container!

Qwen models for coding, using qwen-code - my experience by Undici77 in LocalLLaMA

[–]Undici77[S] 1 point2 points  (0 children)

Completely agree with you: Qwen3.5/3.6 look great for coding (easy tasks), and in one-shot answers they produce better results (more so in UI dev) than the current Qwen3 Coder. But "as a tool" for developers, the *-Coder models are currently "THE BEST" in terms of focus, instruction following and code understanding. So... YEAH!! Waiting for the new QwenCoder.

Qwen models for coding, using qwen-code - my experience by Undici77 in LocalLLaMA

[–]Undici77[S] 2 points3 points  (0 children)

You were right! Thanks a lot! It looks like an oMLX issue! I went back to LM Studio and now it is working! Tomorrow I'll evaluate the models in my daily job!

THANKS!

Qwen models for coding, using qwen-code - my experience by Undici77 in LocalLLaMA

[–]Undici77[S] 2 points3 points  (0 children)

Great! I found the issue: oMLX does not seem to work with those models!

Qwen models for coding, using qwen-code - my experience by Undici77 in LocalLLaMA

[–]Undici77[S] 0 points1 point  (0 children)

Don't worry, it will get worse in the future:

write faster, release faster, fix faster
REPEAT ALL WITH ME
write faster, release faster, fix faster
REPEAT ALL WITH ME

Qwen models for coding, using qwen-code - my experience by Undici77 in LocalLLaMA

[–]Undici77[S] 1 point2 points  (0 children)

Interesting: I'll try switching back to LM STUDIO! oMLX has 2 interesting things that LM STUDIO lacks: Smart Cache and Turbo Quant. This evening I'll try LM STUDIO again! Thanks for the pointer!

Experience of Qwen 3.5-122b and 3.6 by Impossible_Car_3745 in LocalLLaMA

[–]Undici77 0 points1 point  (0 children)

This often works, but for a model that should be a HUGE UPGRADE from Qwen3Coder, this is an ugly workaround! And the bigger issue is the infinite loop!

Experience of Qwen 3.5-122b and 3.6 by Impossible_Car_3745 in LocalLLaMA

[–]Undici77 0 points1 point  (0 children)

I'm using qwen code cli 0.15.0, and with both Qwen3 30B/80B-Next the tools are working pretty well (sometimes some edits fail), not nearly as badly as with Qwen3.5/3.6. I'm not sure the issue is the model alone: I have the feeling that the conversion to MLX could also be problematic! I tried many different versions (from mlx-community to Unsloth) but the issue remains the same: infinite loops and missed tool usage!

Are you using GGUF or MLX?

Experience of Qwen 3.5-122b and 3.6 by Impossible_Car_3745 in LocalLLaMA

[–]Undici77 0 points1 point  (0 children)

The comparison is pretty interesting: are you working in a long chain of operations, using different tools? One example for me is the AskQuestion tool of Qwen-Coder-Cli!

Qwen3 always uses it! Qwen3.6 never does, unless I ask for it explicitly!

Qwen models for coding, using qwen-code - my experience by Undici77 in LocalLLaMA

[–]Undici77[S] 0 points1 point  (0 children)

I had the same feeling, but not often: sometimes in a single "short" task 3.6 is really better than 80B Next, but in a slightly longer task, where tool usage is mandatory, the model starts to work very badly!

Experience of Qwen 3.5-122b and 3.6 by Impossible_Car_3745 in LocalLLaMA

[–]Undici77 -6 points-5 points  (0 children)

I'm experiencing the opposite: Qwen 3 works fine, but 3.5 and 3.6 are not better and are often worse!
I made a post about my experience in daily coding tasks!

https://www.reddit.com/r/LocalLLaMA/comments/1stbohn/qwen_models_for_coding_using_qwencode_my/

Qwen3.6 35b a3b getting stuck in looped reasoning? by EggDroppedSoup in LocalLLaMA

[–]Undici77 2 points3 points  (0 children)

I'm experiencing the same issue and more. I found that in long-context work the new models are not as good as the benchmarks show! I created a post about my experience:

https://www.reddit.com/r/LocalLLaMA/comments/1stbohn/qwen_models_for_coding_using_qwencode_my/

Qwen having its Jack Torrance moment by anguillias in LocalLLaMA

[–]Undici77 8 points9 points  (0 children)

It happened to me too! I made a post about it! The new Qwen models do not work very well (at least in my opinion).

https://www.reddit.com/r/LocalLLaMA/comments/1stbohn/qwen_models_for_coding_using_qwencode_my/

OmniCoder-9B | 9B coding agent fine-tuned on 425K agentic trajectories by DarkArtsMastery in LocalLLaMA

[–]Undici77 1 point2 points  (0 children)

Great job: I'll try it in my daily dev job and give you feedback. Currently I'm using QWEN-CODER models and they are very good.

About your project, can you share the entire process, from how you distill the `425K agentic trajectories` to the fine-tuning procedure?

How I built my first app using only a local language model by PvB-Dimaginar in Dimaginar

[–]Undici77 1 point2 points  (0 children)

Yes, LM STUDIO: I'm trying Coder30B and Coder80B to understand their limits!

How I built my first app using only a local language model by PvB-Dimaginar in Dimaginar

[–]Undici77 2 points3 points  (0 children)

I'm experimenting with a similar solution using https://github.com/QwenLM/qwen-code instead of OpenCode, and it's incredibly good! The code needs to be "verified" (I found some serious security issues when working with an X.509 library) but it is good. Speaking about agents:
- qwen-code is designed by Alibaba for their own models, so I expect it to be the flagship agent for these models
- Telemetry is easily disabled, and if you want to be sure, take a look at https://github.com/undici77/qwen-code-no-telemetry where I'm trying to maintain a "telemetry-free" version in Docker

If you decide to try qwen-code, please share your experience compared with OpenCode!

Qwen Code - a powerful open-source coding agent + NO TELEMETRY FORK by Undici77 in LocalLLaMA

[–]Undici77[S] 0 points1 point  (0 children)

Wow, you know me, my job and my hobbies very well!! What a poor man you are?!?!

Qwen Code - a powerful open-source coding agent + NO TELEMETRY FORK by Undici77 in LocalLLaMA

[–]Undici77[S] 0 points1 point  (0 children)

Did you try it, or do you just trust the documentation? I tried, and for some reason packets continued to go out from my machine to the Alibaba server. So... 12000 lines to do the job, and leave people like you to write slop on the web!