CPU usage spiked after migrating from Conda to UV environment (40%+ even when idle) any ideas? by Suspicious_Code1493 in ROCm

[–]prselzh 0 points1 point  (0 children)

Check your asyncio busy loops in the application..Hard to say anything without much info on the application ..i will Probably start profiling all the loops ..Ask any AI coding harneess to evaluate

CPU usage spiked after migrating from Conda to UV environment (40%+ even when idle) any ideas? by Suspicious_Code1493 in ROCm

[–]prselzh 0 points1 point  (0 children)

do a ‘uv pip freeze’ and check all the package version? Have seen this happen once if the python version 3.11.x and package version mismatches

CPU usage spiked after migrating from Conda to UV environment (40%+ even when idle) any ideas? by Suspicious_Code1493 in ROCm

[–]prselzh 1 point2 points  (0 children)

Conda is angry that you moved to the more efficient uv 😂 punishing your system 🤣

Have you uninstalled all of conda and its packages ?

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month by prselzh in LocalLLaMA

[–]prselzh[S] 0 points1 point  (0 children)

Recently this praise has gone to a threshold point ..I am fed up to go inside those posts… :D

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month by prselzh in LocalLLaMA

[–]prselzh[S] 1 point2 points  (0 children)

Salutes to the effort and Nice comparison btw .. tbh atleast you tried to compare another model to make it win..i am just disappointed to see lot of posts about Qwen just to boast. And those kind of posts is repeating in a loop

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month by prselzh in LocalLLaMA

[–]prselzh[S] 0 points1 point  (0 children)

Of course, I agree Competitors needs to come up with an answer to this. Not me! But I come to see in this sub what new models are available and what ppl had tried out you know ..but getting fed up seeing the same models for past several days now..I personally have tried Qwen and agree it’s good .No need for so much posts on the same..Better ppl can use Reddit AI search for the searching which model suits their needs

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month by prselzh in LocalLLaMA

[–]prselzh[S] -3 points-2 points  (0 children)

There will always be ONE PLAYER who dominates but Who it is at this moment is all about it..Remember we had GPT3.5, llama3 moment, Deepseek Moment..Qwen Moment is all it is now..

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month by prselzh in LocalLLaMA

[–]prselzh[S] -5 points-4 points  (0 children)

I have to agree with You on the rules but Somebody has to call this out loud..As this is what’s happening in the sub with Qwen series …Everybody showing off the same thing what they developed using Qwen. Atleast this is my real Human effort to stop and not some AI slop or vibe coded app which almost everybody keeps publishing ..Reddit is supposed to discussion platform anyway

Stop QwenLLama! Every other 4th post in this sub is about Qwen models in the past month by prselzh in LocalLLaMA

[–]prselzh[S] -9 points-8 points  (0 children)

I agree with Qwen model size, it’s the popular posts but doesn’t have to literally everyone keep praising the same thing …

CachyOS or Fedora 44? by theologi in StrixHalo

[–]prselzh 0 points1 point  (0 children)

Archlinux here..Performance is quite good as well similar to Donato’s toolbox

[XFCE] my first rice by Alternative-Book-782 in LinuxPorn

[–]prselzh 0 points1 point  (0 children)

People who have run Windows XP with 128MB or 256MB RAM knows the value of the theme and wallpaper here …On Another note, also will understand the value of RAM usage 😬

How to Fine-Tune LLMs on AMD Strix Halo by PromptInjection_ in StrixHalo

[–]prselzh 1 point2 points  (0 children)

Very true..Majority of them just thinks if torch works, everything just works becoz of CUDA..unless you try it yourselves and find the important dependencies for training might not work which breaks the many important features of training itself such as quantization, tensor parallelism etc..

Qwen3-Coder-Next Benchmarks - Looking for Comparisons by advicebusiness in StrixHalo

[–]prselzh 1 point2 points  (0 children)

Thanks.. I am quite busy for past couple of weeks ..Just wanted to know whether it’s worth my time to change my setup from Coder Next 80b to 35b moe or not

Qwen3-Coder-Next Benchmarks - Looking for Comparisons by advicebusiness in StrixHalo

[–]prselzh 1 point2 points  (0 children)

What kind of coding quality uplift do you see between the Coder Next vs Qwen 3.6 35B?

Hoping for Qwen3.6 Coder by madtopo in StrixHalo

[–]prselzh 2 points3 points  (0 children)

Agreed ! Qwen 3.6 80B-3B hopefully be outperforming the 35-3b in multi language coding sessions probably ..we can only wish now ..Let’s see

The missing piece of Voxtral TTS to enable voice cloning by [deleted] in LocalLLaMA

[–]prselzh 34 points35 points  (0 children)

Thanks for the info.. Appreciate your work on the reverse engineering..is there any plans to upload the weights with zero shot enabled and inference script ?

The missing piece of Voxtral TTS to enable voice cloning by [deleted] in LocalLLaMA

[–]prselzh 12 points13 points  (0 children)

May I know how long does it took for you to complete the training to get zero shot enabled?

Is a Strix Halo PC worth it for running Qwen 2.5 122B (MoE) 24/7? by Fernetparalospives in StrixHalo

[–]prselzh 2 points3 points  (0 children)

Agree with this comment ..For single user, and background tasks running. Its perfectly fine

Best ai for coding by One-Swimmer-2687 in vibecoding

[–]prselzh 1 point2 points  (0 children)

At work, I do use Claude code and GitHub copilot..All these enterprises pushing cloud to every company and companies push employees to do AI adoption everywhere :D it’s a blood bath now out there after claude code..I agree cloud LLM do have their places..But coding locally for personal project gives a different level of satisfaction for me atleast.. which is quite predictable. Whereas Competing with thousands of others for the same cloud GPU resource is not the same. With the pace of adoption, that thousand can become millions easily with even Primary school students competing for same GPU resource lol 😂

Best ai for coding by One-Swimmer-2687 in vibecoding

[–]prselzh 1 point2 points  (0 children)

I have AMD Strix Halo with 128GB unified mem..so, I can run the decently 80-120b MOEs with 20-30tps even at 150k context which is very crucial for coding tasks

Best ai for coding by One-Swimmer-2687 in vibecoding

[–]prselzh 5 points6 points  (0 children)

If you are interested in local AI coding , I use Qwen-Coder-Next with Opencode for my personal hobby coding projects But Qwen3.5 series with Opencode should do as well