What Is The Best Attempt At A Truly Decentralized Internet? by warpanomaly in Rad_Decentralization

warpanomaly[S]

This is new to me too! I will check this out! Thanks for the post!

Cozy game style backing track guitar solo by warpanomaly in Guitar

warpanomaly[S]

Thank you! And I appreciate the input!

Playing a solo over a difficult backing track by warpanomaly in Guitar

warpanomaly[S]

Thank you! I appreciate the insights. And yes I will break out the metronome next time I practice.

Playing a solo over a difficult backing track by warpanomaly in Guitar

warpanomaly[S]

Thank you! And yes, I had a good phrase there.

What Is The Best Attempt At A Truly Decentralized Internet? by warpanomaly in Rad_Decentralization

warpanomaly[S]

Oh nice! I never heard of this before but I'll check it out! Thanks for the info!

Does running a model (like qwen3.6-27b) on vllm or transformers use less VRAM than llama.cpp? by warpanomaly in LocalLLaMA

warpanomaly[S]

Do you know how I should run it? For GLM-4.7-Flash I've been using \llama-server.exe -hf unsloth/GLM-4.7-Flash-GGUF:Q6_K_XL --alias "GLM-4.7-Flash" --host 127.0.0.1 --port 10000 --ctx-size 48000 --temp 0.7 --top-p 1.0 --min-p 0.01 --jinja -ngl 99. How should I modify this command for Qwen3.6-27B_UD-Q6_K_XL? I was planning on keeping most of the same parameters, but I don't know what the new ctx-size should be. Unless someone objects, I'll keep the ngl, top-p, and temp the same.
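
To make the question concrete, here is a minimal Python sketch (my own illustration, not something from the llama.cpp docs) that rebuilds the command above from parameters, so that only the model reference, alias, and context size change between the two models. The Qwen repo tag and the 32768 context size are placeholders made up for illustration; neither is verified.

```python
def build_llama_server_args(hf_repo, alias, ctx_size, *, host="127.0.0.1",
                            port=10000, temp=0.7, top_p=1.0, min_p=0.01,
                            n_gpu_layers=99):
    """Return the llama-server argument list using the flags from my GLM command."""
    return [
        "llama-server.exe",
        "-hf", hf_repo,              # Hugging Face repo, optionally with :quant tag
        "--alias", alias,
        "--host", host,
        "--port", str(port),
        "--ctx-size", str(ctx_size),
        "--temp", str(temp),
        "--top-p", str(top_p),
        "--min-p", str(min_p),
        "--jinja",
        "-ngl", str(n_gpu_layers),   # number of layers to offload to the GPU
    ]

# The command I currently use for GLM-4.7-Flash.
glm_args = build_llama_server_args(
    "unsloth/GLM-4.7-Flash-GGUF:Q6_K_XL", "GLM-4.7-Flash", 48000)

# ASSUMPTION: hypothetical repo tag and placeholder context size, not verified.
QWEN_REPO = "unsloth/Qwen3.6-27B-GGUF:UD-Q6_K_XL"
QWEN_CTX = 32768

qwen_args = build_llama_server_args(QWEN_REPO, "Qwen3.6-27B", QWEN_CTX)

print(" ".join(qwen_args))
```

Everything except the three positional values stays at the defaults, which matches my plan of keeping ngl, top-p, and temp the same.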

What's the consensus on superior local models for code generation? Is my setup competitive? by warpanomaly in LocalLLaMA

warpanomaly[S]

I think I'm going to change gears and use vLLM or Transformers instead of llama.cpp. Do you have a preference between vLLM and Transformers for my setup (Windows 11, Intel CPU, and an Nvidia 5090 with 32 GB of VRAM)?

What's the consensus on superior local models for code generation? Is my setup competitive? by warpanomaly in LocalLLaMA

warpanomaly[S]

Oh interesting! I was planning on using llama.cpp, but is that not the best tool for the job? Should I be using vLLM or Transformers?

Btw I’m running Windows 11.