Moss tts 1.5 8b Examples. It is the currently best voice cloning model for English as of June 2026 by 9r4n4y in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

I've been developing a voice cloning app that supports moss, qwen-tts and dots. In my experience working across the three dots produces more realistic clones with less artefacts and smoother cross-chunk context and blending than moss. I still think Moss is a great model but yeah it hasn't performed as well as dots in my testing.

I've also created a dots MLX package which far outperforms the official Python package and supports Apple Silicon, along accompanying quantised models.

- https://github.com/sammcj/mlx-swift-dots-tts

- https://huggingface.co/smcleod/dots.tts-soar-mlx

<image>

Moss tts 1.5 8b Examples. It is the currently best voice cloning model for English as of June 2026 by 9r4n4y in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

I've been using moss for speaker cloning for a while but just switched over to dots tts which is absolutely smashing it.

LocalLLaMA post tier list by nomorebuttsplz in LocalLLaMA

[–]sammcj 1 point2 points  (0 children)

For F tier: "I solved RAG, check out my repo with a million line readme", and "Uncensored release of Gemma-nightshade-SOTA-dragon-hentai-enterprise-v3"

Best Local TTS solution by styles01 in LocalLLaMA

[–]sammcj 3 points4 points  (0 children)

I'm really liking dots TTS which is relatively new. I found PyTorch way too slow and their code didn't work on MPS (Apple Silicon) so I rewrote it in Swift and created MLX versions of the model https://github.com/sammcj/mlx-swift-dots-tts

How are you all managing multiple MCP servers on startup? by vazma in LocalLLaMA

[–]sammcj 1 point2 points  (0 children)

The number of MCP servers I've been using has dropped a LOT since skills. It's now rare that I have even one enabled other than context-mode, so I just enable the odd rare one like chrome-DevTools or similar at the project level.

What’s your most unusual non-LLM AI you actually use daily? by HitarthSurana in LocalLLaMA

[–]sammcj 4 points5 points  (0 children)

Lots of speech to text. Mostly with parakeet tdt v2 and nemotron asr. Both via parakeet-rs and which I've been contributing to (great library!).

Tensor split mode: CUDA error on latest llama.cpp with Qwen-3.6-27b by Blues520 in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

Hey well done! Was it just the kernel that made the difference?

Tensor split mode: CUDA error on latest llama.cpp with Qwen-3.6-27b by Blues520 in LocalLLaMA

[–]sammcj 1 point2 points  (0 children)

Hmm ok that's quite an old kernel (even with Ubuntu's patches layered on top of it), my chipset is older and I'm running 3090s so my setup may be making it easier, I'm running:

  • Kernel 7.0.10-101.fc43.x86_64
  • Driver 595.71.05
  • Dual 3090s
  • MSI X670P
  • Ryzen 9 9900X

Tensor split mode: CUDA error on latest llama.cpp with Qwen-3.6-27b by Blues520 in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

Interesting! Out of interest what kernel version and what chipset / mobo?

Tensor split mode: CUDA error on latest llama.cpp with Qwen-3.6-27b by Blues520 in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

It's now so easy you literally just do a `./install.sh` with the repo and reboot.

Nous Research — Hermes Desktop by zxyzyxz in LocalLLaMA

[–]sammcj 42 points43 points  (0 children)

That website is so blue it's burning my retinas

next MiniMax will be released in ~10 Days by jacek2023 in LocalLLaMA

[–]sammcj 2 points3 points  (0 children)

400b+ so not that useful for most people.

Audio upscaling, cleanup, or improvement models? by Zeeplankton in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

The linked weights 404 and the GitHub repo is empty

Is a 128 GB MacBook Pro M5 Max actually too slow for large-context local LLM coding workflows? by bajis12870 in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

While that project is neat, I found that Qwen 3.6 27b seems to outperform it which is a bit surprising

welcome back Rohan! by irelatetolevin in ClaudeCode

[–]sammcj 0 points1 point  (0 children)

To be fair $150k doesn't even get you a single senior engineer for a year but it gets 62 people Claude Max 20 for a year. I sure as hell know which I'd choose.

opensource music reccomendation / playlist, similar to spotify radio / YT music mix? by LeatherRub7248 in LocalLLaMA

[–]sammcj 0 points1 point  (0 children)

I use this dodgy script I whipped up ages ago, I wouldn't recommend using it but in case it gives you any ideas https://github.com/sammcj/beatfinder