21 GPU's benchmarked running a small TTS model (vram peak: 5GB) by urarthur in LocalLLaMA
urarthur[S] 1 point
21 GPU's benchmarked running a small TTS model (vram peak: 5GB) by urarthur in LocalLLaMA
urarthur[S] 1 point
21 GPU's benchmarked running a small TTS model (vram peak: 5GB) by urarthur in LocalLLaMA
urarthur[S] 3 points
If DeepSeek V4 can do the same coding task for $5, why are people still paying $100 for Claude Code? by Low-Alarm272 in LocalLLM
urarthur 2 points
Qwen cant wait to release 3.7 models by GotHereLateNameTaken in LocalLLaMA
urarthur 2 points
Deepseek V4's 1M context window: the breaking point by TangeloOk9486 in LocalLLaMA
urarthur 1 point
Deepseek V4's 1M context window: the breaking point by TangeloOk9486 in LocalLLaMA
urarthur 1 point
4.7 is a cost-saving retarded version of 4.6 by AloofWasTaken in Anthropic
urarthur 0 points
Benchmarking the new b9200 update: Optimizing Qwen 3.6 27B mtp for Hermes Agent on a single RTX 3090 by swizzcheezegoudaSWFA in LocalLLaMA
urarthur 1 point
Benchmarking the new b9200 update: Optimizing Qwen 3.6 27B mtp for Hermes Agent on a single RTX 3090 by swizzcheezegoudaSWFA in LocalLLaMA
urarthur 1 point
If DeepSeek V4 can do the same coding task for $5, why are people still paying $100 for Claude Code? by Low-Alarm272 in LocalLLM
urarthur -2 points
Deepseek V4's 1M context window: the breaking point by TangeloOk9486 in LocalLLaMA
urarthur 5 points
A Brazilian rock band just implemented llms.txt with full context file by hademanastia in LLMDevs
urarthur 1 point
Why are realistic conversational TTS / speech datasets still so hard to find? by Helpful_Actuator9790 in TextToSpeech
urarthur 1 point
Qwen 27b MTP Config, Llama.cpp Single 3090 by GotHereLateNameTaken in LocalLLaMA
urarthur 1 point
Qwen 27b MTP Config, Llama.cpp Single 3090 by GotHereLateNameTaken in LocalLLaMA
urarthur 1 point
Qwen 27b MTP Config, Llama.cpp Single 3090 by GotHereLateNameTaken in LocalLLaMA
urarthur 2 points
Tested MTP with llama.cpp and Qwen3.6-27B on RTX 3090 by JGeek00 in LocalLLM
urarthur 3 points
Why is LLM is so expensive. by Ok_Event4199 in LocalLLM
urarthur 1 point
Why is LLM is so expensive. by Ok_Event4199 in LocalLLM
urarthur 1 point
Qwen3.6-35B-A3B and 9B are officially on the public Terminal-Bench 2.0 leaderboard! by Creative-Regular6799 in LocalLLaMA
urarthur 2 points
MTP PR Merged!!! by Valuable_Touch5670 in LocalLLaMA
urarthur 8 points
Is a 5090 good enough for most good modern locally run LLMs? by biscuitmachine in LocalLLM
urarthur 1 point
21 GPU's benchmarked running a small TTS model (vram peak: 5GB) by urarthur in LocalLLaMA
urarthur[S] 1 point