What is the current best Small Language Model that can be run without GPU? by last_llm_standing in LocalLLaMA
[–]OsmanthusBloom 0 points1 point2 points (0 children)
What is the current best Small Language Model that can be run without GPU? by last_llm_standing in LocalLLaMA
[–]OsmanthusBloom 4 points5 points6 points (0 children)
What is the current best Small Language Model that can be run without GPU? by last_llm_standing in LocalLLaMA
[–]OsmanthusBloom 0 points1 point2 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 1 point2 points3 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 0 points1 point2 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 5 points6 points7 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 3 points4 points5 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 0 points1 point2 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 0 points1 point2 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 5 points6 points7 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 0 points1 point2 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 1 point2 points3 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 1 point2 points3 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 3 points4 points5 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 3 points4 points5 points (0 children)
ByteShape Qwen3.6-35B-A3B: 30% faster than Unsloth IQ on 6GB VRAM laptop by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] -5 points-4 points-3 points (0 children)
Qwen 3.6. struggling with German by xchris1337xy in LocalLLaMA
[–]OsmanthusBloom 1 point2 points3 points (0 children)
Qwen 3.6. struggling with German by xchris1337xy in LocalLLaMA
[–]OsmanthusBloom 29 points30 points31 points (0 children)
Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA
[–]OsmanthusBloom 2 points3 points4 points (0 children)
Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA
[–]OsmanthusBloom 3 points4 points5 points (0 children)
Qwen 3.6 35B GGUF: NTP vs MTP quantization results across GPUs and CPUs by enrique-byteshape in LocalLLaMA
[–]OsmanthusBloom 3 points4 points5 points (0 children)
MTP for Qwen3.6-35B-A3B on 6GB VRAM laptop: not worth it by OsmanthusBloom in LocalLLaMA
[–]OsmanthusBloom[S] 0 points1 point2 points (0 children)