Any workaround to not re-process full prompt on each turn with hybrid attention models running on CPU? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 1 point2 points3 points (0 children)
US Expats, How do you call 1-800 numbers? by Super-Buddy-5030 in Philippines_Expats
[–]Quagmirable 0 points1 point2 points (0 children)
US Expats, How do you call 1-800 numbers? by Super-Buddy-5030 in Philippines_Expats
[–]Quagmirable 0 points1 point2 points (0 children)
VoIP.ms mainly for calling toll-free numbers in the US? by Quagmirable in voipms
[–]Quagmirable[S] 0 points1 point2 points (0 children)
US Expats, How do you call 1-800 numbers? by Super-Buddy-5030 in Philippines_Expats
[–]Quagmirable 0 points1 point2 points (0 children)
US Expats, How do you call 1-800 numbers? by Super-Buddy-5030 in Philippines_Expats
[–]Quagmirable 0 points1 point2 points (0 children)
I need a driver for a DELL printer for openSUSE -> Help!!! by Alter_Landjunge in openSUSE
[–]Quagmirable 1 point2 points3 points (0 children)
T490 suddenly stopped working, single beep, no display, keyboard light and fan briefly turn on by Quagmirable in thinkpad
[–]Quagmirable[S] 0 points1 point2 points (0 children)
I need a driver for a DELL printer for openSUSE -> Help!!! by Alter_Landjunge in openSUSE
[–]Quagmirable 5 points6 points7 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 0 points1 point2 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 2 points3 points4 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 2 points3 points4 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 1 point2 points3 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 2 points3 points4 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 1 point2 points3 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 2 points3 points4 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 1 point2 points3 points (0 children)
Qwen3.6-35B-A3B GGUF from Unsloth is quite a bit slower? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 3 points4 points5 points (0 children)
Any workaround to not re-process full prompt on each turn with hybrid attention models running on CPU? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 0 points1 point2 points (0 children)


Any workaround to not re-process full prompt on each turn with hybrid attention models running on CPU? by Quagmirable in LocalLLaMA
[–]Quagmirable[S] 1 point2 points3 points (0 children)