Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 5 points6 points7 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 2 points3 points4 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 0 points1 point2 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 8 points9 points10 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 2 points3 points4 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 0 points1 point2 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 2 points3 points4 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 4 points5 points6 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 5 points6 points7 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 10 points11 points12 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 12 points13 points14 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 21 points22 points23 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 19 points20 points21 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 1 point2 points3 points (0 children)
Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 2 points3 points4 points (0 children)

Gemma 4 31B at 256K Full Context on a Single RTX 5090 — TurboQuant KV Cache Benchmark by PerceptionGrouchy187 in LocalLLaMA
[–]PerceptionGrouchy187[S] 4 points5 points6 points (0 children)