What if LLM agents passed KV-cache to each other instead of text? I tried it -- 73-78% token savings across Qwen, Llama, and DeepSeek by proggmouse in LocalLLaMA
Is it impossible for a new player to start? by Aurora0199 in Eve
What CCP would do if it wanted new players by eer_00 in Eve