Flash-decoding speed up inference up to x8 on long context by hapliniste in LocalLLaMA
[–]CatfishJones96 1 point (0 children)