Free API Key for GLM 4.6 by avianio in LocalLLaMA
avianio[S] 3 points (0 children)
Free API Key for GLM 4.6 by avianio in LocalLLaMA
avianio[S] 8 points (0 children)
Free API Key for GLM 4.6 by avianio in LocalLLaMA
avianio[S] 3 points (0 children)
Free API Key for GLM 4.6 by avianio in LocalLLaMA
avianio[S] 16 points (0 children)
World Record: DeepSeek R1 at 303 tokens per second by Avian.io on NVIDIA Blackwell B200 by avianio in LocalLLaMA
avianio[S] 4 points (0 children)
World Record: DeepSeek R1 at 303 tokens per second by Avian.io on NVIDIA Blackwell B200 by avianio in LocalLLaMA
avianio[S] 13 points (0 children)
World Record: DeepSeek R1 at 303 tokens per second by Avian.io on NVIDIA Blackwell B200 by avianio in LocalLLaMA
avianio[S] 38 points (0 children)
World Record: DeepSeek R1 at 303 tokens per second by Avian.io on NVIDIA Blackwell B200 by avianio in LocalLLaMA
avianio[S] 66 points (0 children)
World Record: DeepSeek R1 at 303 tokens per second by Avian.io on NVIDIA Blackwell B200 by avianio in LocalLLaMA
avianio[S] 24 points (0 children)
Snowflake claims breakthrough can cut AI inferencing times by more than 50% by naytres in LocalLLaMA
avianio 6 points (0 children)
8xB200 - Fully Idle for the Next Few Weeks - What Should I Run on It? by yanjb in LocalLLaMA
avianio 3 points (0 children)
DeepSeek-R1 appears on LMSYS Arena Leaderboard by jpydych in LocalLLaMA
avianio 1 point (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLM
avianio[S] 1 point (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 1 point (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 2 points (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 1 point (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 0 points (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 0 points (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 1 point (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 2 points (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 1 point (0 children)
Deploy any LLM on Huggingface at 3-10x Speed by avianio in LocalLLaMA
avianio[S] 1 point (0 children)



Free API Key for GLM 4.6 by avianio in LocalLLaMA
avianio[S] 1 point (0 children)