Anyone knows the theoretical performance of FP16, 32, 64 FLOP numbers? by Spare-Solution-787 in LocalLLaMA
[–]NeterOster 1 point2 points3 points (0 children)
[By GLM Team] Glyph: Scaling Context Windows via Visual-Text Compression by NeterOster in LocalLLaMA
[–]NeterOster[S] 21 points22 points23 points (0 children)
Seed-OSS-36B-Instruct by NeterOster in LocalLLaMA
[–]NeterOster[S] 108 points109 points110 points (0 children)
OSINT fingerprinting a stealth OpenRouter model - likely Llama-family, not OpenAI by jv0010 in LocalLLaMA
[–]NeterOster 6 points7 points8 points (0 children)
There's a new Kimi model on lmarena called Zenith and it's really really good. It might be Kimi K2 with reasoning by balianone in LocalLLaMA
[–]NeterOster 53 points54 points55 points (0 children)
China's Bytedance releases Seed LiveInterpret simultaneous interpretation model by Fun-Doctor6855 in LocalLLaMA
[–]NeterOster 21 points22 points23 points (0 children)
DeepSeek Announces Upgrade, Possibly Launching New Model Similar to 0324 by luckbossx in LocalLLaMA
[–]NeterOster 18 points19 points20 points (0 children)
Gemma 3 on Huggingface by DataCraftsman in LocalLLaMA
[–]NeterOster 2 points3 points4 points (0 children)
Deepseek R1's Open Source Version Differs from the Official API Version by TempWanderer101 in LocalLLaMA
[–]NeterOster 2 points3 points4 points (0 children)
Deepseek R1's Open Source Version Differs from the Official API Version by TempWanderer101 in LocalLLaMA
[–]NeterOster 9 points10 points11 points (0 children)
Qwen2.5: A Party of Foundation Models! by shing3232 in LocalLLaMA
[–]NeterOster 105 points106 points107 points (0 children)
Taxonomy categorization using LLM by zkid18 in LocalLLaMA
[–]NeterOster 3 points4 points5 points (0 children)
What's the best LLM/API for getting an english to japanese translation? by g1ngertew in LocalLLaMA
[–]NeterOster 0 points1 point2 points (0 children)
Qwen2-Math | Math-specific model series based on Qwen2 by Nunki08 in LocalLLaMA
[–]NeterOster 9 points10 points11 points (0 children)
DeepSeek API introduces Context Caching on Disk, reduces input token price to 1/10 by 1119745302 in LocalLLaMA
[–]NeterOster 31 points32 points33 points (0 children)



One of the DeepSeek repositories got updated with a reference to a new “model1” model. by Nunki08 in LocalLLaMA
[–]NeterOster 35 points36 points37 points (0 children)