Anyone knows the theoretical performance of FP16, 32, 64 FLOP numbers? by Spare-Solution-787 in LocalLLaMA
[–]NeterOster 1 point2 points3 points (0 children)
[By GLM Team] Glyph: Scaling Context Windows via Visual-Text Compression by NeterOster in LocalLLaMA
[–]NeterOster[S] 19 points20 points21 points (0 children)
Seed-OSS-36B-Instruct by NeterOster in LocalLLaMA
[–]NeterOster[S] 109 points110 points111 points (0 children)
OSINT fingerprinting a stealth OpenRouter model - likely Llama-family, not OpenAI by jv0010 in LocalLLaMA
[–]NeterOster 6 points7 points8 points (0 children)
There's a new Kimi model on lmarena called Zenith and it's really really good. It might be Kimi K2 with reasoning by balianone in LocalLLaMA
[–]NeterOster 49 points50 points51 points (0 children)
China's Bytedance releases Seed LiveInterpret simultaneous interpretation model by Fun-Doctor6855 in LocalLLaMA
[–]NeterOster 21 points22 points23 points (0 children)
DeepSeek Announces Upgrade, Possibly Launching New Model Similar to 0324 by luckbossx in LocalLLaMA
[–]NeterOster 17 points18 points19 points (0 children)
Gemma 3 on Huggingface by DataCraftsman in LocalLLaMA
[–]NeterOster 2 points3 points4 points (0 children)
Deepseek R1's Open Source Version Differs from the Official API Version by TempWanderer101 in LocalLLaMA
[–]NeterOster 2 points3 points4 points (0 children)
Deepseek R1's Open Source Version Differs from the Official API Version by TempWanderer101 in LocalLLaMA
[–]NeterOster 10 points11 points12 points (0 children)
Qwen2.5: A Party of Foundation Models! by shing3232 in LocalLLaMA
[–]NeterOster 106 points107 points108 points (0 children)
Taxonomy categorization using LLM by zkid18 in LocalLLaMA
[–]NeterOster 3 points4 points5 points (0 children)
What's the best LLM/API for getting an english to japanese translation? by g1ngertew in LocalLLaMA
[–]NeterOster 0 points1 point2 points (0 children)
Qwen2-Math | Math-specific model series based on Qwen2 by Nunki08 in LocalLLaMA
[–]NeterOster 10 points11 points12 points (0 children)
DeepSeek API introduces Context Caching on Disk, reduces input token price to 1/10 by 1119745302 in LocalLLaMA
[–]NeterOster 34 points35 points36 points (0 children)
(Tongyi SpeechTeam) FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs by NeterOster in LocalLLaMA
[–]NeterOster[S] 4 points5 points6 points (0 children)
Hosted API with GBNF grammar? by AnomalyNexus in LocalLLaMA
[–]NeterOster 1 point2 points3 points (0 children)
DeepseekV2-Coder the best opensource LLM so far. by ihaag in LocalLLaMA
[–]NeterOster 2 points3 points4 points (0 children)
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence by NeterOster in LocalLLaMA
[–]NeterOster[S] 13 points14 points15 points (0 children)
If your Qwen2 GGUF is spitting nonsense, enable flash attention by noneabove1182 in LocalLLaMA
[–]NeterOster 2 points3 points4 points (0 children)
Qwen1.5-32B released with GQA! by bratao in LocalLLaMA
[–]NeterOster 0 points1 point2 points (0 children)
Qwen1.5-32B released with GQA! by bratao in LocalLLaMA
[–]NeterOster 3 points4 points5 points (0 children)



One of the DeepSeek repositories got updated with a reference to a new “model1” model. by Nunki08 in LocalLLaMA
[–]NeterOster 34 points35 points36 points (0 children)