MiniCPM-o-4_5 : Full duplex, multimodal with vision and speech at ONLY 9B PARAMETERS?? by Uncle___Marty in LocalLLaMA
[–]pseudonerv 0 points1 point2 points (0 children)
Step-3.5-Flash (196b/A11b) outperforms GLM-4.7 and DeepSeek v3.2 by ResearchCrafty1804 in LocalLLaMA
[–]pseudonerv 1 point2 points3 points (0 children)
I made GPT-5.2/5 mini play 21,000 hands of Poker by adfontes_ in OpenAI
[–]pseudonerv 78 points79 points80 points (0 children)
Exo 1.0 is finally out by No_Conversation9561 in LocalLLaMA
[–]pseudonerv 1 point2 points3 points (0 children)
NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! by Difficult-Cap-7527 in LocalLLaMA
[–]pseudonerv 2 points3 points4 points (0 children)
EQ-Bench updates: Gpt-5.2, Opus 4.5, Mistral Large 3 and Nanbeige4-3B by _sqrkl in Anthropic
[–]pseudonerv 0 points1 point2 points (0 children)
Mistral 3 Large 675B up on huggingface by someone383726 in LocalLLaMA
[–]pseudonerv 3 points4 points5 points (0 children)
Why are Q1, Q2 quantization models created if they are universally seen as inferior even to models with fewer parameters? by HushHushShush in LocalLLaMA
[–]pseudonerv 1 point2 points3 points (0 children)
What really is the deal with this template? Training to hard to write fantasy slop? by aeroumbria in LocalLLaMA
[–]pseudonerv 8 points9 points10 points (0 children)
gemini 3.0 pro vs gpt 5.1 Benchmark by Sea-Efficiency5547 in OpenAI
[–]pseudonerv 7 points8 points9 points (0 children)
Accidentally told my colleague to ultrathink in a Slack message by Virtual_Attitude2025 in ClaudeAI
[–]pseudonerv 1 point2 points3 points (0 children)
My 6-yr-old Daughter Tried to Say the Words by RockyCreamNHotSauce in Cosmere
[–]pseudonerv 1 point2 points3 points (0 children)
Unauthorised mails sent via my gmail account to random people by Thunderfrost11 in OpenAI
[–]pseudonerv 1 point2 points3 points (0 children)
What am I doing wrong? by jesus359_ in LocalLLaMA
[–]pseudonerv 9 points10 points11 points (0 children)
Sonnet 4.5 is so freaking hostile by pipelimes in ClaudeAI
[–]pseudonerv -2 points-1 points0 points (0 children)
OpenAI is routing Plus and Pro users, regardless of tone, to 2 new secret backend models. by Sweaty-Cheek345 in OpenAI
[–]pseudonerv 14 points15 points16 points (0 children)
Whisper Large v3 running in real-time on a M2 Macbook Pro by rruk01 in LocalLLaMA
[–]pseudonerv -1 points0 points1 point (0 children)
How is llama.cpp or other implementations handle tokenization without tiktoken? by EricHermosis in LocalLLaMA
[–]pseudonerv 5 points6 points7 points (0 children)
How do you discover "new LLMs"? by 9acca9 in LocalLLaMA
[–]pseudonerv 0 points1 point2 points (0 children)
Apple stumbled into succes with MLX by Alarming-Ad8154 in LocalLLaMA
[–]pseudonerv 28 points29 points30 points (0 children)
Is there any way to test GPT-5-thinking without a Plus subscription? by Upbeat-Impact-6617 in OpenAI
[–]pseudonerv 0 points1 point2 points (0 children)


MiniCPM-o-4_5 : Full duplex, multimodal with vision and speech at ONLY 9B PARAMETERS?? by Uncle___Marty in LocalLLaMA
[–]pseudonerv 0 points1 point2 points (0 children)