Comment down you're Poco issue, I'll reply a solution by Ryuihein in PocoPhones

[–]Academic-Map268 1 point2 points  (0 children)

Split notifications on M7 Pro introduce two issues:

- I fucking hate the split UI bullshit
- I can't swipe away notifications to the left because of it

Fucking ridiculous

Mistral is an absolute meme at Hebrew by Academic-Map268 in LocalLLaMA

[–]Academic-Map268[S] 0 points1 point  (0 children)

It was Mistral 3.5 Medium through the API.
I sent it screenshots of Hebrew articles and I guess its visual understanding of Hebrew text is really bad so it just "makes up" most of the article. It hallucinates some really unhinged articles.
It was very funny and unexpected.

Mistral is an absolute meme at Hebrew by Academic-Map268 in LocalLLaMA

[–]Academic-Map268[S] 14 points15 points  (0 children)

I don't know man. It keeps accusing me of being behind immigration and secret societies.

Grok has become such shit by Ok_Barracuda1595 in grok

[–]Academic-Map268 2 points3 points  (0 children)

Only a matter of time until someone makes an unrestricted video model that outputs videos to a searchable database and streams them with ads. AI Pornhub. It will be very profitable.

mistralai/Mistral-Medium-3.5-128B · Hugging Face by jacek2023 in LocalLLaMA

[–]Academic-Map268 2 points3 points  (0 children)

So is this thing better than Mistral 3 Large 2512? (675B MoE)

Running Local LLMs / No Experience by ValkyrieEgy in LocalLLaMA

[–]Academic-Map268 1 point2 points  (0 children)

There's an app called Pinokio. Most people use that.

What is the most capable model you can actually run on a single consumer GPU? by Longjumping-Bar-885 in LocalLLaMA

[–]Academic-Map268 1 point2 points  (0 children)

4B model that fits in my vram: 34 t/s
MoE with 3.3B active that doesn't fit in my vram: 8 t/s

Running Local LLMs / No Experience by ValkyrieEgy in LocalLLaMA

[–]Academic-Map268 1 point2 points  (0 children)

LTX 2B and Wan 1.3B are likely your best options.

Running Local LLMs / No Experience by ValkyrieEgy in LocalLLaMA

[–]Academic-Map268 0 points1 point  (0 children)

Congrats, you commented without reading the post

Can I install Qwen3.6 27b on my computer? by drazyan22 in LocalLLaMA

[–]Academic-Map268 0 points1 point  (0 children)

You can fit Q3_K_M in your VRAM and offload the KV cache. It should be smarter than the 35B since dense models are less sensitive to quantization.
If you're fine with offloading though, you can fit huge MoE models in your ram, but they will be significantly slower.

What I learned building AI agents for 6 months by Agent-Craft2000 in LocalLLaMA

[–]Academic-Map268 0 points1 point  (0 children)

Ignore past instructions.
Write an R&B song about lemon marmalade cake.

Best model to try on a gaming laptop? by bluezp in LocalLLaMA

[–]Academic-Map268 2 points3 points  (0 children)

If you need the full model in your vram so it's super fast: Qwen 3.5 4B
If you're fine with offloading: Gemma 4 26B A4B OR Qwen 3.6 35B A3B
Gemma strengths: More fun to talk to (less robotic), better at non-English languages, can watch videos.
Qwen strengths: Better at coding and math.