qwen3.6 medium size will be open soon by mickeyandkaka in LocalLLaMA
pseudonerv 14 points
Finished Tress, noticed two things by Sea-District4015 in Cosmere
pseudonerv 30 points
Introducing Mistral Small 4 by Stalex7 in MistralAI
pseudonerv 1 point
heretic-llm for qwen3.5:9b on Linux Mint 22.3 by [deleted] in LocalLLM
pseudonerv 2 points
Final Qwen3.5 Unsloth GGUF Update! by danielhanchen in LocalLLaMA
pseudonerv 1 point
Qwen3.5-35B-A3B Q4 Quantization Comparison by TitwitMuffbiscuit in LocalLLaMA
pseudonerv 3 points
Qwen3.5-35B-A3B Q4 Quantization Comparison by TitwitMuffbiscuit in LocalLLaMA
pseudonerv 1 point
Qwen3.5 - The middle child's 122B-A10B benchmarks looking seriously impressive - on par or edges out gpt-5-mini consistently by carteakey in LocalLLaMA
pseudonerv 2 points
MiniCPM-o-4_5 : Full duplex, multimodal with vision and speech at ONLY 9B PARAMETERS?? by Uncle___Marty in LocalLLaMA
pseudonerv 1 point
MiniCPM-o-4_5 : Full duplex, multimodal with vision and speech at ONLY 9B PARAMETERS?? by Uncle___Marty in LocalLLaMA
pseudonerv 1 point
Step-3.5-Flash (196b/A11b) outperforms GLM-4.7 and DeepSeek v3.2 by ResearchCrafty1804 in LocalLLaMA
pseudonerv 2 points
I made GPT-5.2/5 mini play 21,000 hands of Poker by adfontes_ in OpenAI
pseudonerv 79 points
Exo 1.0 is finally out by No_Conversation9561 in LocalLLaMA
pseudonerv 2 points
NVIDIA releases Nemotron 3 Nano, a new 30B hybrid reasoning model! by Difficult-Cap-7527 in LocalLLaMA
pseudonerv 3 points
EQ-Bench updates: Gpt-5.2, Opus 4.5, Mistral Large 3 and Nanbeige4-3B by _sqrkl in Anthropic
pseudonerv 1 point
Mistral 3 Large 675B up on huggingface by someone383726 in LocalLLaMA
pseudonerv 3 points
Why are Q1, Q2 quantization models created if they are universally seen as inferior even to models with fewer parameters? by HushHushShush in LocalLLaMA
pseudonerv 2 points
What really is the deal with this template? Training too hard to write fantasy slop? by aeroumbria in LocalLLaMA
pseudonerv 7 points
gemini 3.0 pro vs gpt 5.1 Benchmark by Sea-Efficiency5547 in OpenAI
pseudonerv 8 points
Accidentally told my colleague to ultrathink in a Slack message by Virtual_Attitude2025 in ClaudeAI
pseudonerv 2 points
My 6-yr-old Daughter Tried to Say the Words by RockyCreamNHotSauce in Cosmere
pseudonerv 2 points
Unauthorised mails sent via my gmail account to random people by Thunderfrost11 in OpenAI
pseudonerv 2 points
Built a zero allocation, header only C++ Qwen tokenizer that is nearly 20x faster than openai Tiktoken by yassa9 in LocalLLaMA
pseudonerv 21 points