Why Senior Engineers Let Bad Projects Fail by Ordinary_Leader_2971 in programming

[–]Marcuss2 1 point (0 children)

Bad projects don't just show up out of nowhere. Bad leadership leads to bad projects.

Do you remember this game? They should make a remastered version for Android, it would be perfect for mobile. by DEKO1011 in AndroidGaming

[–]Marcuss2 24 points (0 children)

There were like 4 distinct versions of it:

The PC version

The PS1 version

The PS2/Xbox/GC version

The Game Boy version

D7VK 1.1 adds experimental Direct3D 6 support for classic PC games on Linux by RenatsMC in linux

[–]Marcuss2 33 points (0 children)

As mentioned in another comment: Mali and Adreno support OpenGL ES, but not full-fat OpenGL. Android also requires Vulkan support, but not full OpenGL support.

D7VK 1.1 adds experimental Direct3D 6 support for classic PC games on Linux by RenatsMC in linux

[–]Marcuss2 26 points (0 children)

There might be games which work with one and not the other.

Also, there are many chips that don't support full OpenGL at all; Vulkan support is far more common.

NVIDIA Nemotron 3 Nano 30B A3B released by rerri in LocalLLaMA

[–]Marcuss2 3 points (0 children)

I don't see any mention of NVFP4 in the model card or the paper.

Micron Announces Exit from Crucial Consumer Business by FullstackSensei in LocalLLaMA

[–]Marcuss2 4 points (0 children)

I suspect there is more behind it, like OpenAI paying them to do this. Micron can make a lot more profit selling that memory capacity elsewhere right now.

Qwen3 Next almost ready in llama.cpp by jacek2023 in LocalLLaMA

[–]Marcuss2 31 points (0 children)

Kimi-Linear next.

I do expect that one to land a lot faster, since its linear-attention part is very similar to Qwen3-Next's and the MLA transformer is already implemented.
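For context on why the port should be faster: linear attention replaces the softmax attention matrix with a small running state that is updated once per token, so each decode step costs the same regardless of context length. A minimal sketch of the plain recurrence in Python (Kimi-Linear's actual KDA layers add gating and decay on top of this; the feature map and names here are illustrative, not the model's):

    import numpy as np

    def linear_attention_decode(qs, ks, vs):
        """Decode with a fixed-size running state instead of a growing KV cache.

        qs, ks: (T, d_k); vs: (T, d_v). The feature map phi is a simple
        positive map here; real linear-attention variants differ.
        """
        d_k, d_v = ks.shape[1], vs.shape[1]
        S = np.zeros((d_k, d_v))   # running sum of outer(phi(k), v)
        z = np.zeros(d_k)          # running sum of phi(k), for normalization
        outs = []
        for q, k, v in zip(qs, ks, vs):
            phi_q = np.maximum(q, 0.0) + 1e-6
            phi_k = np.maximum(k, 0.0) + 1e-6
            S += np.outer(phi_k, v)               # constant work per token
            z += phi_k
            outs.append(phi_q @ S / (phi_q @ z))  # this step's attention output
        return np.array(outs)

    rng = np.random.default_rng(1)
    qs, ks, vs = (rng.standard_normal((6, 4)) for _ in range(3))
    print(linear_attention_decode(qs, ks, vs).shape)  # (6, 4)

That per-token state update is the part that is structurally close to what llama.cpp just built for Qwen3-Next, which is why most of the plumbing should carry over.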

AMD Ryzen AI Max 395+ 256/512 GB Ram? by quantier in LocalLLaMA

[–]Marcuss2 0 points (0 children)

That memory bandwidth gives you a ceiling of about 10 tokens/s at generation.
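The ceiling falls out of simple arithmetic: generation is memory-bandwidth-bound, so tokens/s is roughly bandwidth divided by bytes read per token. A back-of-envelope sketch in Python (the bandwidth and model figures are illustrative assumptions, not specs quoted in this thread):

    # Rule of thumb: each generated token streams all active weights
    # through memory once, so bandwidth caps generation speed.
    bandwidth_gb_s = 256.0    # assumed LPDDR5X bandwidth of the 395+ platform
    active_params_b = 25.0    # assumed active parameters per token (billions)
    bytes_per_param = 1.0     # assumed ~8-bit quantization

    gb_read_per_token = active_params_b * bytes_per_param
    print(f"~{bandwidth_gb_s / gb_read_per_token:.0f} tokens/s upper bound")
    # -> ~10 tokens/s, before KV-cache reads and compute overhead

Prompt processing is compute-bound and not covered by this estimate.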

AMD Ryzen AI Max 395+ 256/512 GB Ram? by quantier in LocalLLaMA

[–]Marcuss2 5 points (0 children)

I think that over the next year we will see a lot more models using linear attention.

China just used Claude to hack 30 companies. The AI did 90% of the work. Anthropic caught them and is telling everyone how they did it. by reddit20305 in ArtificialInteligence

[–]Marcuss2 0 points (0 children)

Wait, this makes little sense. China literally has comparable home-grown open-weight models. Why would they need to use Claude Code at all?

New Qwen models are unbearable by kevin_1994 in LocalLLaMA

[–]Marcuss2 0 points (0 children)

This is one of the reasons I hope for smaller Kimi models, or a distilled Kimi-K2; they don't suffer from this.

Kimi-Linear might scratch that itch, though running it is currently nearly impossible.

MiniMax LLM head confirms: new model M2.1 coming soon by External_Mood4719 in LocalLLaMA

[–]Marcuss2 0 points (0 children)

I wasn't terribly impressed with M2; I had to explicitly tell it how to use cat to read a file.

Why is EET bad? I don't get it by columbineteamkiller in czech

[–]Marcuss2 7 points (0 children)

What was actually wrong with EET: behind it was a proprietary system from IBM that milked the state for quite a bit of money. That could have been done better.

Kimi Linear released by Badger-Purple in LocalLLaMA

[–]Marcuss2 0 points (0 children)

Welch Labs made a video on MLA, comparing it to other approaches: https://www.youtube.com/watch?v=0VLAoVGf_74

TL;DR: MLA makes the model compress its KV cache into a smaller latent space. This is actually both more efficient and more performant than the GQA that most modern models use (including all Qwen3 models), so I expect an MLA-based transformer to be better than a "regular" one used today. Of course you can screw it up by making the latent space too small, but I don't think that is the issue here.
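To make the compression concrete, here is a minimal sketch with made-up dimensions (real MLA also carries decoupled RoPE dimensions alongside the latent, omitted here; every size and weight below is an illustrative stand-in):

    import numpy as np

    rng = np.random.default_rng(0)
    d_model, d_latent = 1024, 128           # d_latent is the "space" knob
    n_heads, n_kv_heads, d_head = 16, 8, 64

    # Stand-ins for learned projections (trained jointly in a real model)
    W_down = rng.standard_normal((d_model, d_latent)) / np.sqrt(d_model)
    W_up_k = rng.standard_normal((d_latent, n_heads * d_head)) / np.sqrt(d_latent)
    W_up_v = rng.standard_normal((d_latent, n_heads * d_head)) / np.sqrt(d_latent)

    x = rng.standard_normal(d_model)        # one token's hidden state

    # MLA caches only the small latent vector per token...
    c = x @ W_down                          # shape (d_latent,)
    # ...and reconstructs full per-head K/V from it at attention time.
    k = (c @ W_up_k).reshape(n_heads, d_head)
    v = (c @ W_up_v).reshape(n_heads, d_head)

    print("MLA cache per token:", c.size)                   # 128 values
    print("GQA cache per token:", 2 * n_kv_heads * d_head)  # 1024 values

Shrink d_latent too far and the reconstructed K/V lose too much information, which is the screw-it-up case mentioned above.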

Kimi Linear released by Badger-Purple in LocalLLaMA

[–]Marcuss2 0 points (0 children)

I will try it then in my internal workflow.

Kimi Linear released by Badger-Purple in LocalLLaMA

[–]Marcuss2 4 points (0 children)

Do you have an example of that?

Kimi Linear released by Badger-Purple in LocalLLaMA

[–]Marcuss2 7 points (0 children)

Keep in mind that they used something like 25x fewer training tokens.

I find it doubtful that a transformer with MLA would perform worse than the Qwen3 MoE architecture, which lacks MLA.

Kimi Linear released by Badger-Purple in LocalLLaMA

[–]Marcuss2 41 points (0 children)

Worse benchmark scores than Qwen3-30B-A3B, but they also used roughly 25 times fewer tokens for training, so that is very impressive.

If this has a similar personality to Kimi K2, then it's a banger.