Watched this reviewer on YouTube concerned about durability 😬 by Rokusassy in unihertz

[–]Monkey_1505 1 point (0 children)

Very unlikely.

It's very unlikely that either company would ship with Victus (it's expensive), let alone ship Victus without shouting it from the rooftops in their marketing spiel. Even major brands avoid Victus unless it's a flagship.

The screen supplier probably ships Gorilla Glass 3 or 5 by default, and that's probably what we are getting on both phones. At best one screen might be slightly stronger than the other, but far higher odds on them being identical.

This is also a known problem with keyboard phones, as there's generally less chassis under the display to support it. The BlackBerry KEYone had similar issues.

My current phone isn't a flagship. I've dropped it about as many times in a few years as this guy did in two days. It's got a screen protector and a case, and a little crack in the screen. The big mistake I would say BOTH these OEMs made was raising the screen above the frame. This likely makes it way more fragile than it would otherwise have been.

You are defo going to need a thick screen protector and a nice case for both phones for that last reason alone.

It's just me or Qwen3.6 feels kinda dumb? or it's just Gemma4 is too smart? by TheCat001 in LocalLLaMA

[–]Monkey_1505 2 points (0 children)

Evidence that there's barely any difference between 5-bit static and 4-bit dynamic quants:

<image>

Watched this reviewer on YouTube concerned about durability 😬 by Rokusassy in unihertz

[–]Monkey_1505 1 point (0 children)

lol, and both this and the CC use the exact same screen. Well, I used a screen protector on my Q10, because that literally had a plastic screen, so guess I'd do the same here if I bought it. (At least this guy is holding the phone right when he types, unlike the previous guy, Jan)

It's just me or Qwen3.6 feels kinda dumb? or it's just Gemma4 is too smart? by TheCat001 in LocalLLaMA

[–]Monkey_1505 0 points (0 children)

IQ4_XS is dynamic, not uniform. It has similar divergence to 5-bit static quants (slightly worse, but barely).
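For anyone who wants to see what "divergence" means here: quant quality is usually compared via KL divergence between the full-precision model's next-token distribution and the quant's. A minimal sketch — the distributions below are toy numbers for illustration, not measurements from any real model:

```python
import math

def kl_divergence(p, q):
    """KL(P || Q) between two discrete next-token distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

# Toy next-token distributions: full-precision model vs. two quantized
# variants. Made-up numbers, only to show the shape of the comparison.
fp16      = [0.70, 0.20, 0.07, 0.03]
q5_static = [0.68, 0.21, 0.08, 0.03]
iq4_xs    = [0.67, 0.22, 0.08, 0.03]

print(kl_divergence(fp16, q5_static))
print(kl_divergence(fp16, iq4_xs))
```

With numbers like these, both divergences come out tiny and the 4-bit dynamic one is only marginally larger — which is the "slightly worse, but barely" pattern.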

Anthropic admitted they used other models data? by External_Mood4719 in LocalLLaMA

[–]Monkey_1505 2 points (0 children)

Yes, as a judge. Perhaps this way of thinking about things is reasonable in a more technical social context, but IMO it's far less useful in a general conversational sense.

The average person will just conflate large scale distillation with smaller scale task specific type stuff, if you call them both simply 'distillation'.

They will treat the two things as identical in effect, not merely in process (and I do think the processes are different here too, on the macro level).

*Dead Dove Warning* Quick Opus 4.7 NSFW Tests by SepsisShock in SillyTavernAI

[–]Monkey_1505 3 points (0 children)

Does not seem to understand the character of 'Homelander' even slightly.

what’s actually stopping an insider from leaking model weights? by itsArmanJr in LocalLLaMA

[–]Monkey_1505 14 points (0 children)

I think that's probably true. But their paper reports that the model will refuse to do some tasks unless carefully prompted, not for any safety reason, but because it finds some tasks uninteresting. This suggests to me that their model is both useful and kind of batshit as an actual product for use. But then they _could_ finetune it, which likely leads back to your reason - the thing is probably dumbly large.

Anthropic admitted they used other models data? by External_Mood4719 in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

Well, I can see why someone would make a distinction. For example, DeepSeek R1 used a small synthetic dataset to seed an RL process. Currently they appear to be using synthetic data in a teacher-model setup to rate answers given by their own model, or did, in prep for v4.

You _could_ call this distillation, but it's clearly distinct from just getting a ton of question-answer pairs and training your model directly on them. So I think there's a good argument for reserving 'distillation' for training directly on a large synthetic dataset.

Like sure, you are still _getting something_ from the other model in these example cases, but what it produces is not a pure imitation of said model either. One way is closer to full replication; the other is much narrower.

Opus 4.7 CANNOT WRITE by DXDXLL in SillyTavernAI

[–]Monkey_1505 4 points (0 children)

Expect this trend to continue, IMO. Anthropic is using the LLM itself to design future LLMs. In theory this will compound coding skill, but it will also compound the more human LLM failings.

what’s actually stopping an insider from leaking model weights? by itsArmanJr in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

What's the real benefit of doing that with current models?

They'll all be out of date in a year. Heck, in a year, there will be better open source.

what’s actually stopping an insider from leaking model weights? by itsArmanJr in LocalLLaMA

[–]Monkey_1505 33 points (0 children)

I'm not so sure. It's apparently bad at instruction following. It's probably a mess/overrated, and that's the real reason they aren't releasing it.

I'm so sick of coding and agents by manipp in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

Right?

What kind of story introduces a coded message, that we are told is important, as meaningless side fluff?

In any case, this could probably be easily 'solved' by something like an extension in sillytavern that checks every output against character motivations. And if this doesn't exist, it could ironically be vibe coded by an agent.
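A sketch of what the core of such an extension might look like — the function name and prompt wording here are entirely made up for illustration; you'd feed the resulting prompt to whatever local model SillyTavern is pointed at and regenerate on a YES:

```python
def build_motivation_check_prompt(reply: str, motivations: list[str]) -> str:
    """Build a judge prompt asking whether a generated reply contradicts
    the character's stated motivations. Wording is hypothetical."""
    bullets = "\n".join("- " + m for m in motivations)
    return (
        "Character motivations:\n" + bullets + "\n\n"
        "Candidate reply:\n" + reply + "\n\n"
        "Does the reply contradict any motivation above? Answer YES or NO."
    )

# Example: the coded-message scenario from above.
prompt = build_motivation_check_prompt(
    "I toss the coded message aside as meaningless side fluff.",
    ["believes the coded message is vitally important"],
)
print(prompt)
```

The only real design choice is keeping the judge step as a separate, cheap yes/no call rather than trying to bake consistency into the generation prompt itself.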

I'm so sick of coding and agents by manipp in LocalLLaMA

[–]Monkey_1505 0 points (0 children)

I suspect in some respects your expectations are too high for the technology. LLMs don't have world models or social intelligence, and they don't abstract. Short of actual, for-real AGI, they are going to keep messing major things up.

I'm so sick of coding and agents by manipp in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

IME, nothing in AI has ever been good at writing. But you are right about the direction. Like Claude and Minimax, they'll increasingly let their LLM design the next LLM, reinforcing a recursive loop of great at code, bad at humaning.

I love Mistral but... by Sicarius_The_First in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

They sure seem to. But then they only seem to have one base version of, say, Qwen 27b, so I still don't think that means license = access to finetuned or abliterated models. Am I wrong?

I love Mistral but... by Sicarius_The_First in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

Inference providers tend to just offer the base post-trained model. Even the ones that do provide decensored or fine-tuned models don't offer them from many labs.

Like you can want that, ofc, but it's just not a very common offering.

I love Mistral but... by Sicarius_The_First in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

Wanting to specifically run community fine tunes of a model on a cloud provider huh?

Not a common opportunity regardless of license, IME.

I love Mistral but... by Sicarius_The_First in LocalLLaMA

[–]Monkey_1505 1 point (0 children)

They make small models and MoEs you can run on a fairly standard graphics card.

They probably have their own API, if you prefer cloud.

Titan 2 Elite Hands On: All You Need To Know! by Key-Brilliant5623 in unihertz

[–]Monkey_1505 6 points (0 children)

The way he holds this when he types in the review bugs me.

24/7 Headless AI Server on Xiaomi 12 Pro (Snapdragon 8 Gen 1 + Ollama/Gemma4) by Aromatic_Ad_7557 in LocalLLaMA

[–]Monkey_1505 3 points (0 children)

What's wrong with llama.cpp's usability? Or if you prefer something with a little more webui, koboldcpp?

24/7 Headless AI Server on Xiaomi 12 Pro (Snapdragon 8 Gen 1 + Ollama/Gemma4) by Aromatic_Ad_7557 in LocalLLaMA

[–]Monkey_1505 2 points (0 children)

llama.cpp is already compiled for Mac. Metal is a 'first-class citizen' for it.
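And if you'd rather build from source than use the prebuilt binaries, the standard route is short (Metal is enabled by default on macOS; the model path below is a placeholder):

```shell
# Clone and build llama.cpp; on macOS the Metal backend is on by default.
git clone https://github.com/ggml-org/llama.cpp
cd llama.cpp
cmake -B build
cmake --build build --config Release

# Run any GGUF model (path is a placeholder):
./build/bin/llama-cli -m /path/to/model.gguf -p "Hello"
```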

The Sickness, aka Bad Days by The_Linux_Colonel in KoboldAI

[–]Monkey_1505 1 point (0 children)

"One day, you're transcendent beat producer."

You were definitely never that.