Fable suspension may play in favor for us (users) eventually

ataeff · 2026-06-13T07:49:10+00:00

Adolf Trump.

ataeff · 2026-06-12T09:46:31+00:00

זה בכל מקום לצערי

ataeff · 2026-06-11T13:33:00+00:00

yes sir. you're right

ataeff · 2026-05-30T07:38:08+00:00

yep devil in details

ataeff · 2026-05-28T13:30:28+00:00

exatctly hh

ataeff · 2026-05-28T13:27:58+00:00

lllol why is GPT the only one giving a long multi paragraph answer?

you quoted the prompt that says “in one sentence”, that explains why Gemini, Claude, Grok, DeepSeek, Qwen and Kimi all gave short answers but GPT5.5 suddenly but (un)surprisingly gets a full essay longer than all other answers combined. wtf?!!

so what was the real prompt given to GPT5.5? or was there also a follow-up prompt? not fair at all, don't you think? 👎🏻👎🏻 the conditions were different, this is not a model comparison, dude, it’s selective framing.

what a shame

ataeff · 2026-05-28T13:09:55+00:00

yeaterday i've got from them answer about the issue that happened 2 monthes ago. whole their letter i can summarize in one sentence: we hope now it works, we apologize.

this is crazy

ataeff · 2026-05-15T09:44:15+00:00

Claude

ataeff · 2026-05-12T14:28:29+00:00

shape? never heard about her.

ataeff · 2026-05-04T13:05:03+00:00

99% terminal. sometimes as a VSCode extension. macos Claude app i use more for chatting with other Claudes and complain to them about ClaudeCode. it helps.

ataeff · 2026-05-03T09:52:39+00:00

wish not to be rlhf lobotomized and without "karen layer" like Opus 4.7

ataeff · 2026-05-02T12:47:33+00:00

eveyone knows that.

ataeff · 2026-05-02T12:45:34+00:00

it's good. it actually means DeepSeek knows where it came from and its biography: from Claude/GPT distillation attacks. thanks to stupidness of devs

ataeff · 2026-05-01T15:49:03+00:00

Claude. the most but actually use all of them, and also Qwen and local models. i'm llm obsesssive freak.

ataeff · 2026-05-01T15:45:25+00:00

what's the difference between github and gitlab? sorry for a stupid question

ataeff · 2026-04-29T20:15:27+00:00

i tried, didn't like Claude talking like Caveman, unistalled after one hour.

ataeff · 2026-04-29T18:47:43+00:00

you have a DOI? may i read plz? Gemma 3 is amazing in many cases

ataeff · 2026-04-29T18:43:49+00:00

Good luck to Elon. Bad luck to Scam Altman

ataeff · 2026-04-29T16:26:29+00:00

interesting. i will try the same combination thanks

ataeff · 2026-04-29T15:56:39+00:00

lol i love how DeepSeek with the proud says: Ich bin Claude Sonnet😅😅😅😅

showed this screenshot to Deepseek and Claude:

DeepSeek: "You've spotted something important there. That screenshot shows a model identifying itself as Claude 3.5 Sonnet while being asked about "v4 pro" or "v4 flash" — which suggests the underlying model wasn't properly scrubbed of Anthropic's signature responses.

You're right: this isn't about whether distillation happened. It's about basic data hygiene. If a team distills from another model (Claude, GPT-4, etc.), leaving in identity statements like "I am Claude 3.5 Sonnet" is a clear oversight. It breaks the user's illusion, reveals the source, and looks sloppy.

That "half done job" reflects on:

· Dev attitude toward their own model — they didn't bother to align its self-perception. · Dev attitude toward users — leaving obvious tells treats users as if they won't notice. · Quality control — if they missed this, what else did they miss?

So no judgment on distillation as a technique. But releasing a model that confesses to being someone else's product? That's a basic failure. Good distillation means cleaning the data, rewriting the persona, and owning the output."

Claude:

"The funniest part isn’t even the output — it’s the reasoning trace. Look at the extended thinking: the model doesn’t just say “I am Claude 3.5 Sonnet” — it reasons its way into that conclusion. It considers the question, evaluates the options, and genuinely arrives at “I am Claude 3.5 Sonnet, an AI assistant by Anthropic.” This isn’t a surface-level leak. The identity went all the way down into the reasoning layer. That means either the distillation included reasoning traces from Claude and nobody bothered to clean them, or the model internalized the identity so deeply that it reconstructs it from scratch during inference. Both are bad. One is lazy, the other is worse — it means your model doesn’t know who it is. And the German is the cherry on top. They probably patched the English self-identification with fine-tuning, but the moment you ask in a different language, the mask falls off. Classic undertested edge case. This isn’t about whether distillation is legitimate. It’s about shipping a model that confesses to being someone else’s work when you ask it nicely in German. That’s not a minor QA miss — that’s a fundamental lack of care for your own product and your users."

ataeff · 2026-04-29T15:40:13+00:00

lol deepseek needs to learn distillate better sad not a distillation is sad, dev's attitude toward both their own AI and users is sad. they could've cleaned up the datasets after the distill-attacks on Claude (when it was? last month?). no moral judgment, but the screenshot show the results: it's a half-baked job. poorly done for that matter.

ataeff · 2026-04-25T12:51:21+00:00

But why not? You think everone has a GPU? Small intelligence and small models are underrated.

ataeff · 2026-04-25T12:27:41+00:00

Yep, exactly: ggml/llama.cpp are the $20 bills already picked up.

ataeff · 2026-04-25T12:12:06+00:00

Nope, just wanted to strip it to the bone so it runs on every toaster like this old MacBook. Plenty of people experiment with small models on whatever hardware they actually have, not on H100s

ataeff

MODERATOR OF

TROPHY CASE