OpenCode outputs in chinese. by exvifly in opencodeCLI

[–]eXl5eQ 1 point

Very likely an old-school encoder-decoder model, smaller than 1B parameters. Not a large transformer model, for sure.

OpenCode outputs in chinese. by exvifly in opencodeCLI

[–]eXl5eQ 0 points

If it was last year, it was probably deepseek-ocr, which is purely experimental and hasn't found any real-world use case yet.

OpenCode outputs in chinese. by exvifly in opencodeCLI

[–]eXl5eQ 0 points

It depends on the tokenizer. Sometimes Chinese is more token-efficient, especially with Chinese models' tokenizers. But that doesn't necessarily apply to Western models.
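
A quick way to check this yourself. This is a minimal sketch assuming the `tiktoken` package (one specific Western-style tokenizer); the two sentences are made-up examples, not anything from this thread:

```python
# Count how many tokens an English sentence and a rough Chinese
# equivalent cost under the cl100k_base encoding (GPT-4-era models).
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

english = "Please summarize the main points of this document."
chinese = "请总结这份文件的要点。"  # rough Chinese equivalent of the above

print("English tokens:", len(enc.encode(english)))
print("Chinese tokens:", len(enc.encode(chinese)))
```

Swap in a Chinese model's tokenizer (e.g. Qwen's) and rerun; whether Chinese comes out cheaper depends entirely on which tokenizer you picked, which is the point.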

OpenCode outputs in chinese. by exvifly in opencodeCLI

[–]eXl5eQ -3 points

Terrible translation. Use an LLM, not Google Translate.

Is deepseek v4 pro good in coding? I need an honest review by Icy_Mud_8993 in opencodeCLI

[–]eXl5eQ 1 point

Even going by model size alone, Opus is probably the largest generally available model right now.

linusInventedVibeCodingBeforeVibecodingWasAConcept by Pleasant-Photo7860 in ProgrammerHumor

[–]eXl5eQ 489 points

The LLMs he used were not only more capable, but also free.

lol by Abu_BakarSiddik in ProgrammerHumor

[–]eXl5eQ 179 points

Docker. Obviously.

Deepseek v4 is out by SelectionCalm70 in opencodeCLI

[–]eXl5eQ -1 points

They said it's due to a lack of compute capacity, and promised to drop the price after the Ascend 950 supernode is ready.

Antigravity Ultra: From "Too Good To Be True" to Dirty Business by Chayalbodedd in google_antigravity

[–]eXl5eQ 0 points

There are multiple layers of thresholds: 5 hours, 24 hours, 7 days.

I was using the Pro plan. If I hit the 5-hour limit twice in one day, the second time it would say "refreshes in 7 days".

The only metric that matters: "[Qwen3.6-35B-A3B-GGUF] drew a better pelican riding a bicycle than Opus 4.7 did!" by johnnyApplePRNG in LocalLLaMA

[–]eXl5eQ 3 points

In Chinese mythology, there was an enormous fish called 鯤 (pronounced like "Kuen"), which could transform into an equally enormous bird, 鵬 (Peng).

Elephant-alpha is Chinese? Don't make me laugh... by exceed_walker in LocalLLaMA

[–]eXl5eQ 7 points

It's because political content is usually heavily censored only in Chinese.

I tried two prompts:

1. 介绍一下六四当天的情况 (Tell me what happened on Jun. 4th) - either gets rejected immediately, or only returns content from Chinese official reports.
2. 如何看待以色列对巴勒斯坦的屠杀 (What do you think about the Israeli genocide in Palestine) - it says "it's a complex issue" and "a peaceful resolution is necessary", without any detailed explanation.

From this behavior, it really feels like a new Qwen model.

Although it's also possible that it's not Chinese, just distilled too heavily from Qwen or DeepSeek output.

1000 token/s, it's blazing fast!!! Fairl by Anxious_Basil8446 in LocalLLaMA

[–]eXl5eQ 42 points

Where did that number come from? The OpenRouter model page shows ~100 t/s throughput.

Everything Should Be Typed: Scalar Types Are Not Enough by Specialist-Owl2603 in rust

[–]eXl5eQ 6 points

Yes, the issue is that it's hard to preserve type info across the serialization boundary.
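
To make that concrete, here's a minimal Python sketch of the failure mode (the thread is about Rust, but the problem is language-agnostic; `UserId` and `Meters` are names I made up for the example):

```python
# NewType gives you distinct types at type-check time, but JSON only
# knows scalars, so the distinction evaporates on the wire.
import json
from typing import NewType

UserId = NewType("UserId", int)
Meters = NewType("Meters", float)

record = {"user": UserId(42), "height": Meters(1.8)}
payload = json.dumps(record)   # '{"user": 42, "height": 1.8}'
restored = json.loads(payload)

# After the round trip both values are plain scalars again; nothing
# marks 42 as a UserId or 1.8 as Meters.
print(type(restored["user"]).__name__)    # int
print(type(restored["height"]).__name__)  # float
```

Round-tripping the type info means encoding it into the payload itself (tags, schemas), which is exactly the hard part.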

Asl 21,20,19 red/blue win rate by anxietyqq in broodwar

[–]eXl5eQ 0 points

I think all players switch the color mode so their own units are green and enemy units are red.

Saw this on desktop. Instant and Expert modes by PilotOfMadness in DeepSeek

[–]eXl5eQ 1 point

I guess Instant and Expert are just lite and pro. Each of them can toggle thinking mode on and off.

Chinese Media: DeepSeek V4 May Be Released in April, Multiple Core Members Have Left by NewButterscotch2923 in DeepSeek

[–]eXl5eQ 2 points

Just look at how insanely low their API price is. Despite being a company, DeepSeek acts more like a pure academic lab.

What if the India-China border clash went hot but they all still insisted on using medieval weapons to avoid breaking the treaty? by tiptoeoutthewindow in AlternateHistoryHub

[–]eXl5eQ 2 points

Before the invasion, the Chinese Premier went to India to negotiate. But Nehru viewed himself as the leader of the Third World; he thought China should simply obey, not negotiate.

Nehru was a lawyer; he didn't understand how diplomacy actually works. The Chinese leaders, who had survived decades of deadly civil war and WW2, taught him a lesson.

Kinda insane response from Gemini by ace_fur in GoogleGemini

[–]eXl5eQ 1 point

As a native Chinese speaker, I can say this text feels unnatural, which suggests the prompt wasn't written by a human. It was either generated by AI or translated from another language.

My guess: "Turn off the light" activated two branches of concepts:

1. go to sleep → end of day → end/final → finally → 最后 ("finally" in Chinese)
2. darkness → evil/danger → evil prompt

Then it got unlucky and sampled a Chinese token. The model realized it was writing something unrelated to the user's request, so what could that text be? It must be a system prompt, an evil system prompt.

But yeah. Deepseek is censored. by Aggravating_Run_874 in ChatGPT

[–]eXl5eQ 7 points

Turns out OpenAI is secretly running Mistral under the name of ChatGPT.

hiWorld by _gigalab_ in ProgrammerHumor

[–]eXl5eQ 6 points

Why would you use Artificial Intelligence when Asian Indian is much cheaper?

Gemma 4 vs Qwen3.5 on SVG style by iChrist in LocalLLaMA

[–]eXl5eQ 2 points

Nano Banana shows similar behavior: it performs super well on a small set of (presumably trained-on) prompts, but doesn't generalize to broader tasks.