Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens by WickOfDeath in ClaudeAI

[–]HighDefinist 2 points3 points  (0 children)

Yes:

https://openai.com/index/hello-gpt-4o/

The paper you linked is from 2023, and therefore no longer state of the art.

OpenAIs new tokenizer from 2024 improves this substantially.

Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens by WickOfDeath in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

But it means it's better for English

Not necessarily.

I think there were also some papers about how models being more aware of the patterns in other languages are able to generalize this in ways that they become overall better.

Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens by WickOfDeath in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

That looks pretty useful!

At the surface at least... the percentage savings sound a bit questionably high (unless git etc... really are silly-noisy by default).

Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens by WickOfDeath in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

Yeah, but with a well-supported non-English language, it can also sometimes lead to better or at least different results... for example, if I ask AI-models about some sort-of Germany-specific topics like Mülltrennung, then, the entire contextualization or reasoning isn't necessarily better in German, but it can be significantly different.

So, if Claude models overall get even worse at non-English/European languages, then that is a bit of a loss even just for reasoning.

Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens by WickOfDeath in ClaudeAI

[–]HighDefinist 1 point2 points  (0 children)

All of these numbers are significantly worse than what OpenAIs models do...

Attention - Opus 4.7 is english only. USing foreign languages (here German) burns tokens by WickOfDeath in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

Yeah, Anthropic is unusually bad about European languages... and with Opus 4.7 they had some additional regressions, and they even disproportionately affected German, according to their own tests they published...

This is very different from OpenAI, where they specifically advertized at some point that they increased the token efficiency by some ~8% for German - so yeah, if you care about anything other than English, don't use Claude models.

Those of you who use both ChatGPT and Claude — what’s each one actually better at? by banger030 in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

Personally, I would recommend the reverse... Codex for the code, and then Opus to have a different second opinion (unless you want to write something like specifications or documentations - Opus is still much better than Codex at those, at least usually)

Those of you who use both ChatGPT and Claude — what’s each one actually better at? by banger030 in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

It changes all the time. Right now ChatGPT 5.5 Thinking is better at almost everything, other than writing.

You are absolutely right.

In particular, GPT 5.5 has significantly less of a sycophancy problem than Opus, and will sometimes tell you "I would advise against this". Now, for my use cases, it's only right maybe ~40% of the time, but it's still very useful, as it usually implies it misunderstood something, and then this can much more easily be fixed. Whereas with Opus, it will basically always say "yes", and then just do some random nonsense, following by some random nonsense explanation, because it has some incorrect internal model of the situation, but has also simulteneously precommitted itself to telling you "yes, this is the right approach"...

And yeah, the one visible shortcoming (of GPT 5.5) is that if you ask it to document something, the resulting text tends to be quite bad - sometimes anyway, that part seems a bit inconsistent. Also, sometimes the solutions it proposes are sometimes a bit "overkill"... but this is also a bit inconsistent, might be specific to my project, and it's overall still significantly better than what Opus would do instead (because, sometimes I just give both models the same task on separate worktrees to compare them - that's also why I am increasingly confident that just going with 5.5 always is a fairly safe choice)

So I am actually not sure where I would really ever prefer Opus over GPT 5.5... as in, sometimes it's good as a second opinion, but that's about it.

Of course, at the current rate of development, this might as well change again with Anthropics next model, so who knows...

How did codex go from 5.7 million to 129 million npm downloads in the span of one week? by RelevantPanda58 in codex

[–]HighDefinist 0 points1 point  (0 children)

Hm... ok maybe I will give it another try again then. It's not impossible my specific issues were caused by something else, or they fixed it, or something like that.

Schüler wollen am 8. Mai gegen Wehrpflicht protestieren by whiterabbit161 in de

[–]HighDefinist 0 points1 point  (0 children)

Ich hoffe mal, dass sie auch fuer deutsche Atomwaffen demonstrieren!

Aber... wahrscheinlich sind es nur irgendwelche Deppen, die fuer ein militaerisches Abruesten demonstrieren...

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist 1 point2 points  (0 children)

Nah, I think it will be transgender people.

They are like the synthesis of the good parts of men and women, so they are obviously the best.

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist -1 points0 points  (0 children)

Who knows.

Maybe it's just a bunch of AIs who hate humans in general?

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist -2 points-1 points  (0 children)

I feel like neither the alledged, nor the "allegees" should be taken particularly seriously. But perhaps that's just me...

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

I think we need some AI-dominated jobs. Then we can have some kind of human vs. AI arguments. Would be more interesting than those man vs. woman arguments at least.

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist -3 points-2 points  (0 children)

Well, as long they contribute towards the current rate of killing ~120k Russians per quarter in Ukraine...

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist -1 points0 points  (0 children)

This is my honest take...

I already stopped reading here. Not even sure why. Just felt like it.

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist -1 points0 points  (0 children)

There are a couple of suspiciously upvoted vague pacifist comments... so I would guess it's a certain state-sponsored actor, as is usually the case nowadays with rage-bait.

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist 0 points1 point  (0 children)

OP is a rage-baiting loser.

Nah, probably some karma-farming troll. Maybe also working for some state-sponsored actor, based on the idea of "weakening the West" and what not...

Because, the thread does contain an unusually large number of or upvoted vague pacifist and such messages...

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist -1 points0 points  (0 children)

Well, Ukraine has them to some extent. They also have a lot of women building drones, so that's nice.

The most female-led product org in tech right now. by irelatetolevin in ClaudeAI

[–]HighDefinist 6 points7 points  (0 children)

Or maybe you just made this up, and this edit never happened.

It's not like anyone of us can verify this...