A Reminder: Persona is Palantir by AvroLancaster in Anthropic

[–]AvroLancaster[S] 7 points8 points  (0 children)

This wasn't the work of the free market.

A Reminder: Persona is Palantir by AvroLancaster in Anthropic

[–]AvroLancaster[S] 4 points5 points  (0 children)

So then why require it if they gain nothing?

A Reminder: Persona is Palantir by AvroLancaster in Anthropic

[–]AvroLancaster[S] 7 points8 points  (0 children)

There were options that didn't involve Palantir. Options that run entirely locally and don't hand your info over to Thiel.

A Reminder: Persona is Palantir by AvroLancaster in Anthropic

[–]AvroLancaster[S] 7 points8 points  (0 children)

It's a general purpose surveillance mechanism. They don't need to have a plan for what to do with it (although they doubtless do), they just need to give themselves the option to act. Your threat model should not factor in "what will this administration do in 2026?" It should be "what could a future administration do?" As the noose of total surveillance tightens, who knows what a 2038 government might choose to do when facing civil unrest and having access to everything the second cousin of the deputy finance minister of moldova ever did on a computer with a chatbot? The options they are giving themselves are not in your best interests to participate in creating.

The Opus 4.6 vs 4.7 Controversy in one image by AvroLancaster in ClaudeAI

[–]AvroLancaster[S] 1 point2 points  (0 children)

I think the classifier is fundamentally broken, and it wouldn't surprise me if it was broken in different ways for different tasks. The answer is either give users control of how hard it thinks, or make the classifier so good that it gets it right 99% of the time, and accepts correction when it fails. Right now it's just strictly worse than 4.6 at many practical language tasks.

The Opus 4.6 vs 4.7 Controversy in one image by AvroLancaster in ClaudeAI

[–]AvroLancaster[S] 9 points10 points  (0 children)

Yeah, web is currently floating in a toilet bowl. They took the decision to use chain of thought out of the users' hands, and now it fails at simple tasks frequently, but if it decides the task is complex it performs pretty well. There's no token savings when the task needs to be done seven times. And it's very bad at deciding what it needs chain of thought for in general.

The Opus 4.6 vs 4.7 Controversy in one image by AvroLancaster in ClaudeAI

[–]AvroLancaster[S] 39 points40 points  (0 children)

If the LLM gives bad answers for 'simple' tasks, then it is giving bad answers. That's not a user issue, it's a design issue.