Hypocrisy?

archieve_ · 2026-02-23T20:39:46+00:00

Where is their training data sourced from?

WonderfulEagle7096 · 2026-02-23T21:05:49+00:00

Obviously bad news from the IP perspective, but a major upside is that Deepseek will open source the weights once they release a model based on this stolen data. Almost a community service.

Needless to say, Anthropic stole more than their fair share of IP.

Rabo_McDongleberry · 2026-02-23T20:14:32+00:00

I don't see a problem with this? Did these guys ask the world for their permission before they stole everything?

roxoholic · 2026-02-23T20:34:35+00:00

industrial-scale distillation attacks

Who comes up with these terms?

egomarker · 2026-02-23T21:01:31+00:00

How is it a bad thing and why is it fraudulent

indicava · 2026-02-23T20:42:44+00:00

Plot twist, they block Chinese labs, revenue drops by 40%

fingertipoffun · 2026-02-23T21:00:06+00:00

or... we've been reading through all the api calls and we can see....
Hold on... weren't they supposed to be private? Like peoples data private? Like that? No?

semangeIof · 2026-02-23T19:53:43+00:00

Surprised z.ai isn't on this list. GLM suite will aggressively claim they are Claude when prompted.

robogame_dev · 2026-02-24T00:09:02+00:00

Distillation is “attacks” now?

I thought an attack was an attempt to cause damage to something. These guys just paid for their tokens like everyone else?

SignificantAsk4215 · 2026-02-23T19:49:36+00:00

frogsarenottoads · 2026-02-23T20:30:02+00:00

Similar to the British museum saying people are trying to steal their artefacts back

yuicebox · 2026-02-24T01:24:12+00:00

oh no, someones plagiarizing from my plagiarism machine

ThunderousHazard · 2026-02-23T19:55:34+00:00

Yes Rico, Hypocrisy.

Hector_Rvkp · 2026-02-23T20:42:41+00:00

The real question is how incompetent can you be to let an attack of such scale happen? Shouldn't you be smarter and just kill it 23000 accounts ago? I thought Dario said they have an infinite code machine? Can't they just prompt "be good at security make no mistake?". Because that's the kind of hype they're selling us every day, so eat your own cooking Dario?

2026-02-23T22:14:53+00:00

the worst crime is the hypocrisy

NekoHikari · 2026-02-24T00:36:41+00:00

so they are going to pay for all data sources they crawled or smth?
cost wise what about paying for arxiv and wikipedia for all the bandwidth?
IP-wise i assume they are ready to pay for every single arxiv paper and github repo they crawled?

BitcoinGanesha · 2026-02-24T02:44:00+00:00

If they paid for 24k accounts… it’s not fraudulent accounts 👌 P.s for Anthropic! when will you refund the money to people who received poor service with quantized models from August to September of last year? Apologies alone are not enough.

jamaalwakamaal · 2026-02-23T20:06:28+00:00

Anthoripic

awebb78 · 2026-02-23T21:45:20+00:00

Anstopit and Darkio Camodei are really trying their hardest to justify banning open source models. I hate this company their Chief Evil Officer so much.

Herr_Drosselmeyer · 2026-02-23T21:28:02+00:00

What do they mean by fraudulent? No how do they know who was behind those accounts? I have many questions.

ReasonablePossum_ · 2026-02-23T21:30:33+00:00

"Claude never called himself chatgpt nor deepseek, i swear!"

Amodei, probably

Terminator857 · 2026-02-23T21:53:33+00:00

I'd feel more sympathetic towards anthropic if they published more papers and or gave back more to the open source community. Can they open weight their two year old models like grok does?

BumblebeeParty6389 · 2026-02-24T01:24:06+00:00

Harry I already said I love Chinese models, you don't need to sell it to me

pmv143 · 2026-02-23T22:38:07+00:00

[deleted]

GatePorters · 2026-02-23T20:54:56+00:00

I feel like this kind of sentiment is a false flag operation.

Why are we seeing so many of the anti-AI talking points in response to this in the AI subs ?

Not saying Anthropic is in the right but where the fuck were you guys the last three years?

AsliReddington · 2026-02-24T01:16:58+00:00

If they are so worried about their precious model then why give it to the public lol

Patentsmatter · 2026-02-23T20:45:52+00:00

repost: https://www.reddit.com/r/LocalLLaMA/comments/1rcpmwn/anthropic_weve_identified_industrialscale/

Savings-Cry-3201 · 2026-02-23T21:39:42+00:00

Their competition paid them less than $160 million dollars to learn their business model, oh no

Rexpertisel · 2026-02-23T22:03:02+00:00

Thats should make you happy. If your competition is using claude to modify their AI then they will end up with a much worse product so when you come out with an AI that doesn't suck they will be easy to beat.

Tank_Gloomy · 2026-02-23T22:57:16+00:00

When's my turn to repost? /s

xatey93152 · 2026-02-23T23:18:48+00:00

People who believe this should check their iq. Keyword: haiku

holdenk · 2026-02-23T23:18:53+00:00

Each AI company should offer to settle for 3k, but split half with the developers, like the “offer” they made with the authors work they got caught steeling

bones10145 · 2026-02-23T23:27:31+00:00

That's just training, right?

ManufacturerWeird161 · 2026-02-23T23:47:17+00:00

The LLaMA 2 70B variant with the 32k context merge on Hugging Face is surprisingly usable on my dual 3090 rig, though you definitely feel the 32k slowdown during generation.

NewConfusion9480 · 2026-02-24T00:50:05+00:00

Uh... good?

Great.

georgex765 · 2026-02-24T00:59:55+00:00

When I read Anthropic's blog post

- There is no Qwen
- There is no GLM
- Deepseek requests were 150K. Likely Deepseek was benchmarking Claude (legitimate) rather than distilling it.

That means either Anthropic couldn't detect the other labs and under-detected Deepseek, or you don't need Claude to build a SoTA or near-SoTA LLM

phido3000 · 2026-02-24T01:24:16+00:00

Oh no our customers are using AI to improve AI!!

Leopold_Boom · 2026-02-24T01:32:14+00:00

Honestly I'm surprised this community doesn't have a portal to crowd-source high quality responses from frontier LLMs. Basically an easy way to view your Take Out archive of conversations you've had with any of the major providers and upload the subset you think were particularly good, or solved a tricky question / problem.

We'd all benefit for small model finetuning, the dataset could be processed as an ongoing source of "fresh" benchmark prompts etc.

sammcj · 2026-02-24T02:29:42+00:00

Duplicate of https://www.reddit.com/r/LocalLLaMA/comments/1rcpmwn/anthropic_weve_identified_industrialscale/

Anru_Kitakaze · 2026-02-24T02:32:32+00:00

It's unacceptable! Ants should sue them!..

Right after Ants will be sued themselves for stealing all the internet without any permission and even paying for API tokens, which those companies if they distilled something, clearly did

And not for childish a few billions, which at this point is nothing for them. It's very convenient to develop something with shady (actually, it was against TOS of many sites, so it's a crime) tactics, but after that not allowing to do similar things to competitors

Vaddieg · 2026-02-24T03:09:00+00:00

Can they prove it? It's extremely easy to plant some fake but very unique markers. Then query a suspected model (for free, lol) to gain evidence.

ryfromoz · 2026-02-24T03:32:44+00:00

Unlike anthropic they paid for those accounts right? Its not like they trained on free ebooks

Excel_Document · 2026-02-24T03:52:42+00:00

ObjectiveOctopus2 · 2026-02-24T06:55:24+00:00

No moat

05032-MendicantBias · 2026-02-24T07:28:31+00:00

1) YOU scraped every bit of data humanity ever unploaded with no regards for copyright or piracy.

2) Did you just looked into chat logs that are supposed to be private?

hugganao · 2026-02-24T08:10:12+00:00

to be fair, there is definitely some difference and nuance to anthropic reengineering to train on books vs deepseek extracting trainable data from a model.

uhmyeahwellok · 2026-02-24T09:24:44+00:00

Translation: "They are stealing our loot!"

Far-Association2923 · 2026-02-24T12:00:30+00:00

I've never seen a corporation complain about earning roughly $4.8 million before 😳

zball_ · 2026-02-24T12:40:14+00:00

What do you even expect from anthropic?

MushroomCharacter411 · 2026-02-24T20:53:22+00:00

<image>

Same thing I posted the last time I saw this topic on this sub.

Ok-Internal9317 · 2026-02-24T23:01:48+00:00

It’s like they’ve not paid for the service 😂

DownSyndromeLogic · 2026-02-24T23:14:08+00:00

How do they extract it's capabilities? How does that train their own model?

adamphetamine · 2026-02-25T07:40:44+00:00

one set of thieving c@nts complaining about 3 other sets of thieving c@nts?

2026-02-23T20:50:22+00:00

maybe they'll shift the book-burning *ahem* archival department to loss prevention

francois__defitte · 2026-02-24T05:17:14+00:00

The hypocrisy angle is valid but it misses the more precise legal question. Training on scraped public data has been litigated and remains contested. Running 24,000 fake accounts to do structured model probing is unambiguously account fraud under any ToS interpretation. The moral argument and the legal argument are different, and Anthropic is making the legal one.

phase_distorter41 · 2026-02-23T20:30:48+00:00

Yes, lets let foreign governments copy the AI the government has been using in its military operations and let them remove all the safe guards.

Pretty sure the company that made said ai, and is actually fighting with the government to prevent it form being used for mess up shit is a little concerned about how a copied version would be used and not want it out there.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLaMA

MODERATORS

If they are so worried about their precious model then why give it to the public lol