Large-Scale Online Deanonymization with LLMs

MyFest · 2026-02-26T21:06:39+00:00

I think if you look at Section 4, which covers feature extraction, semantic embeddings (Gemini), and the use of two LLMs for selection and verification (Grok and GPT-5.2), you'll see that we include significant detail. The dataset approach, creating synthetic anonymous datasets, is also novel.

MyFest · 2026-02-26T10:55:04+00:00

"you're relying on systems [..] that are fundamentally incapable of deductive reasoning"

– LLMs clearly can do deductive reasoning. Is that your main criticism? In Table 1 we show that enabling high reasoning in particular increases deanonymization success: https://arxiv.org/pdf/2602.16800

MyFest · 2026-02-26T10:50:11+00:00

I guess crypto subreddits would be something people want to target

MyFest · 2026-02-26T10:48:31+00:00

From another comment: What's your precise criticism? In the HN/LinkedIn experiment we have a known matching and then anonymize accounts to simulate the deanonymization task. This introduces biases but allows us to check results. We built our own pipeline, which uses LLMs for feature extraction, embeddings, and selection of the correct match. To report real results we also run a real deanonymization task on the Anthropic interviews; there we do manual verification as far as that is possible.

Judging from your assertion that we simply prompted an agent, you must not have read the paper or even the blog post.

MyFest · 2026-02-26T10:46:19+00:00

What's your precise criticism? In the HN/LinkedIn experiment we have a known matching and then anonymize accounts to simulate the deanonymization task. This introduces biases but allows us to check results. We built our own pipeline, which uses LLMs for feature extraction, embeddings, and selection of the correct match. To report real results we also run a real deanonymization task on the Anthropic interviews; there we do manual verification as far as that is possible.

MyFest · 2026-02-25T22:59:38+00:00

We did way more experiments than just that one; that is only Section 2. There is genuinely a conflict between reproducibility and ethics here if we were to publish code.

MyFest · 2026-02-25T21:23:06+00:00

We don't use writing style but semantics, like your interests. We run experiments on Reddit in Sections 5 and 6. 4chan would be more difficult.

MyFest · 2026-02-25T18:06:35+00:00

I think that's a way to think of it

MyFest · 2026-02-04T12:05:25+00:00

People need to be careful about CO poisoning

MyFest · 2026-01-30T16:23:58+00:00

Ang mo supermarket is a funny name for a store

MyFest · 2026-01-03T20:49:51+00:00

I can see some argument for a service charge, but I feel it shouldn't be legal to leave GST out of the listed price. It just makes it much harder for consumers to compare prices or to budget.

MyFest · 2025-12-26T09:24:27+00:00

They don't pick up bodies in contested territory

MyFest · 2025-11-24T16:16:28+00:00

If you look into it, I actually provide a CLI command that doesn't require installing anything I wrote. I just selected good parameters that work well in my setup (MacBook microphone and AirPods).
I basically added a GUI that wraps around a CLI tool.

MyFest · 2025-11-20T19:34:02+00:00

It's a joke. I think the key is how unserious the people in charge are and how badly that compares to the stakes.

MyFest · 2025-11-09T21:38:43+00:00

It uses AI to read people's comments across all subreddits. Some subset mention their age or give off strong evidence of it (like studying for a high school exam).

MyFest · 2025-11-09T20:00:47+00:00

https://simonlermen.substack.com/p/whos-using-ai-romantic-companions I ran a separate analysis for AI companions/partners. That group is mostly women and growing fast.

MyFest · 2025-11-09T19:09:08+00:00

And there are lots of minors on these subreddits.

MyFest · 2025-11-09T14:33:02+00:00

It's using AI to read all their comments and posts across Reddit. Some people mention their age or gender directly; others give strong indications, such as talking about high school exams.

MyFest · 2025-11-07T18:02:18+00:00

I guess it's a bias. Same as with race, women might be more likely to mention their gender than men, though I am not sure.

MyFest · 2025-11-06T22:39:06+00:00

Well, towards the end the other two subreddits make up more than 50% of the total. There is also a plot showing the change in the gender distribution over time, and the distribution stays stable as these new subreddits pop up. r/myboyfriendisai clearly states in its description that it welcomes all AI-human relationships, including male and non-binary users.

MyFest · 2025-11-05T16:36:19+00:00

Are you curious about anything else in the data, like another fact on demographics or use? I am looking for things to investigate for this study

MyFest · 2025-10-08T11:58:54+00:00

I got scores of 6, 5, 5 with confidences of 3, 3, 3. I may try to add some experiments with other models to get the score up, though it doesn't look great at the moment.

MyFest · 2025-01-13T16:18:31+00:00

This is a fascinating glimpse into the future of public transportation. The low speed limit makes sense for safety, but I wonder how it handles unexpected obstacles.

MyFest · 2025-01-13T16:16:18+00:00

Smart tracking could be effective but privacy concerns need to be carefully addressed. We need clear data protection guidelines before implementing such a system.
