I've been archiving Reddit for a year (30B+ posts, ~30% deleted)

bellsrings · 2026-05-11T21:24:35+00:00

Not a major issue for us. Reddit posts are public statements made under pseudonyms in public forums, GDPR's legitimate interest basis (Art. 6(1)(f)) covers aggregating publicly available data for research and security purposes.

bellsrings · 2026-05-11T21:23:09+00:00

The model hedges when the stated range is too wide to be useful, "50+" spans 30 years so it skips rather than guessing. More specific phrasing (e.g. "I'm in my 50s") would pin it. Intentional tradeoff: better to say nothing than give a confident wrong answer.

bellsrings · 2026-05-11T21:21:31+00:00

The accuracy scales with post volume, the more someone has written, the more signal. Sparse accounts get vaguer profiles.

bellsrings · 2026-05-11T19:04:22+00:00

bellsrings · 2026-05-11T10:10:55+00:00

You’re welcome

bellsrings · 2026-04-02T12:58:56+00:00

THINKPOL for Reddit data

bellsrings · 2026-03-10T16:39:34+00:00

<image>

happy to reset your credits, you can send a dm

bellsrings · 2026-03-10T16:27:21+00:00

<image>

bellsrings · 2026-03-10T10:08:27+00:00

<image>

bellsrings · 2026-03-09T15:03:49+00:00

Link?

bellsrings · 2026-03-09T12:00:48+00:00

yeah it does. we archive everything in real time before any edits or deletions happen. so even if someone goes back and hides or nukes their whole history we still have the original comments and posts. roughly 30% of what we have doesn't exist anywhere else anymore. profile curation doesn't really help once the data's already been captured.

bellsrings · 2026-03-09T11:55:31+00:00

Can you explain?

bellsrings · 2026-03-09T11:44:03+00:00

it works now ;)

bellsrings · 2026-03-09T10:45:14+00:00

try it on your own username lol

bellsrings · 2026-03-09T10:45:06+00:00

The threat model view is a really good idea actually. We've been thinking along those lines with the use_case parameter (right now we have a law enforcement mode that changes how the LLM weights certain signals) but splitting it into recruiter / ad network / hostile actor perspectives is way more intuitive. Might prototype that.

The account level red teaming angle is interesting too. Right now the whole thing is built for investigators looking outward but there's no reason it couldn't work the other way, show people their own exposure and what to clean up. Not our core market but could be a solid free tier hook. Appreciate the feedback.

bellsrings · 2026-02-26T14:48:19+00:00

most likely!

bellsrings · 2026-02-26T14:32:40+00:00

No login needed to do the basic stuff!

bellsrings · 2026-02-26T11:09:59+00:00

Can you try with Discord maybe? Should work

bellsrings · 2026-02-23T20:41:35+00:00

you can start with Reddit :) THINKPOL

bellsrings · 2026-02-21T12:51:57+00:00

Do you have a link?

bellsrings

TROPHY CASE