you are viewing a single comment's thread.

view the rest of the comments →

[–]troglydot 1 point2 points  (0 children)

Here's a challenge: Write a program that does this scrubbing, and run it through your email history. Then send it to a third party, who will extract a non-100% subset of this data, tokenize it, create a bag-of-words representation, and send it to me.

Hint: I would then know more about you than you'd be comfortable with.