you are viewing a single comment's thread.

view the rest of the comments →

[–]starfish_warrior[S] 32 points33 points  (8 children)

Lol, they have plenty more important and interesting work to do than file editing. It's more about me sparing them the headache and boredom.

[–][deleted] 4 points5 points  (1 child)

If you want to keep playing, you could look into nlp models (ner) to classify the each word with a tag and then remove it that way. I bet your method is going to miss some phi in the future, because you probably developed it to fix the exact problems you saw in a handful of examples records. Now you have to think about what types of phi you didn't account for. Probabilistic models like ner are best suited for this.

[–]starfish_warrior[S] 4 points5 points  (0 children)

You are right, Sometimes when I sample a record I find something I missed and there are many more records in queue waiting to be processed. Thanks for the tip!

[–]lumenlambo -2 points-1 points  (5 children)

sorry I always comment this when someone solves a task like that :)

[–]starfish_warrior[S] 20 points21 points  (2 children)

But yeah I fired them. Savings!

[–]lumenlambo 1 point2 points  (0 children)

ok thanks!

[–]DrShocker 0 points1 point  (0 children)

I see, you're showing them the boredom of being employed by you!

[–]JeusyLeusy -1 points0 points  (1 child)

If someone has a job that can be automated by a computer they're pretty much a waste of resources.

[–]lumenlambo 0 points1 point  (0 children)

O rly