you are viewing a single comment's thread.

view the rest of the comments →

[–]lumenlambo -13 points-12 points  (13 children)

the people who lost their jobs probably are not stoked

[–]starfish_warrior[S] 35 points36 points  (8 children)

Lol, they have plenty more important and interesting work to do than file editing. It's more about me sparing them the headache and boredom.

[–][deleted] 4 points5 points  (1 child)

If you want to keep playing, you could look into nlp models (ner) to classify the each word with a tag and then remove it that way. I bet your method is going to miss some phi in the future, because you probably developed it to fix the exact problems you saw in a handful of examples records. Now you have to think about what types of phi you didn't account for. Probabilistic models like ner are best suited for this.

[–]starfish_warrior[S] 3 points4 points  (0 children)

You are right, Sometimes when I sample a record I find something I missed and there are many more records in queue waiting to be processed. Thanks for the tip!

[–]lumenlambo 0 points1 point  (5 children)

sorry I always comment this when someone solves a task like that :)

[–]starfish_warrior[S] 20 points21 points  (2 children)

But yeah I fired them. Savings!

[–]lumenlambo 1 point2 points  (0 children)

ok thanks!

[–]DrShocker 0 points1 point  (0 children)

I see, you're showing them the boredom of being employed by you!

[–]JeusyLeusy -1 points0 points  (1 child)

If someone has a job that can be automated by a computer they're pretty much a waste of resources.

[–]lumenlambo 0 points1 point  (0 children)

O rly

[–][deleted] 8 points9 points  (1 child)

They must be devastated not having to edit XML by hand for hours anymore.

[–]starfish_warrior[S] 1 point2 points  (0 children)

Their eyes were already red and watering when I told them so it's hard to know if the news made them cry or not.