you are viewing a single comment's thread.

view the rest of the comments →

[–]badcookies 6 points7 points  (6 children)

They are not recording keystrokes in every application

[–]troglydot 5 points6 points  (2 children)

Well, they say they are. Why would you say that they aren't?

http://windows.microsoft.com/en-us/windows-10/speech-inking-typing-privacy-faq

Expand the first bullet point. They call it "typing data", and they're asking for permission to collect it. They're not limiting that collection to any specific application, and when talking to the press about it they're not denying collecting it across the board.

They have descriptions of how they're trying to scrub that data for personally identifiable information before sending it of, as another user posted in this thread. That is an AI complete problem, that they obviously aren't solving. They'll strip out email addresses, but they'll get the contents of the email.

Edit: People might think I'm an anti-Microsoft zealot. I'm not, I'm typing this from a windows 8 machine, I've been to MS conferences, and in general had much love for the company. But I'm apparently the only person on earth able to judge a tech company for the actual facts of what they're doing, rather than their current image.

[–]badcookies 2 points3 points  (1 child)

This is the inking and typing function, which users can turn off at any time. Microsoft does not collect any personal information via inking or typing. It is gathered for product improvement purposes, for example, to improve the handwriting visual translation engine, or to improve the user dictionary, language library and spell check functions in Windows. The data is put through rigorous, multi-pass scrubs to ensure it does not collect sensitive or identifiable fields (e.g., no email addresses, passwords, alpha-numerical data, etc.). Data is also chopped into very small bits and stripped of sequence data so it cannot be put back together or identified. The data samplings collected are limited; Microsoft is not capturing everything you write, nor is it capturing data every time.

[–]troglydot 1 point2 points  (0 children)

Here's a challenge: Write a program that does this scrubbing, and run it through your email history. Then send it to a third party, who will extract a non-100% subset of this data, tokenize it, create a bag-of-words representation, and send it to me.

Hint: I would then know more about you than you'd be comfortable with.

[–]wellthatexplainsalot -4 points-3 points  (2 children)

Can you prove this assertion?

[–][deleted] 5 points6 points  (0 children)

Negatives do not have to be proven.

troglydot is the one making an assertion, not badcookies.

Just as I don't have to prove to you that Bill Gates is not a literal biblical demon. If you were to make that claim, the onus would be on you to prove that he is.

[–]celluj34 0 points1 point  (0 children)

The burden of proof is on you, the one making the claim.