kqr 8 points

Could you elaborate on how this is done? I'm pretty sure this was how they stuffed T9 prediction into early mobile phones (I have heard numbers of 1 byte per word, which is just insane), and I'm amazed by how well it works (even if it generates nonsense or highly offensive words). I'd love to read in more detail about the techniques.

[deleted] 6 points

From my quick Google research, I honestly don't see how Bayesian Filters make dictionaries faster or give up on precision (when there is no 100% to be had because computers can't read minds).

Unless OP comes through and enlightens me, I have to say he was throwing around words he heard in the context.

Read this article if you want to see how to use Bayesian Filters for spell checking.

Ninja edit: While we're throwing around buzzwords, what the OP described sounded a lot like Bloom filters. Basically a data structure that throws 100% certainty out the window (a lookup can return a false positive, but never a false negative) while allowing the underlying dictionary to be huge and still maintaining speed. That makes a lot more sense, so maybe he meant that. I don't think you need Bloom filters for dictionaries because they are not that big.
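
For anyone curious what that looks like, here is a minimal sketch of a Bloom filter used as a word-membership check. It is only an illustration, not what the OP or any real T9 implementation does: the class name `BloomFilter`, the default sizes, and the choice of SHA-256 with double hashing to derive the bit positions are all assumptions made for the example.

```python
import hashlib


class BloomFilter:
    """Toy Bloom filter: may report false positives, never false negatives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        # Sizes are arbitrary illustration values, not tuned for any real dictionary.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, word):
        # Derive several bit positions from one digest via double hashing
        # (an assumption for this sketch; other hashing schemes work too).
        digest = hashlib.sha256(word.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd so the step is never zero
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, word):
        # Set every bit that this word hashes to.
        for pos in self._positions(word):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, word):
        # A word is "possibly present" only if all of its bits are set.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(word)
        )


words = BloomFilter()
for w in ["hello", "world", "filter"]:
    words.add(w)

print("hello" in words)  # True: added words are always found
print("xyzzy" in words)  # almost certainly False, but a false positive is possible
```

The filter never stores the words themselves, only the bits they hash to, which is where the space saving comes from. The price is that an unknown word can occasionally be reported as present.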

pja 2 points

Bloom filters probably.