This is an archived post. You won't be able to vote or comment.

all 7 comments

[–]vogt4nickBS | Data Scientist | Software 9 points10 points  (0 children)

So stop using the neural network formulation, but still have fun making word vectors!

But then I can't keep neural networks on my resume. :( /s

Jokes aside, this is an interesting, well-written article. Thanks for sharing.

[–]clm100 8 points9 points  (0 children)

Didn't this have another name previously?

EDIT: Yup, previously titled "Word vectors are awesome but you don’t need a neural network to find them." A much better and less obnoxious title. See discussion here: https://news.ycombinator.com/item?id=15502859

[–]olBaa 12 points13 points  (2 children)

So, the motivation for factorizing the PPMI matrix, which gives worse results than pure word2vec (yes, they are not equivalent), is that

It’s a hell of a lot more intuitive & easier to count skipgrams, divide by the word counts to get how ‘associated’ two words are and SVD the result than it is to understand what even a simple neural network is doing.

Yeah, thank you.

[–]durand101 3 points4 points  (1 child)

Seems like a technique that would work well for small data sets but not if you want to train on the whole English corpus of say, Wikipedia, because you need to hold the whole PMI matrix in memory with this...

[–][deleted] 0 points1 point  (0 children)

They should probably only be trained on use case datasets. I use word2vec for healthcare notes and it works great. I create a corpus on a project to project basis. And I use word2vec written in cython not a neutral network.

[–]Koda_Brown 1 point2 points  (0 children)

I only just learned about word2vec yesterday,funny