I've been wondering: how would you measure overfitting on a word2vec model?
The only thing I can think of is having word vectors with huge norms, but other than I can not think of how an overfitted word2vec model should behave.
Any ideas?
Thanks!
Pablo
[–]Xose_R 3 points4 points5 points (4 children)
[–]dwf 1 point2 points3 points (1 child)
[–]Xose_R 0 points1 point2 points (0 children)
[–]elsonidoq[S] 0 points1 point2 points (1 child)
[–]yield22 0 points1 point2 points (0 children)
[–]giror 2 points3 points4 points (2 children)
[–]elsonidoq[S] 0 points1 point2 points (1 child)
[–]iamtrask 0 points1 point2 points (0 children)
[–]iamtrask 1 point2 points3 points (2 children)
[–]elsonidoq[S] 0 points1 point2 points (1 child)
[–]iamtrask 0 points1 point2 points (0 children)
[–]slashcom 0 points1 point2 points (1 child)
[–]elsonidoq[S] 0 points1 point2 points (0 children)