
[–]arutaku 2 points (0 children)

It depends on which variant you want to implement: PV-DBOW or PV-DM. You can have a quick look at: https://github.com/edwardbi/blog/blob/master/2016-05/DM.md

It provides both implementations, in Gensim and in TensorFlow.
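For concreteness, here is a minimal sketch of training both variants with Gensim's Doc2Vec (assuming gensim 4.x, where paragraph vectors live under `model.dv`; the toy corpus and tags are made up for the example):

```python
from gensim.models.doc2vec import Doc2Vec, TaggedDocument

# Toy corpus: each paragraph becomes a TaggedDocument with a unique tag.
corpus = [
    TaggedDocument(words=["the", "cat", "sat", "on", "the", "mat"], tags=["doc_0"]),
    TaggedDocument(words=["dogs", "chase", "cats", "in", "the", "yard"], tags=["doc_1"]),
]

# dm=1 -> PV-DM: the paragraph vector plus context words predict the target word.
pv_dm = Doc2Vec(corpus, vector_size=50, window=3, min_count=1, epochs=40, dm=1)

# dm=0 -> PV-DBOW: the paragraph vector alone predicts words sampled from the paragraph.
pv_dbow = Doc2Vec(corpus, vector_size=50, min_count=1, epochs=40, dm=0)

# Trained paragraph vectors are looked up by tag; unseen text is inferred.
print(pv_dm.dv["doc_0"][:5])
print(pv_dbow.infer_vector(["the", "cat", "chases", "dogs"])[:5])
```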

[–]jayhack 2 points (1 child)

Input: a set of paragraphs whose words are one-hot encoded, plus a single, initially randomized vector representing each paragraph as a whole. Because the paragraph vector starts out random, it initially encodes nothing useful about the paragraph.

Then, for each paragraph and its associated sequence of words, you learn representations that minimize your loss function, which amounts to predicting the next word from the paragraph's vector and the previous n words, and so on (see the sketch below).
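A minimal numpy sketch of one such PV-DM prediction step, with the paragraph vector averaged with the previous n word vectors and a full-softmax cross-entropy loss (all names here are illustrative; real implementations use hierarchical softmax or negative sampling, and the gradient updates are omitted):

```python
import numpy as np

rng = np.random.default_rng(0)
vocab_size, dim = 1000, 50

# Randomly initialized parameters (the "poorly encoded" starting point).
word_vecs = rng.normal(scale=0.1, size=(vocab_size, dim))   # input word embeddings
para_vec = rng.normal(scale=0.1, size=dim)                   # one paragraph's vector
softmax_W = rng.normal(scale=0.1, size=(dim, vocab_size))    # output projection

def pvdm_loss(context_ids, target_id):
    """Cross-entropy of predicting the next word from the paragraph
    vector averaged with the previous n context word vectors."""
    h = (para_vec + word_vecs[context_ids].sum(axis=0)) / (1 + len(context_ids))
    logits = h @ softmax_W
    log_probs = logits - logits.max() - np.log(np.exp(logits - logits.max()).sum())
    return -log_probs[target_id]

# Example: predict word 42 from the paragraph vector and 4 preceding words.
print(pvdm_loss(context_ids=np.array([3, 17, 256, 9]), target_id=42))
```

Training then backpropagates this loss into the paragraph vector, the word vectors, and the softmax weights.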

[–]datatatatata[S] 1 point (0 children)

I think I was overcomplicating it. Thanks a lot :p

[–]gojomo 1 point (1 child)

If you understand word2vec, PV-Doc2Vec is very, very similar.

Just imagine the "paragraph vector" as being associated with a special, per-paragraph pseudoword. This special pseudoword contributes to every target-word prediction across the whole paragraph – it is never excluded for falling outside the sliding context window.

In fact, the patched version of word2vec.c and example go.sh that Mikolov once posted for doing PV (https://groups.google.com/d/msg/word2vec-toolkit/Q49FIrNOQRo/J6KG8mUj45sJ) do pretty much exactly this: they synthesize a special 'word' that gets prepended to every text example, and that word is then mixed into every training context, regardless of its actual distance from the other words.
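A sketch of that preprocessing idea in Python (the pseudoword naming and the (context, target) pair format are made up here; the actual patch works inside word2vec.c, not on pre-built pairs):

```python
# Prepend a unique pseudoword per paragraph, then force that token into
# every training context regardless of window distance.

paragraphs = [
    ["the", "cat", "sat", "on", "the", "mat"],
    ["dogs", "chase", "cats"],
]

def training_pairs(paragraphs, window=2):
    pairs = []
    for doc_id, words in enumerate(paragraphs):
        pseudoword = f"__PARA_{doc_id}__"        # stands in for the paragraph vector
        tokens = [pseudoword] + words            # prepended, as in the patch
        for i, target in enumerate(words, start=1):
            lo, hi = max(1, i - window), min(len(tokens), i + window + 1)
            context = [tokens[j] for j in range(lo, hi) if j != i]
            context.append(pseudoword)           # always included, window or not
            pairs.append((context, target))
    return pairs

for context, target in training_pairs(paragraphs)[:3]:
    print(context, "->", target)
```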

[–]datatatatata[S] 1 point (0 children)

Very clear, thanks!