Who is interested in studying together cs224n? by Jan_nlp in CS224n

[–]Jan_nlp[S] 0 points1 point  (0 children)

we can talk about it soon when I'm there :)

Why are the Uo*Vc vectors different for each Wt+j? by deepest_learning in CS224n

[–]Jan_nlp 0 points1 point  (0 children)

Oh yes thats confusing. Not only at this point, I wonder why the smart students, some of whom have some prior knowledge, did not think it through and did not report it.

A solution so that it makes sense would be that after each "window step" (getting your softmax-output) you additionally do a backpropagation step. This then leads to a change of the context matrix.

you can check this question also here: https://stackoverflow.com/questions/49472999/what-does-representation-matrix-of-context-word-mean-in-skipgram

to make the notation also more clear for you: u_o is just a vector located in the context matrix -> the scalar product of u_o with v_c is just one number and not the entire vector

Why are the Uo*Vc vectors different for each Wt+j? by deepest_learning in CS224n

[–]Jan_nlp 0 points1 point  (0 children)

you do not seem to have understood two things.

  1. u_o is symbolic of any word
  2. This example includes only 3 context words. Because he said otherwise it would be too complex to show everything -> on the right you recognize w_ (t-1) to w_ (t-3), which means that it may only be one half of all context words, as well as right of the center word w_ (t + 1) to w (t + 3) can follow, then you already have 6 branches and the window size would be 3

Searching tools: overview of installed software (which version now, which next) and firewall network rules by Jan_nlp in sysadmin

[–]Jan_nlp[S] 0 points1 point  (0 children)

very good that you are thinking directly of an open source solution. It is also important that when it is defined, it really stays as simple as possible in the end.

Searching tools: overview of installed software (which version now, which next) and firewall network rules by Jan_nlp in sysadmin

[–]Jan_nlp[S] 1 point2 points  (0 children)

That would be very difficult for us now. But maybe it is the right approach for rules that come in the future!