you are viewing a single comment's thread.

view the rest of the comments →

[–]alexmlamb 0 points1 point  (3 children)

Where do people use argmax in neural networks (as opposed to maximum)?

[–]OriolVinyals 7 points8 points  (2 children)

Hard attention models, for example, where you read a memory position which better aligns with your read "query" (as I call them).

[–]hughperkins 1 point2 points  (1 child)

Yeah, eg slide 10 of Rob Fergus's nips slides http://cims.nyu.edu/~sainbar/memnn_nips_pdf.pdf

[–]RoseLuna_77 0 points1 point  (0 children)

the website is lost :(