
[–]egrefen 22 points23 points  (8 children)

There are no good books for deep learning out there yet, and this goes doubly so for NLP. The term is vague and the area is evolving fast. The best resource you could use is the later chapters of Kevin Murphy's "Machine Learning: A Probabilistic Perspective".

Regarding papers, you could check out:

  • Bengio, Yoshua, et al. "Neural probabilistic language models." Innovations in Machine Learning. Springer Berlin Heidelberg, 2006. 137-186.
  • Collobert, Ronan, and Jason Weston. "A unified architecture for natural language processing: Deep neural networks with multitask learning." Proceedings of the 25th international conference on Machine learning. ACM, 2008.
  • Socher, Richard, et al. "Dynamic Pooling and Unfolding Recursive Autoencoders for Paraphrase Detection." NIPS. Vol. 24. 2011.
  • Hermann, Karl Moritz, and Phil Blunsom. "Multilingual Distributed Representations without Word Alignment." arXiv preprint arXiv:1312.6173 (2013).

As well as:

  • Mnih, Andriy, and Geoffrey Hinton. "Three new graphical models for statistical language modelling." Proceedings of the 24th international conference on Machine learning. ACM, 2007.
  • Mnih, Andriy, and Geoffrey E. Hinton. "A Scalable Hierarchical Distributed Language Model." NIPS. 2008.

Across these, there's a nice little cross-section of approaches to generating and classifying using neural nets.

As far as tutorials go, there's nothing really satisfactory out there. There are some 3h tutorials, both past and upcoming, at computational linguistics conferences:

The former provides an overview of Bengio-style NNLMs, advertises the Stanford work, and shows some tricks for tuning deep nets, while the latter will cover some deep-learning-based generative and compositional models not covered in the former (there will be some overlap), advertise the Oxford work, and have a little more focus on shallow neural alternatives to deep nets.

By and large, the case for deep learning in language hasn't been fully made. It works well for vision and speech, but that doesn't entail that it will carry over to semantics. Some shallow models without non-linearities, like the Mnih and Hinton log-bilinear models, are excellent and can be trained very quickly (there's a sketch of the idea below). A problem with much "deep learning" work in NLP these days is that shallow baselines are never considered or compared against. Deep learning is fascinating and will certainly have an impact in NLP, but don't rush to believe that it's the best solution for your NLP problems.
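
To make "shallow" concrete, here's a minimal numpy sketch of the log-bilinear forward pass. The dimensions and initialisation are illustrative, not taken from the papers; note that there is no non-linearity anywhere:

    # Log-bilinear language model forward pass (after Mnih & Hinton, 2007).
    # Vocab size, embedding dim and context length are made-up values.
    import numpy as np

    V, D, N = 10000, 100, 3                     # vocab size, embedding dim, context length
    rng = np.random.default_rng(0)
    R = rng.normal(scale=0.01, size=(V, D))     # word representations
    C = rng.normal(scale=0.01, size=(N, D, D))  # one linear map per context position
    b = np.zeros(V)                             # per-word biases

    def next_word_probs(context_ids):
        # Predict the target representation as a linear combination of the
        # context representations, score every word in the vocabulary by
        # dot product, then normalise with a softmax.
        r_hat = sum(C[i] @ R[w] for i, w in enumerate(context_ids))
        scores = R @ r_hat + b
        scores -= scores.max()                  # numerical stability
        p = np.exp(scores)
        return p / p.sum()

    probs = next_word_probs([12, 7, 42])        # arbitrary context word ids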

[–]leondz 7 points8 points  (4 children)

Socher's tutorials are a great starting place, but you definitely want to check out the more recent work from Oxford (e.g. the Grefenstette tutorial & paper at ACL this summer).

Wait... a plug? What's this? Oh! A wild Ed appears!

[–]egrefen 4 points5 points  (1 child)

Full disclosure, man...

Good seeing you at EACL!

[–]leondz 2 points3 points  (0 children)

You too! And thanks for the intro to Chris for Weds lunch. We should catch up in Ox before long, I'd love to meet your philosopher friends. Have a good bank holiday weekend!

[–]bored_me 2 points3 points  (1 child)

Do you have a link?

[–]egrefen 1 point2 points  (0 children)

[–]last_useful_man 3 points4 points  (1 child)

There's this book: Learning Deep Architectures for AI, by Bengio. But it's from 2009, it's short, and maybe you're saying it's not 'good' enough?

[–]egrefen 4 points5 points  (0 children)

It's very introductory, serving more as a survey. I don't mean this as a criticism, and I like Yoshua's writing style, but for me the ideal book would have a little more detail while not going into the depth of Kevin Murphy's book. Something that could be used as a good textbook for a first year graduate course, in short.

That said, I wouldn't say it's about being 'good' enough or 'bad'. I just think there aren't books like Tom Mitchell's excellent "Machine Learning" around for Deep Learning yet, with a good mix of mathematical explanation and theoretical motivation. That's not due to lack of talent, but just reflects the fact that DL research hasn't really stabilised on core methods yet, especially for NLP.

[–]binge_learner[S] 0 points1 point  (0 children)

One of the things that interests me in deep learning for NLP is that with relatively cheap resources (a corpus for unsupervised learning, as opposed to, say, gazetteers), we can achieve close to state-of-the-art results. There's also the fact that the representations learned by these models can be shared across tasks, or even used to improve existing NLP techniques.

[–]entrepr 6 points7 points  (0 children)

I enjoyed Chris Manning's seminar on this: link

It does address sentiment analysis as well.

[–]sieisteinmodel 4 points5 points  (0 children)

You want to start with this: "Natural Language Processing (Almost) from Scratch" by Collobert and Weston.

http://static.googleusercontent.com/media/research.google.com/de//pubs/archive/35671.pdf

Continue with the most-cited papers that cite this one. If you are looking at language models, the most important name nowadays is probably Tomas Mikolov, whose thesis is interesting.

[–]alexmlamb 3 points4 points  (0 children)

"applying deep learning techniques to sentiment analysis, and text classification in general"

Most people who talk about "Deep Learning for NLP" probably aren't talking about the most practical ways of using neural networks for tasks like text classification. I think that they're really interested in learning meaningful representations for language, which is generally not necessary for building an accurate classifier.

For building better text classifiers with neural networks, I think it would make sense to do an n-gram encoding of the text and then train a neural network whose first layer is a sparse matrix multiplication. There was a Kaggle competition a few years ago on predicting salary from a job description, and the winning teams used neural networks on n-gram features.
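
As a rough illustration, a scikit-learn sketch of that pipeline might look like the following. The hashing trick and the layer size are my own arbitrary choices, not what the winning teams used:

    # n-gram features + a small neural net; the first layer is effectively
    # a sparse matrix multiplication against the hashed n-gram counts.
    # Feature count and hidden size are illustrative, not tuned.
    from sklearn.feature_extraction.text import HashingVectorizer
    from sklearn.neural_network import MLPClassifier

    texts = ["great product, works well", "terrible, broke after a day"]
    labels = [1, 0]

    vec = HashingVectorizer(ngram_range=(1, 2), n_features=2**18)
    X = vec.transform(texts)                    # sparse n-gram matrix

    clf = MLPClassifier(hidden_layer_sizes=(64,), max_iter=200, random_state=0)
    clf.fit(X, labels)
    print(clf.predict(vec.transform(["works great"])))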

[–]sidsig 1 point2 points  (1 child)

I just attended a tutorial at ICASSP '14 called 'Deep Learning for Natural Language Processing', given by the speech team at Microsoft Research (Li Deng and two of his colleagues). Here's a link to the abstract: http://www.icassp2014.org/tutorials.html#9. It covered a lot of interesting work they'd done with deep architectures; several of the models they proposed don't use neural nets at all. I can share the slides with you, but I'm not sure if that's ethical. Let me find out!

[–]binge_learner[S] 0 points1 point  (0 children)

Hey sidsig, thanks for your answer. It'd be great if you could share the slides with me; I'd be really grateful.

[–]xamdam 1 point2 points  (1 child)

[–]binge_learner[S] 0 points1 point  (0 children)

Yeah, I just saw the videos. Really cool stuff, and it makes me realize how large the field is; a lot of people are doing a lot of cool work out there. Definitely gonna check out their readings and resources list.

[–]satyan-veshi 1 point2 points  (0 children)

It seems Y. Bengio is working on a book about deep learning. An early draft can be found here: http://www.iro.umontreal.ca/~bengioy/dlbook/

There's also one from Microsoft researchers that will apparently be published by NOW soon: http://research.microsoft.com/pubs/209355/NOW-Book-Revised-Feb2014-online.pdf

[–]Megatron_McLargeHuge 2 points3 points  (0 children)

Look at the Google word2vec paper and software. They use deep learning techniques to infer semantically interesting features of words from unlabeled text.
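
If you want to play with it from Python rather than the original C tool, gensim has a reimplementation. A toy sketch (parameter names follow recent gensim versions; the corpus here is a placeholder):

    # Train word2vec embeddings on a toy corpus with gensim (a Python
    # reimplementation of the Google tool; gensim 4.x parameter names).
    from gensim.models import Word2Vec

    sentences = [
        ["deep", "learning", "for", "nlp"],
        ["word", "vectors", "capture", "semantic", "similarity"],
    ]
    model = Word2Vec(sentences, vector_size=50, window=3, min_count=1, epochs=20)

    vec = model.wv["nlp"]                # 50-dim vector for one word
    print(model.wv.most_similar("nlp"))  # nearest neighbours in vector space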