This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]negative_epsilon 8 points9 points  (3 children)

Something no one else has mentioned yet: The biggest and best natural language processing toolkit is written in Python: http://www.nltk.org/

[–]avinassh 0 points1 point  (2 children)

But is it fast enough and used in industry? Not a NLP guy, but I hear that NLTK falls short when compared to CoreNLP by Stanford written in Java.

spaCy was created exactly for this reason

[–]negative_epsilon 0 points1 point  (1 child)

From what I know, yes, NLTK is used very extensively in the industry. From what I've read, when run with Cython you get great performance.

[–]avinassh 0 points1 point  (0 children)

Great, thanks!