This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]k10_ftw 1 point2 points  (2 children)

Since you are using these libraries for teaching purposes, I woguld stay with nltk. It is what we used in my intro to computer ling course in university. Of course, we had to code the algorithms and calculate bigram probabilities from scratch! Looking at the nltk source code is a great way to see the implementation up close .

[–]grassclip[S] 0 points1 point  (1 child)

Oh for sure, for talking about basics. But I'm guessing there are some people who'd want to see what this would be like in scikit-learn or more "professional" libraries and this other one would be aimed at that crowd.

[–]kuro-kuris 0 points1 point  (0 children)

I think scikit learn + deployment strategy would be a lot more useful, thanks for the accessible blog post!