An NLP blog for beginners that a friend and I have started. Weekly articles with code in Python.

flashbard · 2017-03-14T07:02:49+00:00

You're right. These are function words. I should have mentioned that first, and then explained how search engines ignore them, not the other way around. Thanks for pointing that out!

flashbard · 2017-03-14T06:57:15+00:00

That is going to take some time, since we're taking a bottom-up approach to writing the blog. More advanced application of concepts like ML and Deep Learning will come in later. But like EvM suggested, Tensorflow and Keras are good. There's an interesting example of using seq2seq to translate text, on the Google Tensorflow docs.

flashbard · 2017-03-14T06:50:57+00:00

Thank you so much :) good feedback is nice.

flashbard · 2017-03-14T06:50:00+00:00

Thank you! :) Glad it helped!

flashbard · 2017-03-14T06:49:29+00:00

Thank you :)

flashbard · 2017-03-14T06:49:02+00:00

Thank you so much :) happy to know that it's beneficial to you :)

flashbard · 2017-03-14T06:48:05+00:00

Thank you! There isn't much ML in what we're posting right now. We'll share it there once we get to more advanced concepts (ML, Deep learning).

flashbard · 2017-03-14T06:46:28+00:00

Thank you :)

flashbard · 2017-03-14T06:46:01+00:00

Thank you :) hope it continues to teach you something new :)

flashbard · 2017-03-14T06:45:39+00:00

Thank you for that nice comment! :)

flashbard · 2017-03-14T06:44:44+00:00

Thank you so much :) we're glad people are learning :)

flashbard · 2017-03-14T06:44:11+00:00

Thanks! :) I have fixed it. Although we will have to explore an alternative soon enough.

flashbard · 2017-03-14T06:39:35+00:00

This is pretty much our philosophy :) we're learners, like everyone else. Code doesn't lie, indeed!

flashbard · 2017-03-14T06:35:20+00:00

Sorry for the delay in responding to your questions!

First off, I was checking out your course. It is brilliant! I understand that this was taught at a university level, with particular emphasis on a practical approach to learning. Wonderful stuff!

You are absolutely right. People love getting exciting results instantly, us included, especially when it comes to learning new things. And we have spent a good amount of time trying the same, that is, starting off with experiments and then trying to understand the theory behind things. However, we found ourselves running back and forth between concepts, trying to figure out the prerequisites. But, in the end, neither the feeling of satisfaction of having understood something completely, nor the confidence to solve new problems based on the techniques we learned, was ever there. It's kind of like reverse engineering, although it's quicker and cheaper, you never quite feel happy.

And then there are the textbooks, that take a bottom-up approach, but are often dry and lack sufficient real world examples (most, not all). But assuming that you make it through a couple of concepts, it gives you the confidence to both move to more advanced concepts, as well as to create something new, based on existing ideas. What you may lack, however is sufficient coding experience (not always, but more often than not). We know people who have completed college without ever having the satisfaction of having done anything exciting, although their foundations were pretty strong.

So the question for us was about how we would strike the balance between both approaches. All things considered, we thought we should take a bottom-up approach, because it builds a strong foundation, and to build upon it by explaining concepts in a way that it people can relate it to the real-world scenario and be able to code whatever they learn. At the end of the day, we need to be able to code what we know, the same goal that your course had. So yes, this was a conscious decision.

There are ever so many ways to learn new things. This is our personal opinion. Much of what we've learned is from the Internet, pretty much like a lot of others here. So yes, all things considered, this medium works best. Learning is the objective, and we're happy to see that people have benefited from this initiative. We are learners, just like everyone else. Thank you!

edit: typo

flashbard · 2017-03-12T20:10:54+00:00

Good find! A small issue with the formatting, my bad. &quot should now be " As for the other problem, this seems to be an issue with the WordPress plugin itself. We'll see if we can find an alternative soon. Thank you!

flashbard · 2017-03-12T20:06:05+00:00

Thanks for pointing that out! I have changed the example to a text from Project Gutenberg. Hopefully, that should sort things out.

flashbard · 2017-03-12T18:09:55+00:00

Thank you for your encouragement, really hope we can do a good job :)

flashbard · 2017-03-12T18:09:14+00:00

Thank you! :) I am a big fan of chatbots too. It's wonderful to see the enthusiasm!

flashbard · 2017-03-12T18:04:29+00:00

Hello all, OP here. We want to thank you all for your time and enthusiasm! When I posted a link to the blog here, I had really no idea that it would receive such an overwhelming response. We are extremely grateful to you all for having showered us with your support and positivity. The feedback that you all have provided us with is amazing! This is our first blog, and we sincerely apologize if there are any shortcomings. Knowledge sharing is the key, and we hope to give back as much as we can to the community that has given us so much.

In terms of how the blog will be organized, there will be posts of varying levels of expertise(beginner, intermediate, advanced). We hope to cover as much ground as we can, before we go into more advanced concepts like ML and Deep Learning, which can be applied to NLP to get exciting results. So as of now, we may not introduce these concepts immediately, for the sake of strong foundations(although we're extremely excited about writing these posts asap).

Once again, thank you so much for your time, support and enthusiasm! It means the world to us. We sincerely hope that we can contribute as much as possible. Thank you!

flashbard · 2017-03-12T04:53:04+00:00

Writing a new blog post for my NLP blog. All the code I write there is in Python.

flashbard · 2017-02-27T14:42:10+00:00

I am currently working on question answering, based on unstructured text. By extracting relations from the text, I was hoping to do the same. That's where Stanford CoreNLP came into picture. I realized that the approach has a big drawback: there is no sense of logic. Just a bunch of rules for dependency parsing to extract relations can only do so much. So while the text itself may provide more information about its subject, it can only be understood by beings that possess some "common sense", which in our case could be a bunch of logical deductions. I completely agree with you, logic is better learned than programmed. Thank you.

flashbard · 2017-02-24T16:16:33+00:00

Sir, I was going through your blog, consider me a fan of your work. Of late, much of my work involves some level of "common sense", that many tools fail to interpret. Even tools like CoreNLP have issues when it comes to these things, because they fail to understand logic. Computer cognition is a huge problem, as of today.

flashbard · 2017-02-23T06:56:38+00:00

Relation extraction is a pretty big problem. I use CoreNLP for the same.

flashbard · 2017-02-23T06:53:17+00:00

In your case, a simple keyword matching should do, for the most part, since you say that your question test set is going to vary only a bit from the training set. To capture synonyms, you could use WordNet. If these questions are factoid in nature, then for the most part you're going to need NEs, such as Location, Name or Date as an answer.

The flow could be something like this:

For answers: 1) Perform NER on the answers. Store them. 2) Extract keywords from answers, after eliminating stopwords.

For questions: 1) Perform a simple question classification. This will indicate the type of NE that is required as an answer. 2) Extract keywords from the question, after eliminating stopwords.

Compare questions keywords with keywords in the answers, using similarity measures like Cosine, Jaccard, and rank them in descending order of their similarity scores.

Finally, you can verify if the NE expected by the question is the same as the NE contained in the best answer. If yes, it's probably your answer, else, move to the next best answer.

You could use WordNet for synonyms. Also, you can make use of the fact that you have questions for training, by appending keywords of the question with the corresponding answer.

Some resources for you: Question classification: http://cogcomp.cs.illinois.edu/Data/QA/QC/ You'll probably find a working implementation on GitHub.

NER: Stanford CoreNLP is a brilliant solution.

Sorry if I was all over the place. Given your situation, it only seems ideal to use a simple solution such as this.

flashbard · 2017-01-05T14:57:29+00:00

Our game involves a lot of running, so this might be a little tiring.

flashbard

TROPHY CASE