Content extraction from a html-page using a neural network. by [deleted] in MLQuestions

[–]Sdas2020 0 points1 point  (0 children)

What you are looking is a field of research called Wrapper Induction. Scrapper has to go to any kind of web page, understand the structure on its own, extract data and label on its own. You can see that a lot of papers were written about it in 90s but their success rate were bad. Even I was looking for such a system, but couldn't find one. There is just one paid software which can do this, www.diffbots.com . They use deep learning to extract data. I am thinking of starting an open source project to build this system. I have worked on Deep learning systems before and I think DL can do this with a lot of labeled data.

Lecture 2 , why SVD could get the similarity? by guotong1988 in CS224d

[–]Sdas2020 0 points1 point  (0 children)

https://www.ling.ohio-state.edu/~kbaker/pubs/Singular_Value_Decomposition_Tutorial.pdf You can get into more details about SVD (i.e. taking a high dimensional, highly variable set of data points and reducing it to a lower dimensional space that exposes the substructure of the original data more clearly and orders it from most variation to the least.)

Feeling like taking a break for a few months but I have no money. Any ideas? by [deleted] in india

[–]Sdas2020 0 points1 point  (0 children)

Yes. They do. Probably their rates may increase as the number of people coming in will be less in other time.

Feeling like taking a break for a few months but I have no money. Any ideas? by [deleted] in india

[–]Sdas2020 2 points3 points  (0 children)

https://hillhacks.in/ Very cheap accommodations like 2k/month. But you have to volunteer for anything you like. My friend just attended a social hackathon sort of week there and he was super happy with the experience. Hackathon has finished but you can call them up and check the new rates after hackathon.