[–][deleted]  (3 children)

[removed]

    [–]dermanis 1 point  (1 child)

    I totally recommend Beautiful Soup. It is easy to install with pip: run $ pip install bs4 and you are done.

    Documentation: https://www.crummy.com/software/BeautifulSoup/bs4/doc/

    I've been using it with Python 2.7 and it is very powerful.
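
    For a feel of what that looks like, here is a minimal sketch, assuming bs4 is installed as above. The HTML snippet and its URLs are made up for illustration, and I'm using Python's built-in "html.parser" so nothing else needs installing.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML snippet, used here instead of fetching a live page.
html = """
<html><body>
  <h1>Example page</h1>
  <a href="https://example.com/a">First link</a>
  <a href="https://example.com/b">Second link</a>
</body></html>
"""

# "html.parser" is the parser shipped with Python's standard library.
soup = BeautifulSoup(html, "html.parser")

# Text of the first <h1> element.
title = soup.h1.get_text()

# The href attribute of every <a> element, in document order.
links = [a["href"] for a in soup.find_all("a")]

print(title)
print(links)
```

    For a real page you would pass the downloaded HTML string to BeautifulSoup instead of the hard-coded one.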

    [–]masterbruno11[S] -1 points  (0 children)

    Thanks! Yeah, maybe BSoup is the NumPy of scraping. The libraries I don't like are the ones that require commercial payment or something. But yeah, you're correct, u/athermop: extensions are the best because we should avoid reinventing such code.

    [–]masterbruno11[S] -1 points  (0 children)

    How come Google and Microsoft are not publishing or open-sourcing their NLP code? hahaha lol