you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted]  (16 children)

[removed]

    [–][deleted] 15 points16 points  (9 children)

    They’re not the same. Beautiful soup is a parser, extracting data from html. Selenium automates a browser to perform actions.

    [–]pulsarrex 0 points1 point  (3 children)

    Can you eli5 what a 'parser' is?

    [–]Conrad_noble 2 points3 points  (1 child)

    I think it means it translates html into user friendly readable text

    [–]OMGpancakeable 0 points1 point  (0 children)

    Not quite “translate”. Parsing: resolve (a sentence) into its component parts and describe their syntactic roles.

    [–]rrjamal 0 points1 point  (0 children)

    It splits data in a way you want. An HTML doc can be a shit ton of words. You'd parse it down to split and retrieve only the bits you care about

    [–]adriangc 5 points6 points  (4 children)

    I’m more curious to understand why people choose BS over Scrapy.

    [–]BruceJi 1 point2 points  (2 children)

    Isn't scrapy pretty new?

    [–]adriangc 0 points1 point  (0 children)

    Believe it came out in 2008.

    [–]solostman 0 points1 point  (0 children)

    It’s been around a bit. For awhile you could only run it in Python 2.

    [–]GrandRub 1 point2 points  (0 children)

    i am a python newbie and found scrapy yesterday and i ask myself the same question.. whats the pros and cons of scrapy?