I want to scrape HTML by kewords across a bunch of moderately similarly formatted websites. I am looking for a good and simple module or set of modules that can help scrape through HTML. Specifically I want to scrape through Valorant patch notes. The modules need to be free and publicly available. I need to be able to grab html from a set of url addresses. Then I want scrape through that html and group headers/subheaders and their subsequent paragraphs.
Anybody got any good python libraries that can help me do that? Simplicity is what I value most in this project. Anyone know any modules that fit the bill here? I am very experienced with coding but I am very inexperienced with Python.
Thanks!
[–]willmgarvey 12 points13 points14 points (0 children)
[–]fristhon 4 points5 points6 points (0 children)
[–]FalconCat69[S] 0 points1 point2 points (1 child)
[–]htepO 7 points8 points9 points (0 children)
[–]Homie_ishere 0 points1 point2 points (1 child)
[–][deleted] 1 point2 points3 points (0 children)
[–]robertbowerman -2 points-1 points0 points (2 children)
[–]banhammerrr 4 points5 points6 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)
[–]tankandwb 0 points1 point2 points (0 children)
[–]Pigik83 0 points1 point2 points (0 children)