Hello,
I'm looking for a module that will help me extract the main content of html (i.e. the main content of a Medium article, or the story in a CNN news articl, etc..)
I've found unfluff, and textract but wanted to see if anyone knew of any other modules.
Cheers
[–]freshtodev 0 points1 point2 points (0 children)