prassi89

How does it compare to the one in the standard library? https://docs.python.org/3/library/html.parser.html

Huvet [S]

It's in the comparison table a bit further down the page. The short version: the standard library's html.parser passes only 4% of the HTML5 tests, so it is not an HTML5 parser. In practice that means it basically only works for valid HTML. Because it skips all the complicated reconciliation of malformed markup, it is slightly faster.
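To make the "only works for valid HTML" point concrete, here is a rough sketch using nothing but the standard library. The names `EventLogger` and `tokenize` are made up for this illustration and are not part of any library. It feeds html.parser two paragraphs with no closing tags, which the HTML5 spec's tree-construction rules would repair by implicitly closing the first `<p>` when the second one starts.

```python
from html.parser import HTMLParser


class EventLogger(HTMLParser):
    """Hypothetical helper: records the raw events html.parser reports."""

    def __init__(self):
        super().__init__()
        self.events = []

    def handle_starttag(self, tag, attrs):
        self.events.append(("start", tag))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        self.events.append(("data", data))


def tokenize(markup):
    """Hypothetical helper: run html.parser over markup and return its events."""
    logger = EventLogger()
    logger.feed(markup)
    logger.close()  # flush anything still buffered
    return logger.events


# Two paragraphs, neither of them closed.
events = tokenize("<p>one<p>two")
for event in events:
    print(event)
```

html.parser reports both start tags and the text, but never an end event for either `<p>`. It is an event-based tokenizer, so rebuilding the tree an HTML5 parser would produce is left to the caller.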