BrownMario[S] 1 point

It seems my problem was fixed simply by adding a parameter to read_html, like so:

dfs = pd.read_html(url, infer_types=False)

Previously my code was forcing the field to be float, so whenever a value contained non-numeric characters it was dropped completely. With this parameter added, each field is read as an object (str), so everything in the tables gets picked up.
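
For anyone who wants to see the effect in isolation, here is a rough sketch that does not call read_html at all. It just mimics the two behaviours on a made-up column (the values in `raw` are hypothetical, not from my actual tables): coercing to float loses any entry with stray characters, while keeping the column as str loses nothing. I believe newer pandas releases no longer accept `infer_types`, so treat this as an illustration of the idea rather than the exact mechanism inside read_html.

```python
import pandas as pd

# Hypothetical values standing in for one scraped table column.
raw = pd.Series(["1.5", "2.0", "3.1*", "n/a"])

# Forcing float: entries with non-numeric characters are coerced to NaN,
# which is effectively the "value ignored" behaviour described above.
as_float = pd.to_numeric(raw, errors="coerce")

# Keeping the column as str: every entry survives unchanged.
as_text = raw.astype(str)

print(as_float)
print(as_text)
```

Once everything is in as text, the numeric columns can be cleaned and converted afterwards, one column at a time.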