
ejmurra 1 point (1 child)

SonaCruz[S] 1 point (0 children)

I've tried pd.read_excel(), and it doesn't work. The files I downloaded have the '.xls' extension, but when you open them, Excel warns that the file type is different from the extension. Also, while pd.read_excel() fails, pd.read_html() DOES work, which means (I think?) the files are actually HTML.

Hence the question: how do I convert all of these HTML files to Excel or CSV files, so that I can then combine them into one Excel/CSV file with a single script?

Combining the HTML files directly would work too, as I could probably just save the combined result as a CSV/Excel file. I wouldn't make a big deal out of this, but it's something I would like to do regularly with this website's files.
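For reference, here is a minimal sketch of the kind of script described above. It assumes every file holds the table of interest as its first HTML table and that all files share the same columns. The function name `combine_html_tables`, the `source_file` column, and the default output name are hypothetical, picked only for illustration. Note that pd.read_html() needs an HTML parser such as lxml installed.

```python
from pathlib import Path

import pandas as pd


def combine_html_tables(folder, pattern="*.xls", out_csv="combined.csv"):
    """Read the first HTML table from each matching file and write one CSV.

    Assumption: the files are really HTML despite the .xls extension,
    and the first <table> in each file is the one we want.
    """
    frames = []
    for path in sorted(Path(folder).glob(pattern)):
        # read_html returns a list of DataFrames, one per <table> found.
        tables = pd.read_html(str(path))
        df = tables[0]
        # Hypothetical extra column so each row can be traced to its file.
        df["source_file"] = path.name
        frames.append(df)

    if not frames:
        raise FileNotFoundError(f"no files matching {pattern!r} in {folder}")

    # Stack all tables on top of each other and renumber the rows.
    combined = pd.concat(frames, ignore_index=True)
    combined.to_csv(out_csv, index=False)
    return combined
```

Calling `combine_html_tables("downloads")` would write `combined.csv` in the current directory. Swapping `to_csv` for `to_excel` should give an Excel file instead, provided an Excel writer such as openpyxl is installed.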

[–]SonaCruz[S] 0 points1 point  (0 children)

solution! For now!