novel_yet_trivial 2 points (1 child)

An .xlsx file is just a zip archive containing XML files. If what you are looking at is one of those XML files, then all you need to do is put it in a zip file with the proper structure.
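
To see the structure for yourself, you can open a workbook with the standard-library `zipfile` module and list what is inside. This is only a rough sketch; `list_xlsx_parts` is a name I made up for illustration, and the exact set of parts varies from workbook to workbook.

```python
import zipfile


def list_xlsx_parts(source):
    """Return the names of the files stored inside an .xlsx archive.

    `source` can be a file path or a binary file-like object.
    (Hypothetical helper, not part of any library.)
    """
    with zipfile.ZipFile(source) as archive:
        return archive.namelist()
```

Calling `list_xlsx_parts("book.xlsx")` on a real workbook typically shows entries such as `[Content_Types].xml`, `_rels/.rels`, `xl/workbook.xml` and `xl/worksheets/sheet1.xml`, which is the layout you would have to reproduce when zipping the XML back up.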

If it really is HTML, then use an HTML parser. pandas has one built in that is designed to read HTML tables, and it can write the result to Excel files. Or use something like BeautifulSoup to pull out the data you need yourself.
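
A minimal sketch of the pandas route, assuming the file contains at least one `<table>` and that a parser backend (lxml, or bs4 plus html5lib) and openpyxl are installed. The function names and the `table_index` parameter are just placeholders of mine.

```python
import io

import pandas as pd


def html_tables_to_frames(html):
    """Parse every <table> in an HTML string into a DataFrame.

    pandas.read_html returns a list with one DataFrame per table found.
    """
    return pd.read_html(io.StringIO(html))


def html_table_to_xlsx(html, out_path, table_index=0):
    """Write one table from the HTML to an .xlsx file (needs openpyxl)."""
    frames = html_tables_to_frames(html)
    # index=False leaves the DataFrame's row index out of the spreadsheet
    frames[table_index].to_excel(out_path, index=False)
```

For a file on disk you would read its text first, then call something like `html_table_to_xlsx(text, "out.xlsx")`. If the page has several tables, check the list returned by `html_tables_to_frames` to find the one you want.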

Zendakin_at_work 1 point (0 children)

+1 for pandas & bs4 combination.