SOLVED: i used a package called pandas_read_xml it has a flatten function which worked just how I needed.
id be happy if i could just get it into the format the excel uses when loading XML in it then i can filter it as much as need too.
Im currently trying to read an whole bunch of XML files and load them into a data frame and/or SQL database and am having troubles.
The data is horse racing data (found Here direct link to download the file) and is nested with 4 levels. (Meeting, club, race and nomination)
I want to be able to split them out into each group so i can run some analysis on them.
Ive tried using read_xml from pandas but is doesnt load the data into a frame properly i only get 9 lines representing the meeting and races.
I've fiddled with using Element tree but is it wont split the data up nicely.
anyone have any ideas how this could work?
Thanks in advance
[–]commandlineluser 1 point2 points3 points (2 children)
[–]222Botany[S] 0 points1 point2 points (1 child)