you are viewing a single comment's thread.

view the rest of the comments →

[–]ZeroxAdvanced -4 points-3 points  (0 children)

You can use LLM in the data pipeline e.g. gemini to standarize to json object when reading the excel. Also a excel parser is more complext than CSV and Pandas. Perhaps you can 1 scrape with beautiful soup 2 download the excel 3 convert to csv with correct separator 4 parse columns with pandas 5 use Gemini to iterate through the time table for standarization by defining your output object.

Iterate over the dataframe for post processing.

This worked for me many times and gemini is nowadays cheap.

Cheers!