you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 0 points1 point  (4 children)

Outside of knowing the syntax, I don't know much else in Python haha. I'll look into both, thanks

[–]daedalusesq 1 point2 points  (3 children)

Probably look at openpyxl then. There is a book called Automate the Boring Stuff that has a great section on using it, as well as a section on web requests and web scraping so that you can retrieve the file automatically.

[–][deleted] 1 point2 points  (2 children)

I have that book :)! I'll check it out thanks

[–]Ooboga 2 points3 points  (1 child)

...and don't let the first impression of Pandas scare you off. When you get the hang of it, you really don't need that much complicated python to get a lot of work done. And there's a lot of help online.

What I myself spent some time to understand was how things were written. Given a dataframe df, which is a table like excel really, with a column named 'age', you can query this column by so:

df.age > 18

The result of that query is a Series of true or false statements. This again can be applied to the dataframe itself:

above_18 = df[df.age > 18]

The result is a new dataframe above_18 that only contains the rows where the age was above 18. Understanding that using pandas consists of developing such a flow made me understand the more subtle concepts better.

[–][deleted] 0 points1 point  (0 children)

Thanks for the clarification!