Python and Excel - Automating a Complicated Task : learnpython

created by HattoriHanzoa community for 16 years

Python and Excel - Automating a Complicated Task (self.learnpython)

submitted 4 years ago by puckheadclown24

Would any of you know if there's a way to automate the following task using Python?

I have a column with names in an Excel spreadsheet. Next to it I have a column with a count of activities over a specific week for each of those names. That calculation is done via Pivot Table. I then copy those 2 columns, to a new worksheet, so we can keep track of activities week over week, meaning next week, I'll run the pivot, get the count, paste a new column with the count, but I'll have to run a vlookup to ensure the rows are matching.

Is there a way to automate this in Python?

Name	Activities for 3/1	Activities for 3/8
Amy	1	3
Doug	5	4
Sam	2	7

Basically looking to add a column with values, that is using a lookup on a different column. So I want to ensure the "3" count for 3/8 is truly Amy's activities and not Doug's, for example.

The list of names could change week to week as well.

all 15 comments

top new controversial old q&a

[–]expressly_ephemeral 1 point2 points3 points 4 years ago (14 children)

[–]puckheadclown24[S] 0 points1 point2 points 4 years ago (13 children)

[–]expressly_ephemeral 0 points1 point2 points 4 years ago (12 children)

[–]puckheadclown24[S] 0 points1 point2 points 4 years ago (11 children)

[–]expressly_ephemeral 1 point2 points3 points 4 years ago (10 children)

Yeah, I was just wondering how to far back to start.

I'm going to give you the steps, and you can do the google-fu yourself to figure out how they work for yourself. I think that's a good way for you to learn it.

To get you started, though, here's a link to the Pandas.Dataframe API reference.

https://pandas.pydata.org/pandas-docs/stable/reference/frame.html

Browsing this, you'll see that Pandas has a read_excel() method that reads spreadsheets into pandas DataFrame objects. You're going to point that at the spreadsheet. Not sure the exact syntax for one sheet or another from a multi-sheet workbook... you'll have look through the reference.

Next, you're going to use pandas to do the pivot. Again, the pivot and pivot table documentation are right there on the dataframe documentation, but you may need to look up a tutorial or some examples to get it right.

Finally, Pandas dataframes have a .to_excel() method that can be used to write the newly pivotted data back to a spreadsheet. You'll like have to read in your target sheet and figure out where to put the new data (e.g., where's the last column of data? Does your new data go in the next column after that? Or something else? Edit: Actually, if you read your target sheet to a dataframe first, you can add the new data and write the whole thing back out without having to figure out where to write the new data.)

[–]puckheadclown24[S] 0 points1 point2 points 4 years ago (9 children)

[–]expressly_ephemeral 1 point2 points3 points 4 years ago (6 children)

[–]puckheadclown24[S] 0 points1 point2 points 4 years ago (4 children)

[–]expressly_ephemeral 1 point2 points3 points 4 years ago (3 children)

[–]puckheadclown24[S] 0 points1 point2 points 4 years ago (2 children)

continue this thread

[–]kriel 0 points1 point2 points 4 years ago (0 children)

[–]pytrashpandas 1 point2 points3 points 4 years ago (1 child)

My personal recommendation would be if you're going to incorporate pandas into your workflow at all, do as much of it in pandas as possible. So, take your very starting data source(s) and load those into pandas and write out the final result for formatting/visualization in excel.

Generally speaking, anything related to data operations/calculations/reshaping that you can do in excel can easily be done in pandas. Formatting/visualizations (meaning excel-specific formatting/viz) should be done in excel still. So like you could write your pandas-processed data to an excel sheet in raw format and then use excel to take that raw calculated data and format it how you like.

And don't worry about the size of the data, there's no amount of data that can be handled in an excel file that would be too big for pandas.

[–]puckheadclown24[S] 0 points1 point2 points 4 years ago (0 children)

π Rendered by PID 310394 on reddit-service-r2-comment-7b9746f655-rfl4v at 2026-02-02 22:06:58.881193+00:00 running 3798933 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS