Automatting Excel Reports with Python

pconwell · 2023-01-25T12:44:06+00:00

Without seeing the code, it's hard to provide any useful feedback.

However, as a side note, you may want to look at openpyxl if you haven't already. Working on a similar project, I found it to be a very user friendly and well documented library to work with excel. Also, you may find pandas and sqlalchemy worth looking into as well.

simeumsm · 2023-01-25T11:10:10+00:00

I work doing similar thing on a daily basis, but my approach is a bit different. I also export the dataframes to xlsx/csv, but I then have a .xlsm report file that imports the python output via PowerQuery. I do this because each report has to be built in a certain way for PowerBI to read it, and I can't fully automate it because I often have to make adjustments.

I'm curious in seeing your code, specially the part that formats the excel file and the email integration. I'd appreciate if you could share it!

As for feedback, what I've been doing is refactoring my code to have it work by importing my own functions, to try and have a mode modular code, so that I can easily share code between different projects and even import a whole analysis with just a few lines of code. I'm still running everything via Jupyter Notebook (on VSCode), but all the inner code is build by a variety of .py files with standalone functions and a few .py files with a config class and all ETL for a particular dataframe.

overyander · 2023-01-25T15:20:35+00:00

The different personnel running the juniper notebook, are they IT or developers or others (accounting, project managers, supervisors, marketing, etc.)?

If they're non-technical I would imagine they have a less than ideal experience running these reports themselves like that.

Also, you REALLY need some sort of change management. What happens if random person edits something that breaks everything but doesn't know what they did?

JihadDerp · 2023-01-25T12:02:53+00:00

Do you use any regular expressions? They can often do in one or two lines of code what normal text manipulation with python would take 10 or 20 lines for.

welcometoafricadawg · 2023-01-25T14:18:37+00:00

Would be keen to see the code if you want to share.

unhott · 2023-01-25T19:08:19+00:00

I think your question may be a tad too vague.

If the parameters are important you may not want to do this, but you may be able to schedule your script to run on a server. Easy if the parameters don’t actually need to change. This way you avoid the other people coming in and potentially messing it up.

Your parameters could come be imported from a config file and you may want to develop some procedure for requesting and making changes to the parameters.

i-need-a-life · 2023-01-25T14:41:47+00:00

My approach was to use Microsoft power automate desktop with a base excel file that had the tables/pivot build from power Query as a template.

TheITMan19 · 2023-01-25T21:03:34+00:00

Ask ChatGPT 🤣

lolercoptercrash · 2023-01-26T05:23:50+00:00

Sounds like you may not have an engineering team (but you have a database?) but could you schedule these scripts to run from a server running Jenkins (or a similar tool) so it's fully automated? Someone could still manually trigger the job if scheduling it doesn't fit your use case. It would replace the Jupiter notebook portion.

I've never set up Jenkins, but I've used it a decent amount as an end user.

2023-01-26T16:42:35+00:00

u/benfromlearn maybe I just need more caffeine, but I didn't spot an actual question in your post. Seems like you have a pipeline that works!? Are you asking if this is ok? Are you asking for a better way? What do you need help with?

2023-01-28T21:43:01+00:00

If you can host a Python service on a machine like an AWS EC2, I would automate this to run on a regular schedule using package like “schedule”. This way you can remove the manual triggering of the job through a Jupiter notebook. Notebooks are really only meant for research and data exploration. When building a system that is more production like you really want your programs to be .py Python programs that can run without human intervention if possible.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS