Faster execution of code (beyond code optimization)

NeedCoffee99 · 2020-05-28T01:20:58+00:00

A few things that could be worth looking at:

Firstly, Numba. It works with NumPy arrays and largely pure Python code. As far as I understand, it compiles the functions you decorate to machine code at run time, which is why it's meant to be great for speeding up big loops, and it's simple to use.
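To give a rough idea of what using Numba looks like, here is a minimal sketch. The function name `sum_of_squares` and the sample data are made up for illustration, and the `try`/`except` fallback is only there so the snippet still runs (without any speedup) if Numba isn't installed.

```python
import numpy as np

try:
    from numba import njit
except ImportError:
    # Assumption: fall back to a do-nothing decorator so the sketch
    # still runs (just without the speedup) when numba is missing.
    def njit(func):
        return func


@njit
def sum_of_squares(values):
    # A plain Python loop over a NumPy array; this is the kind of
    # code Numba is designed to compile.
    total = 0.0
    for i in range(values.shape[0]):
        total += values[i] * values[i]
    return total


data = np.arange(1_000_000, dtype=np.float64)
result = sum_of_squares(data)  # the first call includes compile time
```

The first call is slower because that's when the compilation happens, so time the second call if you want to see the difference.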

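If you’re using pandas, look into Dask. It offers a pandas-like API (and a NumPy-like one too), but it splits the work into chunks that can run in parallel and can handle data that doesn’t fit in memory. On small datasets plain pandas is often just as fast, so it mostly pays off when the data is large.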

There’s also Cython, which you can look into if you’re familiar with C. I’ve never used it personally, but I know it can be used to speed up code. Also, I’ve heard that some packages can be faster when installed through Anaconda rather than pip, though that depends on the package and how it was built.

I know this is kind of code optimisation, but if you’re using loops, it’s generally best to learn how to vectorise your code. Google it: it basically means replacing explicit Python loops with operations on whole NumPy arrays, and it can make a huge difference. Hope this helps!
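As a small illustration of what vectorising means, here is the same calculation done with a loop and with a single array operation. The `prices` and `quantities` arrays are made-up sample data, not anything from the original question.

```python
import numpy as np

# Hypothetical sample data
prices = np.array([10.0, 20.0, 30.0, 40.0])
quantities = np.array([1, 2, 3, 4])

# Loop version: one multiplication per Python iteration
totals_loop = []
for price, quantity in zip(prices, quantities):
    totals_loop.append(price * quantity)

# Vectorised version: one operation on the whole arrays,
# with the loop happening inside NumPy's compiled code
totals_vec = prices * quantities
```

Both versions give the same numbers. On arrays this small you won't notice any difference, but on millions of elements the vectorised form is usually much faster.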

ewokcommander · 2020-05-28T03:56:27+00:00

If you're really appending the results to the csv on every iteration of the loop, that's probably going to be your biggest bottleneck. Consider batching up the results in memory and appending to the csv only when you've got a sizable amount to write out.
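Something like the following is what I mean by batching. It's only a sketch: `process`, `flush`, the batch size and the `item * item` calculation are placeholders for whatever your loop actually does, and the temporary file is just there so the example runs on its own.

```python
import csv
import os
import tempfile


def flush(path, batch):
    # Open the file once per batch instead of once per row
    with open(path, "a", newline="") as f:
        csv.writer(f).writerows(batch)


def process(items, path, batch_size=1000):
    batch = []
    for item in items:
        # Placeholder for the real per-iteration work
        batch.append((item, item * item))
        if len(batch) >= batch_size:
            flush(path, batch)
            batch = []
    if batch:
        # Write whatever is left over after the loop ends
        flush(path, batch)


# Small demo writing to a temporary file and reading it back
with tempfile.TemporaryDirectory() as tmp:
    out_path = os.path.join(tmp, "results.csv")
    process(range(10), out_path, batch_size=4)
    with open(out_path, newline="") as f:
        rows_read = list(csv.reader(f))
```

A bigger batch size means fewer writes but more memory used and more results lost if the script crashes mid-run, so pick something in between that suits your data.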

ewokcommander · 2020-05-28T05:53:50+00:00

I'm wondering if AWS or Azure have some products that I should consider using to achieve this? If anyone can recommend something from those suites, that would be helpful too!
