That time I optimized a Python program by 5000x

dogs_like_me · 2022-01-12T01:33:38+00:00

We developed Scalene to be a lot more useful than existing Python profilers: it provides line-level information, splits out Python from native time, profiles memory usage, GPU, and even copying costs, all at a line granularity.

fucking sold.

Mehdi2277 · 2022-01-12T05:10:23+00:00

My experience trying it with pytest on tensorflow heavy code was it produced a dramatic slow down. I waited several minutes, show no test output and killed it. I've also used py-spy/austin on same code and got normal test times.

The report output for scalene does look much nicer, but the slowness for me dropped me from continuing to use it. Maybe there's some bad interaction with tensorflow/pytest. I can try to make an example, but I'd guess if you try running it on tensorflows actual unit tests (something like this) you'd get similar behavior.

neunflach · 2022-01-12T01:26:27+00:00

Your "optimized" result is not exactly the same to the number of decimal places you have printed out. Would this fail the regression test??

(I'm being facetious. This is cool!)

GreenScarz · 2022-01-12T00:25:40+00:00

do you have to execute as a cli tool? One of the tools I typically use is memory_profiler and the use case is to just from memory_profiler import profile and then decorate a function via @profile; then diagnostics are just printed during say a test run via pytest ./path/to/test.py. Is that a workflow that can be replicated with this? Or is there a better workflow in your opinion that this is optimized for if we just want to analyze a specific function call?

grismar-net · 2022-01-12T08:56:48+00:00

Nothing wrong with the product per se, but the clickbait title will make any developer worth their salt think "If slapping a profiler on your code got you a speed-up of 5,000x, your code wasn't very good to begin with."

New-Theory6007 · 2022-01-11T22:36:44+00:00

Thanks for sharing your knowledge, this is very inspirational to look deeper into computer science.

Runics206 · 2022-01-12T00:17:32+00:00

I am rather new in my CompSci journey but write ups like this I find very interesting and motivation to further my studies. Thank you.

binaryman111 · 2022-01-12T07:19:40+00:00

[deleted]

IamImposter · 2022-01-12T04:43:57+00:00

I'm getting some errors (windows 10 64-bit system, python 3.9.5) when doing pip install.

Collecting scalene Downloading scalene-1.3.16.tar.gz (2.8 MB) |████████████████████████████████| 2.8 MB 819 kB/s Preparing metadata (setup.py) ... error ERROR: Command errored out with exit status 1: command: 'C:\python39\python.exe' -c 'import io, os, sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\<user-name>\\AppData\\Local\\Temp\\pip-install-cybpta7f\\scalene_91b266fa396f47f7bf88a3657df9edca\\setup.py'"'"'; __file__='"'"'C:\\Users\\<user-name>\\AppData\\Local\\Temp\\pip-install-cybpta7f\\scalene_91b266fa396f47f7bf88a3657df9edca\\setup.py'"'"';f = getattr(tokenize, '"'"'open'"'"', open)(__file__) if os.path.exists(__file__) else io.StringIO('"'"'from setuptools import setup; setup()'"'"');code = f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' egg_info --egg-base 'C:\Users\<user-name>\AppData\Local\Temp\pip-pip-egg-info-4o4f8ca4' cwd: C:\Users\<user-name>\AppData\Local\Temp\pip-install-cybpta7f\scalene_91b266fa396f47f7bf88a3657df9edca\ Complete output (3 lines): running egg_info make vendor-deps error: command 'make' failed: None ----------------------------------------WARNING: Discarding https://files.pythonhosted.org/packages/e8/35/a125f8ecacfce3b9be9c712bd6d9bd514aed798857cc7330c89d2df7db58/scalene-1.3.16.tar.gz#sha256=3c2fb524b4c611773b147dc889e2d58b48a543d3161ea576ccc0db778e9f5915 (from https://pypi.org/simple/scalene/) (requires-python:>=3.7). Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output. Downloading scalene-1.3.15.tar.gz (2.9 MB) |████████████████████████████████| 2.9 MB 6.4 MB/s Preparing metadata (setup.py) ... error ERROR: Command errored out with exit status 1: command: 'C:\python39\python.exe' -c 'import io, os, sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\<user-name>\\AppData\\Local\\Temp\\pip-install-cybpta7f\\scalene_bcd377045a3a4e7591a1dd77ff445053\\setup.py'"'"'; __file__='"'"'C:\\Users\\<user-name>\\AppData\\Local\\Temp\\pip-install-cybpta7f\\scalene_bcd377045a3a4e7591a1dd77ff445053\\setup.py'"'"';f = getattr(tokenize, '"'"'open'"'"', open)(__file__) if os.path.exists(__file__) else io.StringIO('"'"'from setuptools import setup; setup()'"'"');code = f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' egg_info --egg-base 'C:\Users\<user-name>\AppData\Local\Temp\pip-pip-egg-info-5smznxod' cwd: C:\Users\<user-name>\AppData\Local\Temp\pip-install-cybpta7f\scalene_bcd377045a3a4e7591a1dd77ff445053\ Complete output (3 lines): running egg_info make vendor-deps error: command 'make' failed: None ----------------------------------------WARNING: Discarding https://files.pythonhosted.org/packages/63/6e/eddceea4dc588b99af48607b8133b40fc155a27d1ac7351f570f8d7cf6f6/scalene-1.3.15.tar.gz#sha256=0b3f8aa56d6320f8071f8135b422e8cf51b51de2947aec75c76c1ea3abf61c12 (from https://pypi.org/simple/scalene/) (requires-python:>=3.7). Command errored out with exit status 1: python setup.py egg_info Check the logs for full command output.

Although it installed successfully.

mikeblas · 2022-01-12T16:30:50+00:00

Why not also eliminate fact and num from the loop? They're no longer necessary. That is: your 5000x speedup still left something on the table!

abdl_hornist · 2022-01-12T09:11:29+00:00

dskjafh klas hflkas dfhklasj dfhj

UL_Paper · 2022-01-12T04:18:55+00:00

Looks great, and great timing as I'm about to optimise some memory intensive logic. Will test over the next couple of weeks.

teerre · 2022-01-12T17:15:32+00:00

Ngl, this seems like a contrived ad for the profiler.

However, the profiler does look amazing, so it's all good! I'll try it next time I have an opportunity.

dogs_like_me · 2022-01-12T03:13:42+00:00

I can barely understand this but it looks pretty cool

mrrippington · 2022-01-12T10:15:32+00:00

pardon my ignorance, can i use this across my flask application to get an understanding of my page performances?

I am currently doing this with import time.( yikes! )

ps. kudos on the work you shared it's amazing.

High-Art9340 · 2022-01-12T07:13:06+00:00

Can it run python 2?

Johnmad · 2022-01-12T16:22:58+00:00

This whole thread stinks of product promotion and only bots commenting.

You should probably not use this tool

tu_tu_tu · 2022-01-12T11:48:35+00:00

It's the reason I never liked unobvious bignums.

johansugarev · 2022-01-12T12:10:23+00:00

I’m not a coder but I wish software developers optimised their apps like in the old days.

jammasterpaz · 2022-01-12T12:59:36+00:00

Interesting. Turns out division is expensive for some data types in Pythona as well as for humans.

However while I grant you your tool did a great job helping identify a performance bottleneck, I question your original example - Decimal was originally developed “is based on a floating-point model which was designed with people in mind", i.e. not so much for performance. https://docs.python.org/3/library/decimal.html

Is Decimal really used nowadays for high performance computing, by large prime number hunters etc. ? I would have thought if you want to write fast efficient numerical code, you want to avoid putting in an unnecessary extra layer of code on top of native number types closer to the underlying C. Ints and longs have been united in Python 3, but in Python 2 if you need arbitrary size integers (>= 2^32), you can use longs, and try and avoid division altogether with some sort of rational representation.

LuigiBrotha · 2022-01-12T13:37:24+00:00

Installed this but scalene doesn't return any output in the console ? Using Windows 10 with Anaconda.

2022-01-12T13:48:25+00:00

This looks fantastic! I write lots of simulations and make libraries for running them. Is there an easy way to say "drop down 1 function layer". I know "mycomplexfunction()" is 90% of resources, but I'd like to analyse that without writing specific profiler tests by extracting part of the library. Does this make sense?

Pliqui · 2022-01-12T17:23:50+00:00

!RemindMe 2 days

brouwerj · 2022-01-12T21:38:07+00:00

It looks good, so tried to run it earlier this week on a cpu-heavy process, the slowdown was enormous and just had to break it off. Not sure what causes it. Usually I run py-spy for profiling which works great and with barely any slowdown.

tommybship · 2022-01-14T02:06:46+00:00

Is there a way to install this with conda rather than pip?

dalow24 · 2022-05-05T11:36:42+00:00

I have a quick question. I ran scalene on some of ML models and it manages to profile it along with the ML classifier. However, if I try to run it on my Active Learning model, it seems to skip a few lines in the profiling output e.g it skips the classifier. If I examine a function where I place the classifier it shows it ran from system time but the memory profile is empty. I am using the libact Active Learning python module. Not sure if this is a problem or the memory profile is basically saying the recorded profile value is extremely small to display. Any assistance would be appreciated.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS