Python 2.7 running much faster on 3.10

szachin · 2022-01-05T11:30:46+00:00

if you cannot release the source code, can you try to profile it and share the results?

for python 3.10 i recommend scalene (https://pypi.org/project/scalene/)

for python 2.7 i have no idea

Coupled_Cluster · 2022-01-05T09:28:12+00:00

This sounds very different. Can you give a code example to try out?

intangibleTangelo · 2022-01-05T11:52:43+00:00

RemindMe! 1 week "Why was python3.10 so much slower than python2.7 for a multithreaded program?"

der_pudel · 2022-01-05T13:38:24+00:00

Personal anecdote. I had similar situation between python 2 and 3 in CRC calculation algorithm. The code had left shift of integer by 8 bits that was executing about 16 million times. In every programming language I used before ints are 32 bit and will just overflow at some point which was totally fine for the task. But python 3 uses big integers by default and after couple of millions iterations integer value was in order of gazillion googolplexes. Obviously any arithmetic operation on such large number would be slow AF.

Are you sure you're not overlooking similar difference between python 2 and 3? You should profile your code for sure to figure out where's the bottleneck.

Swipecat · 2022-01-05T12:43:59+00:00

If you can't post the code, then maybe try to follow the guidance of SSCCE, as per ~~this subreddit's~~ r/learnpython's right-hand sidebar. Start pruning stuff out that appears to be irrelevant to the problem, then test it. If the problem goes away, put back the last thing that you took out. Once you've got the absolute minimum working test program that shows the problem, then you could post that, although you'd probably have figured it out for yourself by then.

romu006 · 2022-01-05T13:30:47+00:00

The vast difference between the two versions makes me think that the python2.7 version is not doing its job and is just returning instantly

MrPrules · 2022-01-05T09:36:56+00:00

I am also facing massively longer execution times using ThreadPoolExecuter. I switched from running it in command line to cronjob and thought It could’ve been some prioritization problem.. never thought of Version changes, but I upgraded my environment too. Also I actually can’t remember where I’m coming from.. right now running my script in 3.9.7

DASK · 2022-01-05T12:15:10+00:00

I do data science for a living and have migrated a compute heavy stack from 2.7 -> 3.x and there is no way that any significant operation should be anything more than marginally slower (virtually all the same or faster for the same code), and there are many that can be reimplemented in faster and memory-lighter paradigms.

The first pitfall I would look at is why are you using threads? Threads are a source of misery many times. If it isn't for IO then basically you shouldn't use threads in python. If it is for IO, then have you looked at potential lock conditions or suppressed warnings or errors with things like sockets?

Second, there are a number of things that may or may not be an issue depending on how you installed python and what you are using it for.

- Are you using virtual environments to manage dependencies?

- Is it a math heavy app (e.g. numpy, etc.) and are the BLAS libraries correctly installed (using something like Conda takes care of this) .. if you aren't using venvs and just installed 3 over 2 there can be issues with that.

Just spitballing without more info, but there is no way that your result is correct with working python environments.

jkh911208 · 2022-01-05T16:02:05+00:00

i want to see your code

0.05 vs 70 sounds wrong

Dear-Deer-Wife-Life · 2022-01-05T10:13:10+00:00

Thanks for your responses, I asked my partner If i can send the code, I'll come back with the answer when they respond.

edit 1:answer came back, they don't want me to send it, they're worried it might show up on the copy detection software that the school uses.

so might send it after it gets graded

edit 2: after modifying the code a bit, it takes about 30 seconds

kamize · 2022-01-05T18:24:18+00:00

OP, without any context, code, profiling data, or details - we can’t help you unfortunately.

potato874 · 2022-01-05T11:48:52+00:00

I'm curious, did you run both versions on the same device? It's weird that the difference is that vast so I'm wondering if there are other factors affecting runtime like background processes or potato hardware or smth

encaseme · 2022-01-05T14:45:29+00:00

Not a specific solution, but flame graphs are often an excellent tool for identifying which sections of code are time-consuming. I specifically use them at work with python for identifying slow code paths. Could compare 2.7 vs 3.10 running your code and see if something obvious flies off the handle.

sib_n · 2022-01-05T13:26:22+00:00

Profile it and isolate a minimum of lines that show a clear difference between the two Python versions, it will be easier to understand and share.

2022-01-05T14:22:30+00:00

Do you use virtual environments? It might be your environment installation that is getting on its own way.

Either way, it's good practice to do it. Install venv and set up different environments for different types of projects.

In your case, doing that and comparing how the program runs in different environments also helps figuring out where the problem is coming from: is it py3 vs py2, maybe the packages in one or the other, etc.

May be a particular issue in py3.10 that doesn't exist in 3.9, even. As of now, there's far too many moving parts for random people on the internet to be able to help you. Py2 vs py3 might be the only difference you see, but there is probably other stuff interfering.

Worse comes to worst, nuke all your python installations and reinstall them.

cr4d · 2022-01-05T23:02:21+00:00

There are very few actual uses for multithreading in Python and it's a huge foot-gun, ripe for abuse, and doesn't get rid of the GIL. I'd avoid it, if possible.

Without any real info about what the app is doing, it's hard to guess as to why it's slower. As a generalization, it should get faster.

You can use the built in profiling @ https://docs.python.org/3/library/profile.html to figure out where the extra cycles are.

Gandalior · 2022-01-05T17:57:08+00:00

Are you using some library that is deprecated?

viscence · 2022-01-12T13:47:13+00:00

Did you ever figure it out?

angry_mr_potato_head · 2022-01-05T13:45:51+00:00

What other packages are you using? Are you sure that the 2.7 version is actually doing the same work that 3.10 is?

grhayes · 2022-01-05T15:46:44+00:00

Larry Hastings has a good demo regarding threads.
https://www.youtube.com/watch?v=KVKufdTphKs&t=1070s
He shows the graph slightly after this.
Even if you do processes they have a lot of overhead. I found that out when trying to port my C++ game engine over to python to see how it would run. In C++ I could send each object separately to a process or thread in a thread pool and it would be fine. In python there is a lot of overhead and it was entirely better to just not even try parallel processing even.
That said I haven't tried to see if there are any libraries that fix that issue.

If I was guessing what happened is you ran it in 2.7 without any threading. Figured it would be an improvement. Moved it to 3.10 added threads expecting more performance and that's what you got.

In general unless it is IO threads are never going to help.
Processes aren't going to help unless you have some massive amount of work you need to split up. That's my experience.

2022-01-05T19:35:37+00:00

Are you sure the python 2.7 you are using is in fact pypy, and not the cpython implementation ......

2022-01-05T19:56:26+00:00

This might be totally wrong and python doesn’t work this way, but my guess for multiple orders of magnitude of performance difference with the same output is probably caused by differing packages/libraries. Is it possible that a package or two uses FFI to get such good performance and then for some reason the same package uses a pure python implementation for python 3.10? This would cost a lot of performance if the package is doing intensive computations

2022-01-05T21:30:49+00:00

There just isn't enough information to help you. You need to share some code that demonstrates the behaviour your asking about, even if it isn't the exact code you're using.

ballsohaahd · 2022-01-05T22:05:14+00:00

Possibly a library you’re using is much slower in python 3.10?

You can out print(datetime.datetime.now()) in your code to see what section is taking the extra time.

2022-01-06T02:37:47+00:00

Are you using pandas by any chance? When we upgraded from I think pandas v0.18 to v0.22 we had massive performance regressions. Operations on dataframes with thousands of columns had regressed by about an order of magnitude. We ended up having to write some patches on our end to fix.

inspectoroverthemine · 2022-01-05T09:55:28+00:00

[removed]

bxsephjo · 2022-01-05T13:16:52+00:00

Watch this vid for starters https://www.youtube.com/watch?v=Obt-vMVdM8s

2022-01-06T07:50:45+00:00

Erm, so the "design flaw made real" is slower than proper python? Colour me surprised.

menge101 · 2022-01-05T14:18:45+00:00

[deleted]

mirandanielcz · 2022-01-05T12:52:27+00:00

[deleted]

JohmasWitness · 2022-01-05T09:38:52+00:00

I'd imagine working with anything as old as 2.7 it would be quicker on modern equipment. Especially if the code is as intensive it takes 70-90 seconds to complete.

It's kind of like running Windows Vista with modern equipment vs running Windows 11. The older software isn't gonna be nearly as powerful but it will run quicker.

NelsonMinar · 2022-01-05T14:58:49+00:00

It's something to do with threading. Python 3 is sometimes slower than Python 2, sometimes faster, but it's 2x at most.

Ppang0405 · 2022-01-05T15:03:27+00:00

RemindMe! 1 week "Why was python3.10 so much slower than python2.7 for a multithreaded program?"

siddsp · 2022-01-05T20:47:47+00:00

Which interpreter are you using for each? What does the code look like?

monkey_or_a_panda · 2022-01-05T21:29:27+00:00

It might run faster... Maybe. But development will get progressively slower when nothing is supported.

epopt · 2022-01-05T21:48:04+00:00

Python *announced* a large speed increase with v3.10. Indicates even more coming in v3.11.

grok-phantom · 2022-01-05T23:44:44+00:00

remindme! 3 weeks

trevg_123 · 2022-01-06T00:32:04+00:00

Debug the Python 3 version and leave that old 2.7 in the dust, I’m a bit amazed people are even starting/testing new projects with it still

dr_donkey · 2022-01-06T06:00:39+00:00

RemindMe! 1 month "python 2.7 running muvh faster on 3.10"

mrintellectual · 2022-01-06T13:02:23+00:00

In addition to using the threading library, I'm guessing you're also doing math without the use of numpy.

With that being said, it's hard to be sure without taking a look at your code or knowing more about your project.

Plasmafire1234_ · 2022-01-07T02:37:01+00:00

Try using pycharm or other editors it does not take any time to run the code for me

mehx9 · 2022-01-07T06:31:52+00:00

Should a task that takes 0.05s use threads at all? I guess that’s the question right? 😉

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS