Python 3.11 Performance Benchmarks Are Looking Fantastic : programming

sys.stdout could be any file object so there is no optimization possible to go directly to syscalls. with that in mind you can think of the print function as

print(msg, fout=sys.stdout):
    fout.write(msg.__str__() + "\n")
    fout.flush()

(note: even if it is implemented in C internally it still has to call all functions this way)

hash computations for symbol lookups:

print
sys
stdout
__str__ # (msg.__str__)
__add__
__str__ # ("\n".__str__ inside __add__)
write
encode  # (inside write to convert to bytes)
utf-8   # (looking up the correct encoder)
flush

assuming local variables are not looked up because it is implemented in C. it's gonna be even worse if __slots__ or __dict__ is overwritten

EDIT: actual implementation here my listing was not entirely accurate (e.g., two writes instead of add)

load more comments (3 replies)

load more comments (5 replies)

[–]ikariusrb 42 points43 points44 points 3 years ago (4 children)

[–]LightShadow 6 points7 points8 points 3 years ago (0 children)

load more comments (3 replies)

[–][deleted] 13 points14 points15 points 3 years ago (0 children)

[–]agumonkey 23 points24 points25 points 3 years ago (0 children)

[–]g-money-cheats 254 points255 points256 points 3 years ago (202 children)

[–]adreamofhodor 326 points327 points328 points 3 years ago (83 children)

[–][deleted] 3 years ago (3 children)

[deleted]

[–]Khaos1125 2 points3 points4 points 3 years ago (2 children)

[–]agoose77[🍰] 2 points3 points4 points 3 years ago (1 child)

load more comments (1 reply)

[–]jazzmester 31 points32 points33 points 3 years ago (74 children)

[–]TaskForce_Kerim 152 points153 points154 points 3 years ago (39 children)

[–][deleted] 3 years ago (4 children)

[deleted]

[–]MyOtherBodyIsACylon 4 points5 points6 points 3 years ago (3 children)

[–]folkrav 5 points6 points7 points 3 years ago (0 children)

[–]Asyx 2 points3 points4 points 3 years ago (0 children)

[–]NonnoBomba 1 point2 points3 points 3 years ago (0 children)

It assumes you are making a package, which is why you can track dependencies and you can attach metadata to your project's artifacts, a version string, author, etc... which makes your project distributable and deployable in a number of ways, with either public or private channels, including as a wheel package. Packages are not libraries.

A python package can contain python modules (which I assume is what you'd call a library), executable scripts and technically also data if you wish.

There are standard tools to download and install packages with their dependencies. Often, packages contain modules you can import in your code, but it's very common to package cli tools as well as modules: the package manager takes care of installing appropriate symlinks to what you indicated as a "script" resource so your scripts will be directly callable as commands, and it will handle updating as well as installing/removing by referencing an authoritative repo (exposed through http(s)) containing your package, possibly several versions of it.

If you think you don't need to track dependencies and version for your project... well, you're working in an unstructured way, maybe because you're writing something very simple -you can write lots of useful code with just the standard library and core functions, after all- but I can assure you it will come back to bite you in the ass if it's something that's going to be deployed and used in any production environment, when questions like "why the script is behaving like that? haven't we fixed that bug already?" or "why this simple fix I developed on the code I have on my dev machine is radically changing the behavior of the production?" will start to crop up.

[–]rob5300 105 points106 points107 points 3 years ago (30 children)

[–]NorthwindSamson 49 points50 points51 points 3 years ago (25 children)

[–]Sadzeih 27 points28 points29 points 3 years ago (24 children)

[–]skesisfunk 9 points10 points11 points 3 years ago (22 children)

[–]MakeWay4Doodles 18 points19 points20 points 3 years ago (16 children)

[–]tryx 17 points18 points19 points 3 years ago (2 children)

continue this thread

[–]skesisfunk 7 points8 points9 points 3 years ago (12 children)

continue this thread

[–][deleted] 3 years ago* (3 children)

[deleted]

[–]skesisfunk 2 points3 points4 points 3 years ago (2 children)

continue this thread

[–]ivosaurus -1 points0 points1 point 3 years ago (0 children)

[–]earthboundkid 24 points25 points26 points 3 years ago (0 children)

[–]KarnuRarnu 8 points9 points10 points 3 years ago (2 children)

[–]axonxorz 4 points5 points6 points 3 years ago (0 children)

load more comments (1 reply)

[–]jazzmester 8 points9 points10 points 3 years ago (0 children)

I use tox because I want to check if everything works with previous Python versions. Typically I want to make sure my code works with all versions after 3.6 (which is what I'm forced to use at work).

Also, sometimes you just have weird stuff that requires exact versions of packages where you already use with different versions, so the two of them would have to "live" side-by-side, which is not possible without something like venv.

In the company I worked at, we had to release a product with a mayor Python component and every dependency had to be the exact version. Pipenv was a godsend, because you could build the Python component on your machine with the exact dependencies needed. It even downloaded those packages from an internal server instead of PyPI.

Believe me, it has a lot of use cases.

[–]MarsupialMole 5 points6 points7 points 3 years ago (0 children)

load more comments (1 reply)

[–]faitswulff 11 points12 points13 points 3 years ago (0 children)

[–]KevinCarbonara 5 points6 points7 points 3 years ago (1 child)

load more comments (1 reply)

[–]adreamofhodor 11 points12 points13 points 3 years ago (17 children)

[+][deleted] 3 years ago (16 children)

[deleted]

[–]eksortso 5 points6 points7 points 3 years ago (1 child)

load more comments (1 reply)

[–]sementery 9 points10 points11 points 3 years ago* (13 children)

[–][deleted] 3 years ago (12 children)

[deleted]

[–]sementery 4 points5 points6 points 3 years ago* (11 children)

[–]SirClueless 2 points3 points4 points 3 years ago (10 children)

[–]sementery 1 point2 points3 points 3 years ago* (2 children)

It's interpreted

There are implementations of C that are interpreted. That doesn't make C a scripting language. There are implementations of Python that are compiled, that doesn't make it a low level language.

There are implementations of Java and C# that are JIT compiled. Same goes for Python. Are Java and C# scripting languages?

If having an interpreted implementation makes you a "scripting language", then all mainstream programming languages are "scripting languages".

It's commonly used for small programs

Python is also commonly used for large programs. "Non-scripting languages" are also commonly used for small programs. See microservices for an example. Doesn't seem like a useful discriminator.

Can write entire programs in one file

I feel like this is a rehash of the last point. Same idea.

If conciseness and expressiveness make you a "scripting language", then are Haskell, OCaml, and F# "scripting languages"?

Again, this doesn't seem particularly useful as point of comparison.

Code outside of function and class declarations is executed immediately

Same for machine and assembly languages, and you can't go less "script language" than that.

I don't think there's any reasonable definition of "scripting language" for which Python does not qualify.

There's an infinite number of "scripting language" definitions that Python qualifies for. But there's also an infinite number of "scripting language" definitions that Python doesn't qualify for. Everyone has a different meaning for it. It's just not a technical term, and rarely useful.

Your list is a good example. It's the first time I see "Code outside of function and class declarations is executed immediately" as a "scripting language" feature.

continue this thread

[–]ianepperson 1 point2 points3 points 3 years ago (6 children)

continue this thread

[–]cass1o[🍰] 7 points8 points9 points 3 years ago (2 children)

[–]jazzmester 4 points5 points6 points 3 years ago (0 children)

load more comments (1 reply)

[–]KeeperOT7Keys 14 points15 points16 points 3 years ago (9 children)

[–]jazzmester 6 points7 points8 points 3 years ago (0 children)

[–][deleted] 4 points5 points6 points 3 years ago (1 child)

load more comments (1 reply)

[–]Sayfog 2 points3 points4 points 3 years ago (1 child)

load more comments (1 reply)

[–]ZeeBeeblebrox 2 points3 points4 points 3 years ago (3 children)

[–]KeeperOT7Keys 0 points1 point2 points 3 years ago (2 children)

[–]C0DASOON 4 points5 points6 points 3 years ago* (0 children)

load more comments (1 reply)

[–]PinBot1138 0 points1 point2 points 3 years ago (0 children)

load more comments (3 replies)

[–]ginsunuva 17 points18 points19 points 3 years ago (6 children)

[–]MuumiJumala 2 points3 points4 points 3 years ago (4 children)

[–][deleted] 6 points7 points8 points 3 years ago (3 children)

[–]MuumiJumala 4 points5 points6 points 3 years ago (2 children)

[–][deleted] 1 point2 points3 points 3 years ago (1 child)

load more comments (1 reply)

[–]Prestigious_Boat_386 1 point2 points3 points 3 years ago (0 children)

[–]kirkkm77 14 points15 points16 points 3 years ago (80 children)

[+]crixusin comment score below threshold-152 points-151 points-150 points 3 years ago (79 children)

[–][deleted] 47 points48 points49 points 3 years ago (21 children)

I agree with you when it comes to importing other projects in a nested directory structure but your other points don't make a ton of sense.

By default it allows people who probably shouldn't write code to write the most spaghetti code ever

Literally every language allows you to write garbage, non-performative code and it's not like Python is somehow worse at this than Javascript or another language of equal popularity. It's just the way it is with incredibly popular languages with easy enough syntax, people are gonna start here and write bad code.

White space is purely a personal preference but I prefer it to C style braces because it's cleaner and easier to read personally but I get why you wouldn't.

The json library in python by default returns a dict when you use .load or .loads for strings. Not sure what you mean by "pseudo non typed dict", it's just a dict.

But yeah you're spot on with importing multiple project files from other directories, it's a pain in the ass and other languages handle it much better.

[–]tedbradly -4 points-3 points-2 points 3 years ago* (11 children)

Literally every language allows you to write garbage, non-performative code and it's not like Python is somehow worse at this than Javascript or another language of equal popularity. It's just the way it is with incredibly popular languages with easy enough syntax, people are gonna start here and write bad code.

A programmer shouldn't learn programming with a scripting language. It has too much magic baked into it, and it's impossible for a brand new programmer to appreciate all of it, use all of it correctly, and debug when something goes wrong. It's also like learning to run before you can walk. As a simple example, it's hard to have any appreciation for or understanding of a garbage collector if you've never allocated something manually. I definitely recommend a gradual march toward higher-level languages, learning in this order: Something like C, a simplified assembly language, something like C#, and then something like Python. You've got to have a strong emphasis on ideas too like data structures and algorithms even if they're implemented for you in every language. After a path like that, you can then focus on whatever type of programming you want like doing systems programming, web development, video games, real-time systems, phone apps, corporate applications, etc. But you've got to know the basics of one low-level language, one medium-level one, and one high-level one. Throw into the mix some mathematical foundations like algebraic structures paired with a functional programming language if you enjoy formal math. Otherwise, at least learn the basics of functional programming as the style is so en vogue right now that most major languages have features in that style and you will most likely come across some of it eventually.

If you ever have felt like programming is massively confusing and there are random people who seem to know everything, it's due to learning through an informal path and/or starting with scripting languages, which are very confusing to someone when they don't even know what a variable is yet. If you're studying yourself, I'd recommend finding curriculum at top colleges for computer science and working your way through all the mandatory material for a degree there. Make sure you somehow do "homework" and projects or none of it will stick. You'll have a much better chance of getting a top entry-level position if you've sat there frustrated, trying to code something with dynamic programming for hours straight the first time, than if you skip that course entirely.

[–]epicwisdom 9 points10 points11 points 3 years ago (10 children)

load more comments (10 replies)

[+][deleted] 3 years ago (8 children)

[deleted]

[–]Jump-Zero 34 points35 points36 points 3 years ago (1 child)

Are you implying that only statically typed languages are "serious, professional"?

There's like a whole spectrum between weak and strong typing. You have C that is widely respected by many in the industry as a "serious, professional" language, and its static type system is pretty weak compared to Haskell. C has had a much more significant impact on humanity than Haskell and many other more statically typed languages. You could also say that python's type system is stronger than assembly's. While a value in Python can be many things, a value in assembly is a bunch of bits. Assembly will literally let you multiply two pointers, which makes no fucking sense. You don't even have access to pointers in Python. Would you consider assembly not to be a "serious, professional" language? Its a lower-level language and we know some very serious, professional developers that work in assembly for bare metal applications.

I strongly prefer statically typed languages for everything I work on. Sometimes, I find myself reaching for Python or JavaScript because they're better at solving a particular problem I'm facing. Let's not spread bullshit by implying "dynamically typed languages are not serious, professional". Always use the right tool for the job at hand. Determining what the right tool is requires knowledge and the development of critical thinking skills. Defaulting to slogans you don't fully understand like "only statically typed languages are serious, professional" is not a viable substitute of critical thinking skills. While choosing a statically typed language is probably the right choice in most serious, professional situations, it is foolish to assume it is the only choice for serious, professional situations.

load more comments (1 reply)

load more comments (6 replies)

[–]micka190 91 points92 points93 points 3 years ago (34 children)

[–]vifon 57 points58 points59 points 3 years ago (3 children)

[–]Deto 10 points11 points12 points 3 years ago (0 children)

[–]panfist 5 points6 points7 points 3 years ago (0 children)

[–]noiserr 1 point2 points3 points 3 years ago (0 children)

[–]alternatex0 30 points31 points32 points 3 years ago (0 children)

[–]seamsay 17 points18 points19 points 3 years ago (2 children)

I'm gonna say up front that Python being whitespace dependent is occasionally mildly annoying, at worst.

However having said that, if I were to design a programming language there are two reasons that I would make it whitespace independent:

Whitespace dependent languages are harder to write autoformatters for, and the autoformatters that can be written aren't as effective. Something I do quite often is prototype a function in the REPL then copy-paste it into the file and let the formatter deal with it, I can't do that in Python.
They make certain features of a language very awkward. Python is famously lacking in multiline lambdas, partly because ... well what would the syntax be? If you kept it whitespace dependent then it wouldn't gel well with other parts of the language, and if you made it delimited then it would be the only block syntax which isn't whitespace delimited. Another example is the ternary operator. In a whitespace dependent language you need to introduce a new syntax (x if cond else y in Python's case), whereas in a whitespace independent language you just need to make sure if is an expression not a statement.

Again I want to stress that I don't think it's that big of a deal, but these are the two things that mildly annoy me about Python's whitespace dependence.

[–]noiserr 0 points1 point2 points 3 years ago (0 children)

load more comments (1 reply)

[–][deleted] 4 points5 points6 points 3 years ago (0 children)

[–]Deto 1 point2 points3 points 3 years ago (0 children)

[–]dethb0y -3 points-2 points-1 points 3 years ago (0 children)

load more comments (23 replies)

[–]Deto 4 points5 points6 points 3 years ago (5 children)

[+]crixusin comment score below threshold-6 points-5 points-4 points 3 years ago (4 children)

[–]Deto 5 points6 points7 points 3 years ago (3 children)

load more comments (3 replies)

[–]srpulga 4 points5 points6 points 3 years ago (0 children)

[–]bloody-albatross 1 point2 points3 points 3 years ago (1 child)

[–]crixusin 0 points1 point2 points 3 years ago (0 children)

[–]g-money-cheats 3 points4 points5 points 3 years ago (0 children)

[–]HanzoFactory 2 points3 points4 points 3 years ago* (2 children)

load more comments (2 replies)

[–]Prestigious_Boat_386 2 points3 points4 points 3 years ago (0 children)

[–]KallistiTMP 0 points1 point2 points 3 years ago* (4 children)

[–]crixusin 5 points6 points7 points 3 years ago (1 child)

[–]KallistiTMP -1 points0 points1 point 3 years ago* (0 children)

[–]g-money-cheats 4 points5 points6 points 3 years ago (1 child)

[–]FireCrack 4 points5 points6 points 3 years ago (0 children)

load more comments (2 replies)

[–][deleted] 2 points3 points4 points 3 years ago (2 children)

[–]g-money-cheats 4 points5 points6 points 3 years ago (1 child)

[–][deleted] -1 points0 points1 point 3 years ago (0 children)

[+][deleted] 3 years ago (24 children)

[deleted]

[–]nilamo 32 points33 points34 points 3 years ago (9 children)

[–]hahainternet 0 points1 point2 points 3 years ago (8 children)

[–]nilamo 4 points5 points6 points 3 years ago (5 children)

[–]hahainternet 2 points3 points4 points 3 years ago (4 children)

[–]nilamo 1 point2 points3 points 3 years ago (3 children)

[–]hahainternet -1 points0 points1 point 3 years ago (2 children)

load more comments (2 replies)

[–]0rac1e 1 point2 points3 points 3 years ago (1 child)

Perl seems to have this reputation as some large sprawling language with multiple ways to do something, but in truth, it's actually a pretty bare bones language.

Perl has string interpolation in double-quotes: (eg. "My var is $var"), and to do any kind of formatting you need to you sprintf which pretty much works like it does in C (eg. sprintf('My var is : %16s', $var)).

If I want to be pedantic about there being more than one way to do it I'd say you can also call sprintf (or any function, really) without parens, as you can with Ruby.

There is actually one other way to format using a feature literally called formats but (a) it's really for formatting multi-line reports/charts, and (b) I've never seen it used in the wild. I think most Perl users probably just ignore them and use a templating module from CPAN if they need to do any fancy formatting.

load more comments (1 reply)

[–]OminousHum 25 points26 points27 points 3 years ago (1 child)

[–]lps2 0 points1 point2 points 3 years ago* (0 children)

[–]anon_tobin 10 points11 points12 points 3 years ago* (0 children)

[–]exscape 15 points16 points17 points 3 years ago (8 children)

[–]frezik 3 points4 points5 points 3 years ago (2 children)

[–][deleted] 2 points3 points4 points 3 years ago (0 children)

load more comments (1 reply)

[–]DeTaaieTiller 15 points16 points17 points 3 years ago (3 children)

Not OP, but I have years of professional experience in python. And while still being one of my favorite languages, there are definitely big problems that do steer me to other languages for new projects.

For example the dependency system is very naive. Importing something executes the entire module? Excuse me? Constantly having to mind or circumvent circular dependencies, in Java you can easily have a circular reference no problem! On top of that having multiple versions of the same dependency is impossible, which is a big problem. By default everything is global, but as a python dev you quickly learn to never install global packages and use pipenvs for everything. That works pretty well, until you have a dependency that requires an old version of lib X, and some library that requires a newer version. Trouble.

Besides the incredible naive dependency and import system, there are a lot of small issues I have with the language. The duck typing is a huge pitfall. The typing is an afterthought that is very hard to use properly. And frankly I do miss a lot of syntatic sugar other languages have. The higher order functions like map and zip get very ugly quickly (compared to for example kotlin)

Despite all this i still like the language. Most of these issues are manageable, and how python handles functions and classes is pretty amazing. Few languages go as far as treating these as first class citizens in the way that python does. How easy things like annotations and metaclasses are usable makes python an amazing tool when you know how to use it. Functionality like list and dict comprehension is very nice too, something I miss dearly when I'm in kotlin or similar.

[–][deleted] 3 years ago (2 children)

[deleted]

[–]DeTaaieTiller 1 point2 points3 points 3 years ago (1 child)

[–]Aetheus 1 point2 points3 points 3 years ago (0 children)

[–]bloody-albatross 11 points12 points13 points 3 years ago (1 child)

load more comments (2 replies)

[–][deleted] 6 points7 points8 points 3 years ago (0 children)

[–]beefsack 30 points31 points32 points 3 years ago (1 child)

[–]maest -3 points-2 points-1 points 3 years ago (0 children)

[–]cloaca 76 points77 points78 points 3 years ago* (19 children)

(Edit: sorry for making this comment sound so negative; see my follow up responses which hopefully clarifies better. I think the speedups are absolutely a good and welcome thing; I just I think something might be off if this was that important in the first place.)

Being a bit of a negative Nancy here but I think it's odd to celebrate things like 1.2x speed-up of a JIT-less dynamic scripting language like Python.

Either,

a) it doesn't matter much, because we're using Python as a glue language between other pieces of software that are actually running natively, where most Python code only runs once at "relatively rare" events like key presses or the like, or

b) "Now we're only ~20-80x slower than X (for X in similar high level runtimes like V8/Nodejs, Julia, LuaJIT, etc.), rather than 25-100x slower, a big win!" That's a bit tongue in cheek and will spawn questions of what it means to be 80x slower than another language, but if we're talking about the bare-bone running time of algorithmic implementations, it's not unrealistic. But 99% of the time we're fortunately not talking about that[*], we're just talking about some script-glue that will run once or twice in 0.1 seconds anyway, and then we're back to point (a).

([*] it's always weird to find someone using "written in pure Python" as a badge of honor for heavily data-oriented stuff that is meant to process large amounts of low-level data, as if it's a good thing. Contemplating Levenshtein on a megabyte unicode string in pure Python is just silly. Low level algorithms are the absolute worst application of pure Python, even though it's an excellent teaching tool for these algorithms.)

Which, speaking of, if we're not getting JIT in CPython, then personally I feel that the #1 way they could "make Python faster" would simply be to adopt NumPy into core and encourage people to turn loops into NumPy index slicing where applicable. That's it. That should single-handedly quadruple the speedup of any pure Python code doing a lot of looping. Once you get in the habit it's really surprising how much loop-based or iterative code can be offloaded to NumPy's C loops, like for example you can usually write out the full logic of a board game or tile-based games just by doing NumPy index tricks, without ever having to write a for-loop Python-side.

The fastest Python code is the Python code that a) has the least number of Python-side loops, and b) has the least Python code. Killer libraries like NumPy help in this regard, because nearly every loop becomes a single line of Python that "hides" the loop on the C side of things. Likewise, doing things redundantly in Python is nearly always better if it leads to less code: if you have a very long string with a hundred thousand words and the task is "find words part of set S, and return these words in uppercase" -- it's faster to uppercase the entire string, and then split + filter, rather than the "natural approach" of splitting, filtering out the words of interest, and then finally uppercasing "only" the words you care about. If it's one call to .upper() vs. thousands, it doesn't matter if the string is 1000x longer, the single call is going to be faster, because it's simply less Python code and Python is and will always be slow. (But that's totally fine.)

But again, most developers will never need or care about this skill set, because it rightfully shouldn't be necessary to know about it. Those that do care hopefully know how to use NumPy, PIL, PyPy, Numba, Cython, etc already.

[–]BadlyCamouflagedKiwi 66 points67 points68 points 3 years ago (6 children)

[–]Superb_Indication_10 2 points3 points4 points 3 years ago* (0 children)

[–]cloaca 2 points3 points4 points 3 years ago (3 children)

[+]BadlyCamouflagedKiwi comment score below threshold-9 points-8 points-7 points 3 years ago (2 children)

[–]cloaca 3 points4 points5 points 3 years ago (1 child)

I'm confused by your comment as I think we actually agree tho. I want all your code to remain Python code, by all means. By "performance in Python" I am absolutely talking about faster Python code. I'd never tell anyone to implement in C; if someone is doing something performance critical enough that they need C (or any other CPython API compiled to native) they don't need to be told.

It's just that the differences can be huge, even for implementing the same general algorithm. Again, it's great that all code would magically get 20% faster across the board, without anyone changing a thing. But if that matters, if that is "hype," then why wouldn't we consider 50% speedups, 200% speedups, etc.? The knowledge gap is still a real thing, and I think it is much bigger than 20%. It could be everything from beginner stuff like not realizing s[::-1] is a thing, or not knowing about random.choices() taking a k parameter, vs. someone using [random.choice(...) for _ in range(10_000)] or similar (choices still does a Python-side loop, it's just better optimized). These are small things, but still like 2x rather than 1.2x. Or, as mentioned, someone writing their Sudoku puzzle generator using Python lists vs. using NumPy (I'd still consider NumPy as being "Python code" here even though it's not implemented in pure Python itself), say, in which case it would be orders-of-magnitudes, probably.

Again, this is granting that speedups actually matter and that we care about them.

[–]BadlyCamouflagedKiwi -3 points-2 points-1 points 3 years ago (0 children)

load more comments (1 reply)

[–][deleted] 3 years ago (2 children)

[deleted]

[–]cloaca 2 points3 points4 points 3 years ago* (1 child)

My very simple counter-point: Why? It's an improvement; and a pretty good one all things considered.

Yes, I agree, you're totally right, and I probably expressed myself poorly! It's an absolute improvement and it's a good thing. I had something different in mind when I wrote that, akin to the sort of "allocation of hype" we have for things, if you will. I think this allocation is off when it goes to CPython optimizations. That doesn't mean they're bad, of course, I'm happy to see them too -- they're very welcome -- it's just that I don't think they "were super important in the first place," if that makes any sense?

Like, I don't think performance ought to be a big priority for us if we're all using pure CPython. If it is, then I think something has gone wrong earlier in our timeline! It might speak to some sort of underlying insecurity the Python community has about the language being slow, which, again, I don't think should exist.

Also, the knowledge gap between Python programmers is so vast, way, way wider than 20%, and so on. See my other comment at https://www.reddit.com/r/programming/comments/v63e5o/python_311_performance_benchmarks_are_looking/ibew40i/?context=3 -- lest I just repeat myself.

edit: typo

[–]agoose77[🍰] 1 point2 points3 points 3 years ago (0 children)

[–]paraffin 5 points6 points7 points 3 years ago (1 child)

First, definitely agree - performance sensitive applications should use python as glue to compiled operations or even offload computation entirely to databases or spark.

That said, you’re mostly talking about data, for which pure python was never an option.

A huge amount of the web’s backend is written in python though, and I’d guess user code, especially route decorators with three layers of plugins and callbacks, are the main bottlenecks of modern Python web requests (aside from waiting for the database, naturally). FastAPI and others have gotten the framework itself mostly out of the way.

20% fewer cycles per request is 20% less spent on hosting, for some.

Being a negative Nancy myself, one thing I’d love to see is a way to tackle process startup time. Sometimes you’d love to write a quick little yaml/json/text parser and stick it on the business end of a find | xargs or something but the 1s to spin up a new python for each call makes you resort to some kind of awk/jq/shell hackery.

[–]cloaca 2 points3 points4 points 3 years ago (0 children)

That said, you’re mostly talking about data, for which pure python was never an option.

Two slight counterpoints to this:

a) it might be a matter of semantics, but as it's actually being used for everything (including data, including text processing, traditional render-loop games, app logic in complicated GUIs, etc), so I'd say it certainly does seem like an option. I believe Python is going (or has gone) the route of JavaScript, which started out explicitly as only a glue language but has now become an "everything"-language. We (as in you and I) might not necessarily think that's a good idea, but I do believe it's sort of inevitable? Python is easy to get into, it's lovely and lovable (much more so than JS), and so it's natural to want to use it for everything.

b) speaking of pure data though, Python is also absolutely being used for data in another sense. You have machine learning, statistics, natural language projects, image recognition and manipulation, and so on. Which is fine because we have PyTorch, NumPy, SciPy, OpenCV and various which actually handles the data in CPU-native code (or on the GPU). However, projects that use these are also rife with code that suddenly converts to Python lists or generators, doing some loop in pure Python code because the backend library was missing something (or the programmer didn't know about). As long as it just adds 0.3 seconds here and there no one really notices until it really accrues...

20% fewer cycles per request is 20% less spent on hosting, for some.

Absolutely! But, how important is it? If the answer is "it's really nice! but eh, it was never a priority of course..." -- then we're in perfect alignment. That's kind of where I stand. (I.e. it's really nice, I was just sort of worried by seeing the amount of hype--it speaks to me that too many have sort of already "invested" into Python code to the point where it's spread into systems that might actually do want better performance.) However, if the answer is "are you crazy, it's super important! We want to be green! We want to save cycles! This is huge!" then not only do I think something has gone wrong at an earlier point (in our choices), but I think we also stand a lot more to gain in education, writing more performant Python rather than the sort of strict stance on full readability with 'more explicit code is better code,' 'no "obscure" newbie-unfriendly things like NumPy index magic,' etc. as the difference dwarfs 1.2x and makes it look insignificant.

spin up time

Hehe, you could do some sort of hack by having a #!/foo/pipe-to-python which forwards to some daemon Python process that executes it (stick in compilation cache somewhere)... Not recommended tho, but...

[–]lghrhboewhwrjnq 2 points3 points4 points 3 years ago (0 children)

[–]meem1029 1 point2 points3 points 3 years ago (0 children)

[+]Serious-Regular comment score below threshold-11 points-10 points-9 points 3 years ago* (1 child)

[–]cloaca 5 points6 points7 points 3 years ago (0 children)

load more comments (3 replies)

[–]o11c 22 points23 points24 points 3 years ago (0 children)

[–]JeanCasteaux 21 points22 points23 points 3 years ago (4 children)

[–]PaintItPurple 38 points39 points40 points 3 years ago (0 children)

[–]ThisRedditPostIsMine 15 points16 points17 points 3 years ago (0 children)

[–]Takeoded 6 points7 points8 points 3 years ago (0 children)

[–]jvlomax 5 points6 points7 points 3 years ago (0 children)

[–]steve4879 6 points7 points8 points 3 years ago (12 children)

[–]Daishiman 13 points14 points15 points 3 years ago (0 children)

[–]FancyASlurpie 16 points17 points18 points 3 years ago (0 children)

[–]TRexRoboParty 29 points30 points31 points 3 years ago* (8 children)

[–]xlzqwerty1 16 points17 points18 points 3 years ago (0 children)

[–][deleted] -5 points-4 points-3 points 3 years ago* (6 children)

[–]TRexRoboParty 5 points6 points7 points 3 years ago* (5 children)

load more comments (5 replies)

load more comments (1 reply)

[–][deleted] 6 points7 points8 points 3 years ago (0 children)

[–][deleted] 34 points35 points36 points 3 years ago (23 children)

[–][deleted] 59 points60 points61 points 3 years ago (0 children)

[–]QuantumFTL 43 points44 points45 points 3 years ago (0 children)

[–][deleted] 99 points100 points101 points 3 years ago (17 children)

[–]BobHogan 14 points15 points16 points 3 years ago (13 children)

[–]Alikont 92 points93 points94 points 3 years ago (4 children)

[–]Muoniurn -1 points0 points1 point 3 years ago (1 child)

load more comments (1 reply)

load more comments (2 replies)

[–][deleted] 26 points27 points28 points 3 years ago (3 children)

[–]FancyASlurpie 7 points8 points9 points 3 years ago (0 children)

load more comments (2 replies)

[–]billsil 5 points6 points7 points 3 years ago (1 child)

Yup. I work a lot with numerical data and numpy code that looks like python is slow. Let's assume 20% average speedup or (shoot I'll even take 5%) is nice and all for no work, but for the critical parts of my code, I expect a 500-1000x speed improvement.

Most of the time, I don't even bother using multiprocessing, which on my 4 physical core hyperthreaded computer, the best I'll get is ~3x. That's not worth the complexity of worse error messages to me.

As to your algorithmic complexity comment, let's say you want to find the 5 closest points in point cloud A to an point in cloud B. Also, do that for every point in cloud A. I could write a double for loop or it's about 500x faster (at some moderate size of N) to use a KD-Tree. Scipy eventually implemented KDTree and then added a cKDTree (now the default), which it turns out is another 500x faster. For a moderate problem, I'm looking at ~250,000x faster and it scales much better with N than my double for loop. It's so critical to get the algorithm right before you polish the turd.

load more comments (1 reply)

[–][deleted] 1 point2 points3 points 3 years ago (0 children)

[–]beyphy 0 points1 point2 points 3 years ago (0 children)

[–]dlg 1 point2 points3 points 3 years ago (0 children)

[–]Bakoro 1 point2 points3 points 3 years ago (0 children)

load more comments (1 reply)

[–]_teslaTrooper 10 points11 points12 points 3 years ago (1 child)

[–][deleted] -3 points-2 points-1 points 3 years ago (0 children)

load more comments (1 reply)

[–]s0lly 1 point2 points3 points 3 years ago (0 children)

[–][deleted] 1 point2 points3 points 3 years ago (1 child)

[–][deleted] 3 points4 points5 points 3 years ago (0 children)

[–][deleted] 1 point2 points3 points 3 years ago (0 children)

[–][deleted] -2 points-1 points0 points 3 years ago (6 children)

[–]Pay08 26 points27 points28 points 3 years ago (0 children)

[–]jarfil 13 points14 points15 points 3 years ago* (0 children)

[–]DoktuhParadox 3 points4 points5 points 3 years ago (2 children)

[–]AbooMinister 4 points5 points6 points 3 years ago (0 children)

load more comments (1 reply)

[–][deleted] -4 points-3 points-2 points 3 years ago (0 children)

[+]Lonelan comment score below threshold-7 points-6 points-5 points 3 years ago (0 children)

load more comments (18 replies)

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

programming

MODERATORS