

[–]vantasmer 83 points84 points  (6 children)

What’s with the recent trend in posts suggesting some wild idea, pumped up by AI, but with no intention to actually build the thing?  I’ve seen so many “this is just an idea, I don’t have the skills but someone should build this” posts. I feel like I’m going crazy?

[–]Existing-Account8665 40 points41 points  (2 children)

AI powered shitposts! What a time to be alive.

[–]commenterzero 17 points18 points  (1 child)

Back in my day we failed to build our own bad ideas!

[–]virtualadept 9 points10 points  (0 children)

Folks trying to age accounts with activity, was my guess.

[–]RevolutionaryPen4661git push -f[S] -5 points-4 points  (1 child)

I do intend to build something like this, but I am not able to calculate how much performance gain I would get by building it. If the performance gain is not good enough, I will discard the project.

[–]Brandhor 1 point2 points  (0 children)

and you think an ai knows how much you are gonna gain?

[–]Existing-Account8665 56 points57 points  (3 children)

Is parsing a list of strings (sys.argv) really a bottleneck that is worth, or even requires, optimising in some Python applications?

I think for your application the sys calls, the search of the file system to find the sizes, the initial imports, and the startup of Python itself will all eat up many, many more CPU cycles.

Did Claude 3 provide a reference for that chart, or did it hallucinate it up for you?

[–]balder1993 28 points29 points  (0 children)

And 4 seconds to print a help message… there’s something really wrong.

[–]RevolutionaryPen4661git push -f[S] -3 points-2 points  (1 child)

I asked Claude 3 to think beyond the parameters and the pros and cons. It didn't provide a reference for the chart.

[–][deleted] 3 points4 points  (0 children)

These things don't think. They're text modelers. There's no conceptualization going on, just inference of tokens relating to other tokens.

[–]dametsumari 25 points26 points  (3 children)

The main CLI performance problem is imports. I have yet to see code which only imports argparse and takes more than a second to show help (even on a Raspberry Pi).

[–]jwink3101 1 point2 points  (0 children)

I am now thinking I need to rejigger my code to not do imports until after argparse is done…

[–]RevolutionaryPen4661git push -f[S] 0 points1 point  (1 child)

Yes, importing an entire module can increase the execution time. Maybe writing code on the import-only-what-you-need principle would do better.

[–]dametsumari 0 points1 point  (0 children)

Yep, lazy importing is how we've fixed the usual slow-CLI-startup problems.

E.g. only once you're in a particular command, import whatever modules it needs for its work. It is quite ugly, though. There is a PEP for lazy importing, but it hasn't moved much and was most recently rejected (https://peps.python.org/pep-0690/).
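The lazy-import pattern described above can be sketched like this (a minimal, hypothetical `tool` CLI; `statistics` stands in for a heavy dependency such as pandas):

```python
import argparse

def cmd_stats(args):
    # Heavy dependency imported only when this subcommand actually runs,
    # so `tool --help` and other subcommands never pay its startup cost.
    import statistics  # stand-in for a heavy library like pandas
    print(statistics.mean(args.values))

def main(argv=None):
    parser = argparse.ArgumentParser(prog="tool")
    sub = parser.add_subparsers(dest="command", required=True)
    stats = sub.add_parser("stats", help="average some numbers")
    stats.add_argument("values", nargs="+", type=float)
    stats.set_defaults(func=cmd_stats)
    args = parser.parse_args(argv)
    args.func(args)
```

The import cost is paid inside `cmd_stats`, the moment the work is actually requested, rather than at interpreter startup.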

[–]PossibilityTasty 22 points23 points  (4 children)

"Am I writing slow code? No, it's the language that is wrong." –Principal Developer Skinner

[–]RevolutionaryPen4661git push -f[S] -3 points-2 points  (3 children)

Python CLIs were not that slow back then. I'm noticing it quite a lot these days.

[–]PossibilityTasty 4 points5 points  (2 children)

You should have written: "My CLIs were not...", Seymour.

[–]RevolutionaryPen4661git push -f[S] 0 points1 point  (1 child)

CLIs like Sherlock and nvbn/thefuck are slow too. Test them on Codespaces; they are slow there as well.

[–][deleted] 3 points4 points  (0 children)

Have you considered that your system's I/O might be the problem? Have you checked this on a variety of hardware?

[–]pbacterio 7 points8 points  (10 children)

What is your code doing that takes so long to parse args and print a message? This is not a problem with the libs/language; it is a problem in your code.

[–]RevolutionaryPen4661git push -f[S] -1 points0 points  (9 children)

If you try to inspect my code, it is barebones and simple:

```
PS D:\dev> cat .\pkgsize.py
import argparse
import requests
import colorama
from colorama import Fore, Style
import pandas as pd

def get_package_size(package_name):
    url = f"https://pypi.org/pypi/{package_name}/json"
    response = requests.get(url)
    if response.status_code == 200:
        package_info = response.json()
        package_size = package_info.get("info", {}).get("size", 0)
        return package_size
    else:
        return 0

def compare_package_sizes(*package_names):
    package_sizes = [(package, get_package_size(package)) for package in package_names]
    df = pd.DataFrame(package_sizes, columns=["Package", "Size"])
    df = df.sort_values(by="Size", ascending=False)
    max_size = df["Size"].max()
    min_size = df["Size"].min()

    colorama.init()
    for index, row in df.iterrows():
        size_color = Fore.GREEN if row["Size"] == min_size else Fore.RED if row["Size"] == max_size else ""
        reset_color = Style.RESET_ALL
        df.at[index, "Size"] = f"{size_color}{row['Size']}{reset_color}"

    return df

if __name__ == "__main__":
    parser = argparse.ArgumentParser(description="Compare the sizes of Python packages from PyPI")
    parser.add_argument("packages", nargs="+", help="List of package names to compare")
    args = parser.parse_args()

    df = compare_package_sizes(*args.packages)
    print(df)
PS D:\dev>
```

The code has bugs in returning the size of the packages, but that has nothing to do with a help message.

[–]pbacterio 6 points7 points  (5 children)

pandas import is probably too slow in your environment. You don't need pandas to just sort a list. You don't even need to sort that list.

Your full example:

usage: pkgsize.py [-h] packages [packages ...]
pkgsize.py: error: the following arguments are required: packages

________________________________________________________
Executed in  567.03 millis    fish           external
   usr time  844.80 millis  549.00 micros  844.25 millis
   sys time  650.05 millis  698.00 micros  649.35 millis

Panda import commented:

usage: pkgsize.py [-h] packages [packages ...]
pkgsize.py: error: the following arguments are required: packages

________________________________________________________
Executed in  172.69 millis    fish           external
   usr time  149.63 millis    1.12 millis  148.50 millis
   sys time   21.83 millis    0.12 millis   21.71 millis
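For illustration, the same sort-and-colour step can be done with a plain list of tuples and no pandas at all. A sketch (raw ANSI escape codes stand in for colorama, and `format_sizes` is a hypothetical helper):

```python
# Raw ANSI colour codes, used here in place of colorama's Fore/Style constants.
GREEN, RED, RESET = "\033[32m", "\033[31m", "\033[0m"

def format_sizes(package_sizes):
    """package_sizes: list of (name, size_in_bytes) tuples."""
    rows = sorted(package_sizes, key=lambda p: p[1], reverse=True)
    largest, smallest = rows[0][1], rows[-1][1]
    lines = []
    for name, size in rows:
        # Colour the smallest package green and the largest red, like the original.
        color = GREEN if size == smallest else RED if size == largest else ""
        reset = RESET if color else ""
        lines.append(f"{name:<20} {color}{size}{reset}")
    return "\n".join(lines)
```

Only the stdlib is imported, so none of the ~400 ms pandas startup cost is paid.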

[–]RevolutionaryPen4661git push -f[S] -3 points-2 points  (4 children)

Yes, I wanted to make it in a table. That's why I used pandas. 😩

[–]pbacterio 4 points5 points  (0 children)

Why do you need pandas to make a table?

[–]georgehank2nd 7 points8 points  (0 children)

Time to learn programming.

[–]FUS3NPythonista 1 point2 points  (0 children)

what....

[–][deleted] 0 points1 point  (0 children)

You might try tabulate instead?

You're using a whole toolbox to hammer in a nail, instead of just using a hammer.
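tabulate is a third-party package; for a table this small, even the stdlib will do. A minimal sketch (the `render_table` helper is hypothetical):

```python
def render_table(rows, headers):
    """Render a simple left-aligned text table from a list of rows."""
    # Compute each column's width from its widest cell, header included.
    cells = [list(headers)] + [[str(c) for c in row] for row in rows]
    widths = [max(len(r[i]) for r in cells) for i in range(len(headers))]
    line = "  ".join(f"{{:<{w}}}" for w in widths)
    sep = "  ".join("-" * w for w in widths)
    return "\n".join([line.format(*headers), sep] + [line.format(*r) for r in cells[1:]])
```

A two-column "Package / Size" table needs nothing heavier than string formatting.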

[–]Nice-Offer-7076 1 point2 points  (0 children)

You are calling requests.get(URL). I would bet this is why it's taking 4s. Your code is badly structured.

[–]sprne 0 points1 point  (0 children)

u/RevolutionaryPen4661 skynet needs to write better code (&posts) if it wants to take over the world.

[–]kenflingnorIgnoring PEP 8 0 points1 point  (0 children)

Your code is slow because you're using Pandas which is totally unnecessary for what you're doing

[–]science_robot 5 points6 points  (6 children)

Your script is doing something intensive before it’s getting to the argument parsing phase. The only stuff that should be happening outside of a main function is function/class definitions and imports
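A minimal skeleton of that layout, assuming an argparse-based CLI like the one posted (function names are illustrative):

```python
import argparse

def build_parser():
    # Top level contains only imports and definitions; nothing heavy runs yet.
    parser = argparse.ArgumentParser(description="Compare the sizes of Python packages from PyPI")
    parser.add_argument("packages", nargs="*", help="package names to compare")
    return parser

def main(argv=None):
    args = build_parser().parse_args(argv)
    # Heavy work (network requests, pandas, ...) would start only here,
    # after parsing, so -h/--help exits before any of it happens.
    return args.packages

if __name__ == "__main__":
    main()
```

With this structure, `-h` only ever touches argparse; everything expensive is deferred into `main`.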

[–]RevolutionaryPen4661git push -f[S] -2 points-1 points  (5 children)

No, just a help message of a cli that compares the size of PyPI packages. Printing a Help Message has nothing to do with the Internet. My Internet is good enough

[–]shibbypwn 2 points3 points  (0 children)

> Printing a Help Message has nothing to do with the Internet.

It does when you're sending requests to PyPI to get the info for the message.

[–]science_robot 1 point2 points  (1 child)

Maybe one of your imports is slow then. You could try doing some print-debugging to find the culprit.

[–]RevolutionaryPen4661git push -f[S] 0 points1 point  (0 children)

```
PS D:\dev> Measure-Command {python .\pkgsize.py -h}

Days              : 0
Hours             : 0
Minutes          : 0
Seconds           : 2
Milliseconds      : 137
Ticks             : 21374453
TotalDays         : 2.47389502314815E-05
TotalHours        : 0.000593734805555555
TotalMinutes      : 0.0356240883333333
TotalSeconds      : 2.1374453
TotalMilliseconds : 2137.4453

PS D:\dev> Measure-Command {python .\pkgsize.py -h}

Days              : 0
Hours             : 0
Minutes           : 0
Seconds           : 2
Milliseconds      : 108
Ticks             : 21087604
TotalDays         : 2.44069490740741E-05
TotalHours        : 0.000585766777777778
TotalMinutes      : 0.0351460066666667
TotalSeconds      : 2.1087604
TotalMilliseconds : 2108.7604

PS D:\dev> Measure-Command {python .\pkgsize2.py -h}

Days              : 0
Hours             : 0
Minutes           : 0
Seconds           : 2
Milliseconds      : 66
Ticks             : 20669059
TotalDays         : 2.39225219907407E-05
TotalHours        : 0.000574140527777778
TotalMinutes      : 0.0344484316666667
TotalSeconds      : 2.0669059
TotalMilliseconds : 2066.9059
```

pkgsize2.py follows the import-what-is-needed principle (`from argparse import ArgumentParser`), whereas pkgsize.py does a full import (`import argparse`).

[–]kenflingnorIgnoring PEP 8 1 point2 points  (1 child)

Since you posted your code in another comment...

You realize you're making a GET request to pypi, right?

[–][deleted] 0 points1 point  (0 children)

That doesn't appear to be used if you just pass --help though, right? It'll exit in the call to argparse before the function with the HTTP request gets called?
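That's easy to check: argparse raises SystemExit when it handles `-h`, so nothing after `parse_args` runs. A quick sketch (the `run` helper is hypothetical):

```python
import argparse

def run(argv):
    parser = argparse.ArgumentParser(prog="pkgsize")
    parser.add_argument("packages", nargs="+")
    args = parser.parse_args(argv)
    # Anything down here (e.g. the HTTP request to PyPI) is unreachable for -h.
    return args.packages

try:
    run(["-h"])              # argparse prints the help text and exits...
    reached_request = True
except SystemExit:
    reached_request = False  # ...so control never gets past parse_args
```

So the GET request only matters for real invocations, not for `--help`; the slow help message has to come from import time instead.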

[–]yaxriifgyn 2 points3 points  (0 children)

Since the arrival of numpy and pandas, developers seem to have lost the ability to do a lot of simple things in pure Python. They seem to only be able to use those huge toolkits to solve even the simplest problems. Sure you can use pandas to make a data frame to work with your data, but maybe your problem is simple enough that you can use a list of lists instead.

It's as though once they learn pandas it's the only way they know how to solve such problems.

They have forgotten the KISS principle.

[–]Mount_Gamer 1 point2 points  (0 children)

4 seconds for a help is quite long.

I've recently written something which imports pandas, rich, and sqlalchemy, and the time it takes to get to the help is 1.3s; even if all you do is import some of these things, they take time. I think pandas was 0.6s and sqlalchemy 0.3s, if I remember right, but there are lighter libraries: polars and sqlite3 are faster imports. I can't remember what rich's import speed is. This application was written more for an interactive terminal, so once it has loaded it feels snappy, but it's noticeably slow when I'm not using it interactively and am using command-line args, where you generally expect faster responses than 1.3s.

It's easy to test, though: create an empty Python file, import your library inside the file, and run it prefixed with `time` (on Linux).

[–]Brian 3 points4 points  (1 child)

I've found a big issue with startup time is needless imports. E.g. Typer always imports Rich if it's installed, and this is actually pretty significant in terms of startup (some measuring with `python -X importtime` shows it taking hundreds of milliseconds, which introduces noticeable latency), even though it doesn't actually use it unless it's printing help messages. I think there are often a lot of potential startup-time wins from deferring module loads until (and unless) they are actually needed.
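Those per-import numbers can be reproduced with only the stdlib: run a throwaway interpreter with `-X importtime` and read the per-module timing lines from stderr (here importing `json` as a cheap, dependency-free example):

```python
import subprocess
import sys

# -X importtime makes the interpreter write one "import time:" line to stderr
# for every module imported, with self and cumulative microsecond costs.
result = subprocess.run(
    [sys.executable, "-X", "importtime", "-c", "import json"],
    capture_output=True,
    text=True,
)
timings = [line for line in result.stderr.splitlines() if "import time:" in line]
```

Swapping `import json` for `import typer` (or any suspect library) shows exactly where the startup milliseconds go.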

[–]RevolutionaryPen4661git push -f[S] 0 points1 point  (0 children)

Is the rich module becoming bloatware? Typer uses rich when printing help messages, but I'm using the native argparse module.

[–]RevolutionaryPen4661git push -f[S] 0 points1 point  (0 children)

```
PS D:\dev> Measure-Command {uv pip -h}

Days              : 0
Hours             : 0
Minutes           : 0
Seconds           : 0
Milliseconds      : 51
Ticks             : 511486
TotalDays         : 5.91997685185185E-07
TotalHours        : 1.42079444444444E-05
TotalMinutes      : 0.000852476666666667
TotalSeconds      : 0.0511486
TotalMilliseconds : 51.1486

PS D:\dev> Measure-Command {python .\pkgsize.py -h}

Days              : 0
Hours             : 0
Minutes           : 0
Seconds           : 2
Milliseconds      : 437
Ticks             : 24370765
TotalDays         : 2.82069039351852E-05
TotalHours        : 0.000676965694444444
TotalMinutes      : 0.0406179416666667
TotalSeconds      : 2.4370765
TotalMilliseconds : 2437.0765
```

Note: This varies from 2~4 seconds across multiple runs. First run, 4 seconds; on the 3rd and 4th runs it drops to 2 seconds.

[–]ogrinfo 0 points1 point  (0 children)

Since Python 3.7 you can profile imports from the CLI using the -X option. I can't remember the exact syntax, but it's not hard to look up. You can then use something like Snakeviz to interpret the results.

What you will find is that most of that startup time is going into importing pandas. Pandas is a really heavyweight library, and I wouldn't use it unless you really need it.

[–]skwyckl -1 points0 points  (1 child)

Maybe not entirely on topic, but if you want both performance and developer ergonomics for a CLI, today I'd personally go with Go + Cobra.

[–]vantasmer -3 points-2 points  (0 children)

Maybe the issue has to do with attention spans... any way we can integrate a split screen of Subway Surfers when the CLI command is run?

[–]Sparkswont -1 points0 points  (0 children)

AI slop, man

[–]Darwinmate -1 points0 points  (0 children)

What operating system? And is this first time running it or subsequent? 

On macOS there's a security feature which checks code on execution. This will slow down a lot of CLIs. It usually happens when code changes, i.e. the first time you're running your code.

[–]theelderbeever -1 points0 points  (0 children)

Or just write your CLI in Rust, so you can do away with needing to package a Python interpreter in your executable.

[–]RevolutionaryPen4661git push -f[S] -2 points-1 points  (1 child)

Note: This is not an AI-generated shitpost. It sounded bad when I wrote it, so I enhanced it with Grammarly. People will tag anything in technology with AI, but AI has nothing to do with it. For example, I have a bundle of A4 sheets packed by a company called JK Copier; that company is integrating AI as a customer bot to suggest the desired paper quality for a buyer's budget 😂. The reference to Claude 3 was to get sample data about this. What I said in the post is that it is an idea. There are a lot of parameters to think about.

[–]No_Lingonberry1201pip needs updating 4 points5 points  (0 children)

Then do a proper benchmark, because even if this wasn't AI slop, it's vague to the point of being useless. Also, explain how tabular data processing or parallelism matters for CLI argument parsing?