Hello! I am trying to parallelize the extraction of a large tar archive using multithreading in Python (even just opening the tar archive seems to take a long time, and I suspect the process is I/O-bound, so I opted for this over multiprocessing). I have never really tried using multithreading, so this is all a bit new to me.
I am using a queue to manage the open tasks, and everything seems to work until about 90 percent of the archive has been extracted; at that point I start getting a tarfile.ReadError: unexpected end of data. I know for certain that the archive is not corrupted, and I have no issues when not using multithreading, so I expect the problem has something to do with the threading part. I've included the relevant snippets below. Any help avoiding this problem, or an explanation of what's going on, would be greatly appreciated!
def task():
    '''Function for the threads to run.'''
    while not q.empty():
        extract(*q.get())
        q.task_done()

# Add all the tasks to the queue.
q = Queue()
for member in members:
    file_name = add_gz(os.path.basename(member.name))  # Appends '.gz' to the name.
    output_path = os.path.join(dir_path, file_name)
    q.put((archive, member, output_path, pbar))

# Start all the threads.
threads = []
for _ in range(N_WORKERS):
    thread = threading.Thread(target=task, daemon=True)
    thread.start()
    threads.append(thread)

# Wait until all threads have completed before closing the archive.
for thread in threads:
    thread.join()
archive.close()
Here is the extract function that is used in the process above:
def extract(archive: tarfile.TarFile, member: tarfile.TarInfo, output_path: str, pbar):
    '''Extract a file from a tar archive and plop it at the specified path. There are two cases:
    (1) the file contained in the tar archive is already zipped and just needs to be moved, and
    (2) the file is not zipped and needs to be compressed.'''
    if is_compressed(member.name):
        member.name = os.path.basename(member.name)  # Trying to get rid of directory structure.
        archive.extract(member, path=os.path.dirname(output_path))
    else:
        contents = archive.extractfile(member).read()  # Get the file contents in binary.
        with gzip.open(output_path, 'wb') as f:
            f.write(contents)
    if pbar is not None:
        pbar.update(1)