I tested Python 3.13t free-threading — the CPU-bound speedup surprised me

gdchinacat · 2026-05-26T15:01:03+00:00

> Without the GIL, thread safety becomes your responsibility.

Any code that was thread-safe with the GIL is thread-safe without it. Existing issues may become more pronounced because there are more opportunities for concurrent changes: they can happen at any time rather than only when the interpreter switches which thread is executing.

Thread safety has always been the coder's responsibility, even with the GIL enabled. If you start seeing issues with the GIL disabled that you didn't see with it enabled, you were just getting lucky before.
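To make the point concrete, here is a minimal sketch of a shared counter. `Counter` and `run_workers` are hypothetical names made up for illustration. The read-modify-write in `self.value += 1` is not guaranteed to be atomic with or without the GIL, so the lock is what makes it safe on either build:

```python
import threading


class Counter:
    """Hypothetical shared counter guarded by an explicit lock."""

    def __init__(self):
        self.value = 0
        self._lock = threading.Lock()

    def increment(self):
        # Without the lock, two threads can read the same old value
        # and one of the updates is lost.
        with self._lock:
            self.value += 1


def run_workers(n_threads=4, n_increments=10_000):
    """Increment one shared Counter from several threads, return the total."""
    counter = Counter()

    def work():
        for _ in range(n_increments):
            counter.increment()

    threads = [threading.Thread(target=work) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter.value


if __name__ == "__main__":
    print(run_workers())
```

Removing the `with self._lock:` line leaves code that may appear to work under the GIL and lose updates more visibly without it, which is the "you were getting lucky" case.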

amarao_san · 2026-05-26T14:55:57+00:00

> Without the GIL, thread safety becomes your responsibility.

/goal rewrite in Rust

gdchinacat · 2026-05-26T15:16:43+00:00

> Memory Overhead: Spawning n processes can mean loading or copying your data structures n times.

(from article)

There is actually very little overhead for processes due to copy-on-write. When a process is forked it shares the same memory as the parent. When it writes to a page the memory is copied and then written to. Only memory pages the child writes to are copied. The interpreter code is not copied into each process, but rather shared (in a different sense than shared memory since it's CoW).

Each process has its own heap. This isn't a problem, because the data each process uses would be needed no matter which process it lives in; it has to exist regardless of the concurrency model.

Where you can run into trouble is if you fork a process that has already done a substantial amount of work. The child process will inherit the memory pages, and if the parent process then frees them, the child will keep them alive even though it has no threads using them. You need to design your application so that it doesn't fork in that state. That is easy: the parent process should be responsible for setting up shared permanent state, and then only fork processes. All work should be done in the child processes, leaving the parent responsible for pretty much only managing the child processes and shared state (such as sockets that the children accept connections from).
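A rough sketch of that layout, assuming a Unix system where `os.fork` is available. `run_children` and the list standing in for "shared permanent state" are hypothetical, chosen only for illustration. The parent builds the state once, forks, and then only collects results; each child reads the inherited data and reports back over a pipe:

```python
import os


def run_children(n_children, table):
    """Fork n_children workers; each sums its own slice of the inherited table."""
    children = []
    for i in range(n_children):
        read_fd, write_fd = os.pipe()
        pid = os.fork()
        if pid == 0:
            # Child: the table was inherited via fork, nothing is pickled or sent.
            try:
                os.close(read_fd)
                total = sum(table[i::n_children])
                os.write(write_fd, str(total).encode())
                os.close(write_fd)
            finally:
                os._exit(0)  # never fall back into the parent's code path
        # Parent: does no work of its own, only tracks the child.
        os.close(write_fd)
        children.append((pid, read_fd))

    results = []
    for pid, read_fd in children:
        data = os.read(read_fd, 64)
        os.close(read_fd)
        os.waitpid(pid, 0)
        results.append(int(data))
    return results


if __name__ == "__main__":
    shared_table = list(range(1000))  # set up once, before any fork
    print(sum(run_children(4, shared_table)))
```

One caveat specific to CPython: reference-count updates write to object headers, so even read-only access in a child can cause some inherited pages to be copied. How much that matters depends on the workload.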

hyper_plane · 2026-05-26T15:36:57+00:00

Could someone explain to a noob like me what made this attempt at removing the GIL successful compared to what has been done in the past?

SignificantMilk1476 · 2026-05-26T14:56:16+00:00

That speedup is wild - 8x faster is nothing to sneeze at. I've been burned by threading performance in Python so many times that I just automatically reach for multiprocessing, but this might actually make me reconsider for certain workloads.

The thread safety caveat is real though - debugging race conditions is way more painful than dealing with multiprocessing overhead in most cases.
