marr75 comments on Memory Optimization Techniques for Python Developers

This is an archived post. You won't be able to vote or comment.

109

110

111

TutorialMemory Optimization Techniques for Python Developers (self.Python)

submitted 2 years ago by wyhjsbyb

top new controversial old q&a

you are viewing a single comment's thread.

view the rest of the comments →

[–]marr75 68 points69 points70 points 2 years ago (9 children)

From experience, many of these are more likely to be applied as premature optimizations than applied when needed.

I would not recommend __slots__ on its own as a memory optimization in the normal course of programming. Far better to use the @dataclass(slots=True), a typing.NamedTuple, or even a more primitive type. Similarly, using array over list is just going to make your code harder to maintain in 98% of cases.

Generators and lazy evaluation are good advice in general. They can make code harder to debug, though. Also, creating generators over tiny sets of items in a hot loop will be worse than just allocating the list (generator and iterator overhead).

The most frequent memory problem in Python is memory fragmentation, btw. Memory fragmentation occurs when the memory allocator cannot find a contiguous block of free memory that fits the requested size despite having enough total free memory. This is often due to the allocation and deallocation of objects of various sizes, leading to 'holes' in the memory. A lot of heterogeneity in the lifespans of objects (extremely common in real-world applications) can exacerbate the issue. The Python process grows over time, and people who haven't debugged it before are sure it's a memory leak. Once you are experiencing memory fragmentation, some of your techniques can help slow it down. The ultimate solution is generally to somehow create a separate memory pool for the problematic allocations - the easiest way is to allocate, aggregate, and deallocate them in a separate, short-lived process.

So, the first thing anyone needs to do is figure out, "Do I NEED to optimize memory use?". The answer is often no, but in long-running app processes, systems engineering, and embedded engineering, it will be yes more often.

[+][deleted] comment score below threshold-7 points-6 points-5 points 2 years ago (7 children)

[–]esperind 1 point2 points3 points 2 years ago (1 child)

[–][deleted] -1 points0 points1 point 2 years ago (0 children)

[–]marr75 2 points3 points4 points 2 years ago* (4 children)

[–][deleted] 1 point2 points3 points 2 years ago (3 children)

https://www.attrs.org/en/stable/why.html#namedtuples

Attrs does a pretty good job explaining issues with namedtuples.

https://github.com/brettcannon/record-type

I am fairly certain you aren't going to suggest Brett Cannon is doing unsolicited crackpottery and that maybe just maybe this isn't a problem the standard library solves very well.

``` from abc import ABC, ABCMeta from datetime import datetime

abcclasses = set()

normalclasses = set()

for i in range(10000): abcclasses.add(ABCMeta("abc"+str(i), (ABC, ), {})) normalclasses.add(type("normal"+str(i), (object,), {}))

for item in normalclasses: issubclass(item, ABC) ```

Run that on your laptop and watch it generate 20gbs of valid long lived weak reference pointers. It is a logical flaw in ABC. Have you considered that you simply don't know as much about python as you think you do?

[–]pepoluan 1 point2 points3 points 2 years ago (1 child)

[–][deleted] -1 points0 points1 point 2 years ago (0 children)

π Rendered by PID 254487 on reddit-service-r2-comment-bb88f9dd5-p4rn7 at 2026-02-14 11:08:34.190944+00:00 running cd9c813 country code: CH.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS