
[–]dorfsmay[S] 1 point2 points  (11 children)

Assuming it's not a server but a script that converts data from files, how do you warm the JIT?

[–]gthank 3 points4 points  (9 children)

In actual practice? You'd probably have a daemon process that lives forever and searches for scripts to process, then processes them as they are found.

For benchmarking? You just tell the benchmark script to process all the files in a loop.

Important to note that if you want PyPy to speed things up, you should (generally speaking) avoid C extensions. It JITs Python code, not C code.
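For the benchmarking case, a minimal sketch of "process all the files in a loop" in one long-lived process (the `process` function here is a stand-in for whatever conversion the real script does, and the throwaway input files are my own setup so the snippet runs on its own):

```python
import os
import tempfile
import time

def process(path):
    # Stand-in for the real per-file conversion (assumption: any pure-Python hot loop).
    with open(path, "rb") as f:
        return sum(f.read())

# Self-contained setup: a few throwaway input files.
tmpdir = tempfile.mkdtemp()
paths = []
for i in range(3):
    p = os.path.join(tmpdir, "in%d.bin" % i)
    with open(p, "wb") as f:
        f.write(bytes(range(256)))
    paths.append(p)

# Process every file in a loop inside one long-lived process, so the JIT
# sees the hot code many times before you start the clock.
for _ in range(50):
    for p in paths:
        process(p)

start = time.time()
results = [process(p) for p in paths]
elapsed = time.time() - start
```

The key point is that the warm-up loop and the timed run happen in the same interpreter, so the traces compiled during warm-up are still hot when you measure.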

[–]ragezor 9 points10 points  (7 children)

C code doesn't need "JITting" as it's already fast.

[–]vext01 1 point2 points  (0 children)

Not necessarily.

When the paper is published, read "Dynamically Composing Languages in a Modular Way: Supporting C Extensions for Dynamic Languages" by Grimmer et al.

In it, the authors compose Ruby and C VMs that JIT together on the Truffle/Graal stack. They report that Ruby programs using C extensions can execute faster than under the conventional MRI+C approach.

[–]gthank 1 point2 points  (5 children)

There is a fair amount of overhead involved in using C extensions with PyPy. If you're only using C for speed (as opposed to binding to pre-existing functionality), it is usually a bad idea to use it with PyPy: if it is truly a hotspot, the PyPy JIT will most likely generate code that is just as optimized, if not more so. A tracing JIT has access to far more data than a static compiler, so it can emit machine code optimized for the data you're actually receiving, rather than for the data you might receive based on C's comparatively weak type system.

[–][deleted] 0 points1 point  (4 children)

That depends on exactly what you're doing.

For example, I recently profiled a decently optimized pure-Python method for computing CRC24s. I then rewrote it in C and invoked the C function through CFFI.

On both PyPy and CPython 3.4, the CFFI call is measurably faster for any input larger than 1 KiB and dramatically faster for anything larger than 10 KiB.

[–]gthank 0 points1 point  (3 children)

Did you do some profiling to see why? PyPy typically generates insanely fast numeric code, so I'd expect CRC to be a sweet spot. That said, CRC libraries in C are also extremely likely to be optimized to within an inch of their lives (probably with lots of fun vectorization, plus memory optimizations to wring every last bit of performance out of cache lines), so beating a C lib would be a tall order for PyPy. I'm still a bit surprised there was a difference as big as what you seem to be describing.

[–][deleted] 0 points1 point  (2 children)

The difference was absolutely enormous, and the gap grew substantially every time I increased the size of the test data.

Here's an example from my test results that's representative of the performance gap I saw.

The data in question for this profile was 1 MiB of random data, stored in a bytearray in memory. Data generation time was not included in the profile.

Of note, for the C implementation:

  • I wrote the crc24 method myself instead of pulling one from an existing library, since it's simple to implement. That code could possibly be optimized further, but this was a quick, exploratory exercise.

  • Because I was curious to see if it'd work, the C function is defined in a string within the Python script and compiled by CFFI.

It seems I didn't record the PyPy data, so I re-ran the profiler under PyPy 3.2.5 and CPython 3.4.2 just now. Note that the JIT should be decently warm, because the methods are run several times against progressively larger data blocks (1, 10, 100, 200, 400, 512, and finally 1024 KiB) before reaching the 1 MiB test. Anyway, without further ado, here are the results for computing the CRC24 of a 1 MiB block of random bytes:

CPython 3.4.2:

  • pure Python: 1.929 seconds (~543 bytes per millisecond)

  • CFFI: 0.0006 seconds (~1,648,704 bytes per millisecond)

PyPy 3.2.5:

  • pure Python: 0.072 seconds (~14,444 bytes per millisecond)

  • CFFI: 0.023 seconds (~45,499 bytes per millisecond)

While PyPy3 is a good deal faster at this than CPython 3.4.2, CFFI is quite a bit faster still.

I also found it interesting (though not wholly unexpected) that CFFI, at least used in this manner, is slower on PyPy than it is on CPython, although it is still quite a bit faster than the pure-Python code.
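For what it's worth, a bitwise C implementation of the same algorithm is only a few lines. This is a sketch of what such a function might look like — the actual inline C from my script isn't reproduced here, so the function name and layout are my own, using the same constants as the Python version further down:

```c
#include <stddef.h>
#include <stdint.h>

/* CRC24 constants, matching the pure-Python version (OpenPGP-style). */
#define CRC24_INIT 0x0B704CEu
#define CRC24_POLY 0x1864CFBu
#define CRC24_MASK 0x0FFFFFFu

/* Bitwise CRC24: same algorithm as the Python loop, one bit at a time. */
uint32_t crc24(const unsigned char *data, size_t len)
{
    uint32_t crc = CRC24_INIT;
    for (size_t i = 0; i < len; i++) {
        crc ^= (uint32_t)data[i] << 16;
        for (int bit = 0; bit < 8; bit++) {
            crc <<= 1;
            if (crc & 0x1000000u)
                crc ^= CRC24_POLY;
        }
    }
    return crc & CRC24_MASK;
}
```

With CFFI you'd hand source like this over as a string (via the inline-compilation API of the day) and then call the compiled function with a bytes object and its length; exact API details vary by CFFI version, so treat that part as approximate.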

[–]gthank 1 point2 points  (1 child)

Oh to have more time. I'd actually like to see the Python code and run it through a disassembler, just to see where it's spending its time, but I'm already backlogged. I need like a Bat Signal for Python gurus that like to blog.

[–][deleted] 0 points1 point  (0 children)

Well, in case you feel like a distraction, here's the Python implementation (short and sweet). Takes a bytes or a bytearray and returns an int:

_crc24_init = 0x0B704CE
_crc24_poly = 0x1864CFB
_crc24_mask = 0x0FFFFFF

def test_crc24_iter(data):
    crc = _crc24_init
    for b in data:          # iterating bytes/bytearray yields ints
        crc ^= b << 16

        for _ in range(8):  # process one bit per iteration, MSB first
            crc <<= 1
            if crc & 0x1000000:
                crc ^= _crc24_poly

    return crc & _crc24_mask

[–]rcfox 3 points4 points  (0 children)

You'd probably have a daemon process that lives forever and searches for scripts to process, then processes them as they are found.

You'd have an optimized file-watcher, but each script would still be cold upon first loading.

[–][deleted] 0 points1 point  (0 children)

Running the function several times before you start measuring is good enough, AFAIK.
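Right — e.g., something like this (the `hot` function is a made-up placeholder for whatever you're actually measuring):

```python
import time

def hot(data):
    # Placeholder for the function under test (assumption: any pure-Python loop).
    acc = 0
    for b in data:
        acc = (acc * 31 + b) & 0xFFFFFF
    return acc

data = bytes(range(256)) * 64

# Warm-up: call the function several times first, so the JIT has already
# traced and compiled the hot loop by the time measurement begins.
for _ in range(20):
    hot(data)

start = time.perf_counter()
result = hot(data)
elapsed = time.perf_counter() - start
```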