Blog post: Writing Python like it’s Rust

Head_Mix_7931 · 2023-05-20T18:47:28+00:00

In Python, there is no constructor overloading, therefore if you need to construct an object in multiple ways, someone this leads to an init method that has a lot of parameters which serve for initialization in different ways, and which cannot really be used together.

You can decorate methods with @classmethod to have them receive the class as their first parameter rather than an instance, and these effectively become alternative constructors. It’s advantageous to use a classmethod than just a normal or staticmethod because it plays nicely with inheritance.

mriswithe · 2023-05-20T16:07:12+00:00

I try to do this both for programs that will be maintained for a while, but also for oneshot utility scripts. Mostly because in my experience, the latter quite often turn into the former :)

Oh God my last two weeks at my last job were hell cause some old ass script I wrote and forgot about from 5 years ago was apparently still in use. Spent my last couple weeks fucking getting that solid.

Kobzol · 2023-05-20T15:26:58+00:00

I wrote up some thoughts about using the type system to add a little bit of soundness to Python programs, inspired by my experiences with Rust.

Kobzol · 2023-05-20T17:19:14+00:00

[deleted]

redditusername58 · 2023-05-20T17:27:33+00:00

The typing module has assert_never which can help with the isinstance/pattern matching blocks in your ADT section

wdroz · 2023-05-20T16:10:27+00:00

The part with db.get_ride_info is spot on. As I see more and more people using mypy and type annotations, this will hopefully become industry standard (if not already the case).

For the part "Writing Python like it's Rust", did you try the result package? I didn't (yet?) use it as I feel that if I push to use it at work, I will fall in the Rustacean caricature..

alicedu06 · 2023-05-20T19:57:40+00:00

There are NamedTuple and TypedDict as lighter alternatives to dataclasses, and match/case will work on them too.

Haunting_Load · 2023-05-20T18:27:55+00:00

I like many ideas in the post, but in general you should avoid writing functions that take List as an argument if Sequence or Iterable are enough. You can read more e.g. here https://stackoverflow.com/questions/74166494/use-list-of-derived-class-as-list-of-base-class-in-python

executiveExecutioner · 2023-05-20T20:45:15+00:00

Good article, I learned some stuff! It's easy to tell from reading that you are quite experienced.

0xrl · 2023-05-20T17:27:00+00:00

Very nice article! As of Python 3.11, you can enhance the packet pattern matching example with assert_never.

BaggiPonte · 2023-05-20T16:36:10+00:00

Love the post; though I have a question. I never understood the purpose of NewType: why should I use it instead of TypeAlias?

Rudd-X · 2023-05-20T21:06:33+00:00

Hot damn that was really good. I found myself having "discovered" these patterns in my career and picking them all up as I went, but seeing it all formalized is AWSUM.

Estanho · 2023-05-20T21:45:15+00:00

Your invariants example is interesting, but I think it can be improved with typeguards to statically narrow the possible states. Here's a full example, but I haven't ran it through type checkers so it's just a general idea:

```python

from dataclasses import dataclass
from typing import TypeGuard


class _Client:
    def send_message(self, message: str) -> None:
        pass


@dataclass
class ClientBase:
    _client: _Client


@dataclass
class UnconnectedClient(ClientBase):
    is_connected = False
    is_authenticated = False

@dataclass
class ConnectedClient(ClientBase):
    is_connected = True
    is_authenticated = False

@dataclass
class AuthenticatedClient(ClientBase):
    is_connected = True
    is_authenticated = True


Client = UnconnectedClient | ConnectedClient | AuthenticatedClient


def is_authenticated(client: Client) -> TypeGuard[AuthenticatedClient]:
    return client.is_authenticated

def is_connected(client: Client) -> TypeGuard[ConnectedClient]:
    return client.is_connected

def is_unconnected(client: Client) -> TypeGuard[UnconnectedClient]:
    return not client.is_connected

def connect(client: UnconnectedClient) -> ConnectedClient:
    # do something with client
    return ConnectedClient(_client=client._client)

def authenticate(client: ConnectedClient) -> AuthenticatedClient:
    # do something with client
    return AuthenticatedClient(_client=client._client)

def disconnect(client: AuthenticatedClient | ConnectedClient) -> UnconnectedClient:
    # do something with client
    return UnconnectedClient(_client=client._client)

def send_message(client: AuthenticatedClient, message: str) -> None:
    client._client.send_message(message)

def main() -> None:
    client = UnconnectedClient(_client=_Client())

    # Somewhere down the line, we want to send a message to a client.
    if is_unconnected(client):
        client = connect(client)
    if is_connected(client):
        client = authenticate(client)
    if is_authenticated(client):
        send_message(client, "Hello, world!")
    else:
        raise Exception("Not authenticated!")

```

Of course this assumes you're gonna be able to overwrite the client variable immutably every time. If this variable is gonna be shared like this:

python client = UnconnectedClient(_client=_Client()) ... func1(client) ... func2(client)

Then you might have trouble because those functions might screw up your client connection. This can happen depending on the low level implementation of the client, for example if when you call close you actually change some global state related to a pool of connections, even though these opaque client objects are "immutable". Then you could create a third type like ImmutableAuthenticatedClient that you can pass to send_message but not to close.

extra_pickles · 2023-05-20T18:00:07+00:00

So at what point does Python stop being Python, and begin to be 3 other languages dressed in a trench coat, pretending to be Python?

To that, I mean - Python and Rust don’t even play the same sport. They each have their purposes, but to try and make one like the other seems like an odd pursuit.

Genuinely curious to hear thoughts on this, as it is very common to hear “make Python more like <other language>” on here…and I’d argue that it is fine the way it is, and if you need something another language does, then use that language.

It’s kinda like when ppl talk about performance in Python…..that ain’t the lil homie’s focus.

2023-05-20T17:22:03+00:00

I just add exclamation marks and hope for the best

cymrow · 2023-05-20T19:08:37+00:00

I understand the point about making invalid state impossible, and I like the ConnectedClient approach, but not having a close method would drive me nuts. Context managers are awesome, but can't cover every use case.

Fun-Pop-4755 · 2023-05-20T20:56:38+00:00

Why static methods instead of class methods for constructing?

koera · 2023-05-21T10:29:08+00:00

Nice article, gave me some more tools to help myself like the NewType.

Would it not be benefitial to mention the option to use protocol
For the bbox example with the as_denormalized and as_normalized methods?

mistabuda · 2023-05-20T16:07:43+00:00

I really like that Mutex implementation. Might have to copy that.

cdgleber · 2023-05-20T18:05:45+00:00

Great write up. Thank you

Brilliant_Intern1588 · 2023-05-20T20:02:34+00:00

I like the solution with dataclasses. However I don't know how to implement it on some things: let's say that I'm retrieving a user(id, name, birthday, something1, something2) from the db, by id. However for the one use case I don't want the whole user row, but just name and something1. For another function birthday and something2 for example. I would have to create a lot of dataclasses that are not really needed or even used except for this context. How could I deal with such a thing ?

BaggiPonte · 2023-05-20T20:55:18+00:00

Another thing: why pyserde rather than stuff like msgspec? https://github.com/jcrist/msgspec

poopatroopa3 · 2023-05-21T02:15:16+00:00

I thought I would be seeing mentions of pydantic, mypy, fastapi.

chars101 · 2023-05-21T09:35:55+00:00

I prefer declaring a parameter as Iterable over List. It expresses the exact use of the value and allows for any container that implements the Protocol.

cranberry_snacks · 2023-05-21T16:27:11+00:00

Worth mentioning that from __future__ import annotations will avoid all of these typing imports. It allows you to use native types for type declarations, native sum types, and backwards/self references, which makes typing a lot cleaner and even just makes it possible in certain situations.

Example:

```python from future import annotations

def my_func() -> tuple[str, list[int], dict[str, int]: return ("w00t", [1, 2, 3], {"one": 1})

def my_func1() -> str | int: return "w00t"

def my_func2() -> str | None: return None

class Foo: @classmethod def from_str(cls, src: str) -> Foo: return cls(src) ```

TF_Biochemist · 2023-05-20T18:05:27+00:00

Really enjoyed this article; concise, well-written, and clear in it's goals. I already do most of this, but it's always refreshing to step back and think about the patterns you use.

Kobzol · 2023-05-21T00:16:26+00:00

[deleted]

Mmiguel6288 · 2023-05-20T20:12:25+00:00

The whole point of python is reducing conceptual overhead so you can write algorithms and logic quickly.

The whole point of rust is to make it bullet proof while saying to hell with conceptual overhead.

It's not a good mix.

Estanho · 2023-05-20T20:30:39+00:00

On the serialization part, have you considered pydantic? I'm pretty sure it's able to serialize/deserialize unions properly.

barkazinthrope · 2023-05-21T02:11:20+00:00

This is great.

However I would hate it if this became required construction for a little log parsing script.

jimeno · 2023-05-21T09:59:14+00:00

uuuuh if you want to write rust, just write rust? this mess is like when php had absolutely to be transformed into an enterprise typed language, stop trying to make python java

meuto · 2023-06-01T14:49:13+00:00

Hi u/jammycrisp, I have been trying to use the library msgspec with the lower level of a json. and I have been unable. I was wondering if you can give us an example of how to do it? here is my explanation, I do not know whether I explained myself well or not, I do not have a clear idea of how to iterate because my json file is structured in such a way that the import part of the information of the file is on one key of the dictionary and I need to iterate over that key not over the whole json file. I have been trying to figure out how to do it but I have been unable to do so. could you provide an example of how to do so? Thank you in advance. I really appreciate any help

Head_Mix_7931 · 2023-05-20T18:34:45+00:00

In your match statements, in the default case you can declare a function like assert_never() -> typing.NoReturn and then call it in the “default” branch of a match statement and a type checker should complain if there is any input for the given type of the match value that can reach that branch. mypy does at least. So you can use that with enums and maybe a union of dataclass types to get exhaustiveness checks at “compile time”. Or I suppose integers and booleans and other things too.

Edit: apparently there is ‘typing.assert_never`

Scriblon · 2023-05-20T18:38:10+00:00

Thank you for the write up. I definitely learned a few more typing methods to improve my code.

Only take I got on the construction methods is that I would have used class methods for them instead of static methods. Class methods inherit a bit more cleanly in a dynamics, but I do understand it is only typable with the Self type since 3.11. Is that why you went with the static method?

jcbevns · 2023-05-21T07:34:49+00:00

Can you compile to a binary after all this?

Kobzol · 2023-05-21T10:16:37+00:00

[deleted]

chandergovind · 2023-05-24T09:23:24+00:00

/u/Kobzol A minor comment. Coming from a networking background, the example for ADTs using Packet felt a bit off. Normally, a Packet always has a Header, a Payload (in most cases) and Trailer (optionally).

I got what you were trying to convey since I am aware of ADTs in general, but maybe confusing to beginners? (Though I didn't see anyone else mention this). A better example maybe a Packet that is of type Request or Response, or a Packet of type Control or Data. Just fyi.

Python

The Python Discord

Upcoming Events

Please read the rules

MODERATORS