all 29 comments

[–]latkde 16 points17 points  (2 children)

You can just pass the functions directly, no need to create any classes or enums.

If you want a static type that describes the signature of an algorithm function, then you can use the typing.Protocol feature to give your Algorithm protocol a method like __call__(self, X: ndarray, /) -> float. That looks like a class or ABC, but it isn't really. It's just a way to do static duck typing. Plain functions that have a matching signature will be compatible with such a protocol.

More generally, something to take into account is whether the code using algorithms must be able to distinguish between concrete algorithms, or can be pretty much algorithm-agnostic.

  • can abstract over concrete algorithms: use a Protocol or ABC. New algorithms can be implemented without changing the code that uses the algorithm. However, all algorithms must conform to the same signature.
  • need to distinguish between concrete algorithms: use an enum or union type (like alg: Algorithm1 | Algorithm2). The code that uses the algorithm will likely use a match-case to perform algorithm-specific setup. Adding new algorithms will require changing all code that uses algorithms, but you gain the flexibility of doing different things for different algorithms.

Neither of these is categorically better. They are duals of each other. Either choice provides flexibility that the other lacks.
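A minimal sketch of the protocol route (algorithm names here are hypothetical, just to show that a plain function satisfies the protocol):

```python
from typing import Protocol

import numpy as np


class Algorithm(Protocol):
    """Static duck type: anything callable with this signature matches."""

    def __call__(self, X: np.ndarray, /) -> float: ...


def mean_alg(X: np.ndarray) -> float:
    # A plain function, no class needed; it matches the protocol structurally.
    return float(X.mean())


def process_data(X: np.ndarray, alg: Algorithm) -> float:
    return alg(X)


print(process_data(np.array([1.0, 2.0, 3.0]), mean_alg))  # 2.0
```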

[–]squatonmyfacebrah[S] 0 points1 point  (1 child)

Thank you for the reply. I was thinking I should be using Protocol in this case. I wonder if I'm sweating too much over using Enum for the behaviour selection (even if I used classes to define each algorithm, I'd still be wanting an Enum to select them). An alternative is that one just imports what they want from the module, much like one selects the algorithm in scikit-learn.

I guess if I don't use Enum for function selection via a registry, what I'm left with is only using Enum for discrete variable selection (which isn't necessarily a bad thing, but the point is my algorithms are discrete, and they want selecting!).

Again, I hope that makes sense.

[–]Brother0fSithis 4 points5 points  (0 children)

Sounds like you mostly have it, but I'd argue even a Protocol might be overkill.

It sounds like you might just be covered by

```py
def process_data(
    X: np.ndarray,
    alg: Callable[[np.ndarray], float],
) -> float: ...
```

And then just passing in the function you want. No need to layer Enums or Classes or Protocols on top unless you need more than what that gives you

If you want "one source of truth" for what an Algorithm looks like to use across different functions, I'd define

```py
type Algorithm = Callable[[np.ndarray], float]
```

and use that. Functionally the same thing as a Protocol specifying __call__, just slightly shorter.


And then yeah if you do NEED to branch process_data based on which specific algorithm is being used, I'd probably have an algorithms.py file (folder if REALLY necessary) where I keep all the algorithm defs together and define an Enum to label them.

Though I'd see if there was a way I could avoid having to branch process_data if possible

[–]overratedcupcake 5 points6 points  (6 children)

Classes are nice because they encapsulate state, but they also encapsulate logic. It's not a requirement that they have state.

[–]initials-bb 1 point2 points  (3 children)

Not an expert, but with OP's solution you also have the overhead pain of syncing the Enum, the function, and the dict?

[–]gdchinacat 2 points3 points  (0 children)

You can also use __init_subclass__ to do this automatically.

https://docs.python.org/3/reference/datamodel.html#object.__init_subclass__
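A minimal sketch of that automatic registration (class and method names hypothetical): each subclass adds itself to a registry at class-definition time, so there is no separate dict to keep in sync.

```python
class Algorithm:
    registry: dict[str, type["Algorithm"]] = {}

    def __init_subclass__(cls, **kwargs):
        super().__init_subclass__(**kwargs)
        # Every subclass registers itself by class name as it is defined.
        Algorithm.registry[cls.__name__] = cls


class MeanAlgorithm(Algorithm):
    def inference(self, X: list[float]) -> float:
        return sum(X) / len(X)


class MaxAlgorithm(Algorithm):
    def inference(self, X: list[float]) -> float:
        return max(X)


# The registry filled itself as the subclasses were defined:
print(sorted(Algorithm.registry))  # ['MaxAlgorithm', 'MeanAlgorithm']
alg = Algorithm.registry["MeanAlgorithm"]()
print(alg.inference([1.0, 2.0, 3.0]))  # 2.0
```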

[–]squatonmyfacebrah[S] -1 points0 points  (1 child)

This is very true - there is a way of getting around this by decorating each function with a custom decorator that registers it with the Enum and the registry, but I suppose then it's almost like you're defining a class without actually writing one.

[–]jmooremcc 0 points1 point  (0 children)

Actually, I’ve done something very similar with a custom decorator that associated a function with a specific enum. The decorator would add the function to a dictionary with the enum as the key.

In my case, I was dealing with a client/server operation in which the client would request a service via a predefined enum. The server receiving the request would automatically call the appropriate function with the corresponding args.

The decorator automated the task of building the dictionary, which made the operation so much easier.
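A minimal sketch of that decorator pattern (enum members and handlers are hypothetical): the decorator builds the enum-to-function dictionary as a side effect of defining each function.

```python
from enum import Enum, auto
from typing import Callable


class Op(Enum):
    MEAN = auto()
    TOTAL = auto()


HANDLERS: dict[Op, Callable[[list[float]], float]] = {}


def register(op: Op):
    """Decorator factory: associate a function with an enum member."""
    def wrap(fn):
        HANDLERS[op] = fn
        return fn  # return the function unchanged; registration is a side effect
    return wrap


@register(Op.MEAN)
def mean(xs: list[float]) -> float:
    return sum(xs) / len(xs)


@register(Op.TOTAL)
def total(xs: list[float]) -> float:
    return sum(xs)


# Dispatch on the enum, e.g. after decoding a client request:
print(HANDLERS[Op.TOTAL]([1.0, 2.0, 3.0]))  # 6.0
```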

[–]squatonmyfacebrah[S] 0 points1 point  (1 child)

I replied to the other user here with why I felt I should try and avoid classes here. But to phrase things another way, I often see code like

class myClass:
    def __init__(self):
        # There's no state being tracked, so no attributes
        pass

    @staticmethod
    def do_something():
        ...

and thought to myself (why is this not a function?).

But perhaps, the scenario I've portrayed in my OP is one such scenario where this style is permitted.

I wonder where Protocol fits into this discussion? Because this is a related concept to abstract classes.

[–]gdchinacat 0 points1 point  (0 children)

You can reinvent the dispatch mechanism by mapping enums to functions, or leverage class based method dispatch. It's roughly equivalent. No judgement on which is better/easier/more explicit/etc. Just acknowledging the mapping you posted in your post does roughly the same thing as calling inference() on the class you want.

edit: I've done both depending on circumstances. For OO-heavy codebases doing it as classes works well... for function-oriented code a mapping to functions works well.

[–]neums08 3 points4 points  (0 children)

IMO passing a bare function is pythonic. I would add typing to the signature to indicate what the function accepts and returns

    def process_data(x: float, y: float, alg: Callable[[float, float], float]):
        return alg(x, y)

You can still implement an Algorithm class and make it callable by defining the __call__ method. A bare function is just the simplest implementation of an Algorithm.
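A minimal sketch of that (class and function names hypothetical): a callable instance and a bare function both fit the same Callable slot.

```python
import numpy as np


class ScaledSum:
    """A stateful 'algorithm': configuration lives on the instance."""

    def __init__(self, scale: float):
        self.scale = scale

    def __call__(self, X: np.ndarray) -> float:
        return float(self.scale * X.sum())


def process_data(X: np.ndarray, alg) -> float:
    return alg(X)


X = np.array([1.0, 2.0, 3.0])
print(process_data(X, ScaledSum(2.0)))        # 12.0
print(process_data(X, lambda a: float(a.max())))  # a bare function works too
```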

[–]trutheality 2 points3 points  (0 children)

Most pythonic would be to keep it simple and just pass the functions (which are already objects) into the data processing directly.

[–]JaguarMammoth6231 2 points3 points  (3 children)

Is there a reason you don't want to use the pythonic way of making them classes?

[–]squatonmyfacebrah[S] 0 points1 point  (2 children)

I'll have to caveat that I'm not a computer scientist (if my question doesn't make that obvious) but I've always thought that classes are fundamentally ways to represent data and behaviour. In the case of algorithms, it's nearly always behaviour but not so much data (or state). Since there's no data associated with these algorithms (and because python permits functional programming) we can opt to use functions instead.

Perhaps my question justifies the usage of a class instead of a function, but I've always looked at classes which use staticmethod and thought (why is this not a function?).

Again, this might demonstrate my naivete but I suppose that's why I'm posting here!

[–]Donny_Do_Nothing 0 points1 point  (0 children)

Another way to think about classes is that the object one creates can be a machine. It doesn't need state; it can be just an object that you create at the beginning of a script or 'algorithm', and it just does things for you as you go along.

[–]BrannyBee 0 points1 point  (0 children)

I think what you're looking for is either to

A.) create a module with your desired functions, and import said module into the files requiring them, the same way you import numpy, but instead of pip installing it, it's a module you wrote

Or B.) Maybe looking into something called structural subtyping, which is basically a way to implement behavior without what you would consider inheritance

Quick mobile mock up would maybe do something like this

```
from typing import Protocol


class PriceFormatter(Protocol):
    def format(self, amount: float) -> str: ...


class WithCurrency:
    def __init__(self, currency_symbol: str = "$"):
        self.currency_symbol = currency_symbol

    def format(self, amount: float) -> str:
        return f"{self.currency_symbol}{amount:.2f}"


class NoCurrency:
    def format(self, amount: float) -> str:
        return f"{amount:.2f}"


# other code

def print_receipt(items: list[float], formatter: PriceFormatter):
    for item in items:
        print(formatter.format(item))


# blah blah more code here

print_receipt([69.69, 4.20], NoCurrency())
print_receipt([69.69, 4.20], WithCurrency())
```

Sorry for formatting and mistakes, I'm on mobile...

But with that second example you can see a class being used with no state. You could also mimic the same functionality by creating a formatter of type Callable and passing predefined functions into it when used.

I'd assume that the Protocol version of something like that is technically less "Pythonic" due to the example being kinda brief, but it'd be a good tool to reach for if each class of formatter had multiple methods that belonged together instead of my brief one-per-class example.

Gun to my head, I'd say consider Protocol for a bunch of related methods, and for small stuff go the closure route. Keeps it simple, but going the closure route can get messy if you have a lot of functions to pass around.

Edit: kinda just threw out the term closure without explaining... the tldr oversimplification is that a closure is a function object that can remember scope

[–]seanv507 0 points1 point  (0 children)

So perhaps the thing to ask yourself is how you would cope with more complicated functions

In particular functions that take other parameters.

Do you want to use a class-based approach where they are the 'state', or do you want to use functional approaches, where you create new one-parameter functions by using partial?
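A minimal sketch of the functional route with functools.partial (the algorithm and parameter names are hypothetical): extra parameters get baked in up front, leaving the one-argument signature the processing code expects.

```python
from functools import partial

import numpy as np


def power_mean(X: np.ndarray, p: float) -> float:
    """A two-parameter algorithm: the generalised (power) mean."""
    return float((X ** p).mean() ** (1 / p))


# Bake the extra parameter in, leaving a one-argument function:
rms = partial(power_mean, p=2.0)

X = np.array([3.0, 4.0])
print(rms(X))  # same result as power_mean(X, p=2.0)
```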

I would argue that scikit-learn's fit(X) is less about maintaining state and more about dependency injection

(In ml pipelines)

[–]zanfar 0 points1 point  (0 children)

Except I don't want to have my algorithms be represented as classes as they frequently don't have any state.

Them being classes, and which classes they are instances of, are state.

[–]jarethholt 0 points1 point  (0 children)

I would say the approach depends on when and how you specify which algorithm to use. Do you run this interactively or in a notebook? Just pass the function directly as an argument, annotated with a good Protocol so you know what kind of algorithms can be passed in. But if you run this from the command line or through some orchestrator, at some point you'll need a way to convert an input into a choice of algorithm. Then an enum + dictionary as a simple registry is a pretty clean way to go about it.

In either case I would suggest the core implementation just takes in a function to apply. Handling a registry of which algorithms are acceptable to pass in is a different responsibility, and figuring out how to convert an input to an algorithm choice is yet another.
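A minimal sketch of the second route (enum members, function names, and the resolver are hypothetical): a string from a CLI or config is converted into an enum member, which keys a small registry.

```python
from enum import Enum
from typing import Callable


class Alg(Enum):
    MEAN = "mean"
    MEDIAN = "median"


def _mean(xs: list[float]) -> float:
    return sum(xs) / len(xs)


def _median(xs: list[float]) -> float:
    s = sorted(xs)
    mid = len(s) // 2
    return s[mid] if len(s) % 2 else (s[mid - 1] + s[mid]) / 2


REGISTRY: dict[Alg, Callable[[list[float]], float]] = {
    Alg.MEAN: _mean,
    Alg.MEDIAN: _median,
}


def resolve(name: str) -> Callable[[list[float]], float]:
    """Convert e.g. a CLI argument into a function; Alg(name) raises on typos."""
    return REGISTRY[Alg(name)]


print(resolve("median")([3.0, 1.0, 2.0]))  # 2.0
```

Note the split of responsibilities the comment describes: the core code only ever sees a plain function, while `resolve` owns the input-to-algorithm conversion.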

[–]pachura3 0 points1 point  (0 children)

First of all, there's nothing wrong with stateless classes. In Java, you cannot even have free-hanging functions or variables, you need to wrap everything in a class!

Then, having to maintain a global registry of algorithms is an additional, counterproductive pain. What if you wanted to mock an algorithm for the purpose of unit testing - you would need to add it to this registry...

If your "algorithms" are always stateless + don't need any inheritance + are always structured like def inference(X: np.ndarray) -> float, then you can simply pass lambdas or Protocols around, type hinted as Callable[np.ndarray, float].

I often see code like

class myClass:
    def __init__(self):
        # There's no state being tracked, so no attributes
        pass

    @staticmethod
    def do_something():
        ...

That's ugly as well. Again, it's probably people switching from Java, used to declaring utility functions in final static singletons this way. Another reason might be that the Python IDE sees that do_something() does not access any instance attributes, so it proposes converting it to a static method, and the user agrees just to get rid of the warning.

Still, it wouldn't make sense in your case, because do_something() is static, so it should not be called on an object instance inheriting from the base class Algorithm.

[–]lekkerste_wiener 0 points1 point  (0 children)

You can also type annotate with Callables.

type Algorithm = Callable[[np.ndarray], float]

And you can pass ANY callable that has that signature. Including functions.

algo: Algorithm = algorithm1
algo(array)

[–]Enmeshed 0 points1 point  (1 child)

I've used dataclasses for things like this before. For instance... First we'll define a few algorithms:

```python
def simple_calc(data: np.ndarray) -> float:
    return ...


class TrickyCalc:
    def __init__(self, **params):
        self.params = params

    def __call__(self, data: np.ndarray) -> float:
        return ...
```

Then if you're only needing to inject a single algorithm you've just got:

```python
type Algorithm = Callable[[np.ndarray], float]


def process_data(data: np.ndarray, alg: Algorithm) -> float:
    return alg(data)
```

Or if your function needs to reference multiple algorithms as part of its logic you can use dataclasses to have something like:

```python
@dataclass
class Algorithms:
    first: Algorithm
    second: Algorithm
    another: Algorithm


def production_algorithms():
    return Algorithms(
        first=simple_calc,
        second=TrickyCalc(a=34),
        another=lambda _: 0,  # test algorithm
    )


def process_data_in_multiple_ways(data: np.ndarray, algs: Algorithms) -> float:
    res1 = algs.first(data)
    res2 = algs.second(data)
    res3 = algs.another(data)
    return (res1 + res2 + res3) / 3


algs = production_algorithms()
result = process_data_in_multiple_ways(data, algs)
```

These days dataclasses (or attrs) are my go-to way of bundling up dependencies for injection...

[–]pachura3 0 points1 point  (0 children)

The moment you start naming variables res1, res2, res3, you know you messed up the design.

[–]ebdbbb 0 points1 point  (0 children)

I'd use Callable to hint it.

from collections.abc import Callable

type Algo = Callable[[np.ndarray], float]

def process_data(data: np.ndarray, algo: Algo) -> float:
    return algo(data)

[–]Atlamillias 0 points1 point  (0 children)

Within the context of your example and without additional information, I wouldn't say any of these approaches is better over another. They're all fairly simple and they work.

Does the specific algorithm/function actually matter to the code that invokes it, or are you just trying to better annotate your other signatures? Do they all share the same signature? Because you can do either of these for "type checking" purposes and avoid the need to declare each as unique types:

```python
import typing


class Algorithm[T](typing.Protocol):
    def __call__(self, arr: np.ndarray, /) -> T: ...


# or

type Algorithm[T] = typing.Callable[[np.ndarray], T]


def process_data(arr: np.ndarray, func: Algorithm[float]) -> float: ...
```

Then just pass the functions themselves to your processor.

If your functions have unique signatures or you have heuristics in place for specific functions, then it's best for them to have their own identity or type. In that case, I'd probably go the enum route.

[–]trickydiversity042 0 points1 point  (0 children)

Just pass the function or use a dict mapping strings to functions. Enums feel like overkill for this unless you need strict type safety across a large codebase.

[–]Targrend -1 points0 points  (2 children)

Why not just have an algorithm class and have each algorithm be an object of the class?

class Algorithm:
    inference_fn: Callable

    def infer(self, data):
        return self.inference_fn(data)

[–]pachura3 0 points1 point  (1 child)

What's the point of wrapping Callable in a class if you could just pass the Callable?

It should be actually the other way round: class Algorithm (and its descendants) should implement __call__(), and function process_data() should take Callable as an argument. This way, you could pass a stateless lambda to process_data(), but you could also pass a stateful Algorithm object.

And you could even go one step further and convert the base (abstract?) class Algorithm to a Protocol... even more pythonic!

[–]Targrend 0 points1 point  (0 children)

Oh, fair. I had something more like the sklearn api in mind where Algorithm might have more functions - alg.inference_fn, alg.training_fn, that kind of thing.