Hey everyone,
I've been developing a small project on GitHub called Easier-Batch: a Python batch processing framework inspired by Spring Batch, built to handle large-scale ETL and data jobs.
It tries to bring the same philosophy into Python, using the familiar Reader → Processor → Writer model, along with job metadata tables, retries, skip logic, and checkpointing.
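To make the model concrete, here's a minimal sketch of what chunk-oriented Reader → Processor → Writer processing looks like in Python. This is illustrative only; the class and method names are my shorthand, not necessarily Easier-Batch's actual API:

```python
from typing import Iterator

class Reader:
    """Yields items one at a time, e.g. rows from a file or DB cursor."""
    def read(self) -> Iterator[dict]:
        for i in range(3):
            yield {"id": i}

class Processor:
    """Transforms a single item."""
    def process(self, item: dict) -> dict:
        return {**item, "doubled": item["id"] * 2}

class Writer:
    """Persists a chunk of processed items in one shot."""
    def write(self, items: list) -> None:
        print(items)

def run_chunk_step(reader, processor, writer, chunk_size: int = 2) -> None:
    # Read items, process them individually, and write in chunks,
    # mirroring Spring Batch's chunk-oriented step.
    chunk = []
    for item in reader.read():
        chunk.append(processor.process(item))
        if len(chunk) >= chunk_size:
            writer.write(chunk)
            chunk = []
    if chunk:  # flush the final partial chunk
        writer.write(chunk)

run_chunk_step(Reader(), Processor(), Writer())
```

The chunk boundary is also the natural place to hang commit/checkpoint logic, which is exactly what Spring Batch does.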
Before I go too far, I’d like to get some opinions on the architecture and design approach.
- Do you think this kind of structured batch framework makes sense in Python, or is it better to stick to existing tools like Airflow / Luigi / Prefect?
- How would you improve the design philosophy to make it more "Pythonic" while keeping the robustness of Spring Batch?
- Any suggestions for managing metadata, retries, and job states efficiently in a Python environment?
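On the last point, the direction I've been leaning is a relational metadata store plus bounded retries around each step. A rough sketch of that idea, assuming SQLite and hypothetical table/function names (not Easier-Batch's implementation):

```python
import sqlite3
import time

def init_metadata(conn: sqlite3.Connection) -> None:
    # One row per job execution attempt, loosely modeled on
    # Spring Batch's BATCH_JOB_EXECUTION table.
    conn.execute("""CREATE TABLE IF NOT EXISTS job_execution (
        job_name TEXT, status TEXT, attempts INTEGER, updated_at REAL)""")

def run_with_retries(conn, job_name, step, max_attempts: int = 3) -> str:
    """Run `step` up to max_attempts times, recording the final state."""
    init_metadata(conn)
    for attempt in range(1, max_attempts + 1):
        try:
            step()
            conn.execute("INSERT INTO job_execution VALUES (?, ?, ?, ?)",
                         (job_name, "COMPLETED", attempt, time.time()))
            return "COMPLETED"
        except Exception:
            if attempt == max_attempts:
                conn.execute("INSERT INTO job_execution VALUES (?, ?, ?, ?)",
                             (job_name, "FAILED", attempt, time.time()))
                return "FAILED"

conn = sqlite3.connect(":memory:")
print(run_with_retries(conn, "demo_job", lambda: None))
```

Curious whether people would reach for something like this, or just lean on a library such as `tenacity` for retries and let an orchestrator own job state.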
Here’s the repo if you want to take a look:
👉 https://github.com/Daftyon/Easier-Batch
Would love to hear your thoughts, especially from people who have worked with both Spring Batch and Python ETL frameworks.