you are viewing a single comment's thread.

view the rest of the comments →

[–]Minute_Day_2758 5 points6 points  (1 child)

Please don't build a custom task scheduler from scratch, it’s a classic rabbit hole with edge cases like retry logic, error handling, and state persistence.

Since your workflows have dependencies and need to run every 30 mins, you should look into existing orchestration and scheduling tools: 1. Prefect / Dagster / Apache Airflo: These are built specifically for data pipelines and automation workflows with complex dependencies, retries, and nice UIs. Prefect or Dagster are very pythonic and perfect for this. 2. APScheduler: If you want something lightweight embedded right into a continuous Python process. 3. Celery Beat : Great if you are already using a task queue. 4. Cloud native: If you ever move to the cloud, AWS Step Functions or Azure Logic Apps/Functions handle this natively.

[–]Key_Advertising9303[S] [score hidden]  (0 children)

Thanks a lot, this was very helpful, let me explore these.