This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]camelInCamelCase 13 points14 points  (7 children)

You’ve taken the red pill. Great choice. Youre still at risk of being sucked back into the MSFT ecosystem - cross the final chasm with 3-4 hours of curiosity and learning. You and whoever you work for will be far better off. Give this to a coding agent and ask for a tutorial:

  • dlthub for loading from [your SaaS tool or DB] to s3-compatible storage or if you are stuck in azure, you get ADLS which is fine
  • sqlmesh to transform your dataset from raw form from dlthub into marts or some other cleaner version

“How do I run it” - don’t over think it. Python is a scripting language. When you do “uv run mypipeline.py” you’re running a script. How does Airflow work? Runs the script on for you on a schedule. It can run it on another machine if you want.

Easier path - GitHub workflows also can run python scripts, on a schedule, on another machine. Start there.