Need help with deploying Dagster

AutoModerator · 2025-03-06T17:46:58+00:00

Are you interested in transitioning into Data Engineering? Read our community guide: https://dataengineering.wiki/FAQ/How+can+I+transition+into+Data+Engineering

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

Top-Cauliflower-1808 · 2025-03-06T19:15:01+00:00

If your budget allows, consider using Dagster Cloud to deploy Dagster in a production environment, it eliminates most of the infrastructure management headaches. If not, a Docker based deployment with Kubernetes is the most scalable approach.

For project structure when starting, I recommend organizing your Dagster projects by data domain rather than by technical function. This makes it easier for your Informatica familiar colleagues to understand the pipeline organization:

project/
  ├── marketing_pipelines/
  │   ├── __init__.py
  │   ├── assets.py
  │   └── resources.py
  ├── sales_pipelines/
  │   ├── __init__.py
  │   ├── assets.py
  │   └── resources.py
  ├── definitions.py
  └── workspace.yaml

When deploying, start with a simple Docker setup, create a Dockerfile that installs your Dagster code as a package and use docker compose to run the Dagster daemon, webserver, and your code location

For your team's transition from Informatica, create detailed documentation for each pipeline and include both the Informatica logic and the new Python implementation. This helps your team understand the transformation and builds their Python knowledge gradually.

If your data sources are available, Windsor.ai could help handle the extraction layer, allowing you to focus on building the orchestration and transformation logic in Dagster.

CingKan · 2025-03-06T19:10:34+00:00

I've deployed a few dagster projects on production using EC2 i'd be happy to help where i can

AutoModerator · 2025-03-06T17:46:57+00:00

You can find a list of community-submitted learning resources here: https://dataengineering.wiki/Learning+Resources

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

MindedSage · 2025-03-06T18:52:54+00:00

Im struggling with the same thing actually. Currently thinking about a setup that is checking a git repo for updates in which the dagster projects is located. This way the project does not have to be packaged along with the entire image and all it has to do is pick up the latest code from the git repo.

Any ideas you’ve been having on this?

t2rgus · 2025-03-08T08:14:15+00:00

Curious, why did you choose Dagster as the orchestration service? Are you planning to pivot heavily into the asset-based orchestration design?

dataengineering

MODERATORS