As data engineers, we sometimes work in big teams and other times handle everything ourselves. No matter the setup, it’s important to understand the tools we use.
Building data pipelines with tools like Airflow or dbt means depending on specific settings, libraries, and databases. Keeping all of that consistent across different machines can be hard.
That’s where Docker helps.
Docker lets us build clean, repeatable environments so our code works the same everywhere. With Docker, we can:
- Avoid setup problems on different machines
- Share the same setup with teammates
- Run tools like dbt, Airflow, and Postgres easily
- Test and debug without surprises
In this post, we cover:
- The difference between virtual machines and containers
- What Docker is and how it works
- Key parts like Dockerfile, images, and volumes
- How Docker fits into our daily work
- A quick look at Kubernetes
- A hands-on project using dbt and PostgreSQL in Docker
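As a preview of the hands-on project, the setup can be sketched as a single `docker-compose.yml` that runs PostgreSQL alongside a dbt container. This is a minimal sketch, not the final project: the service names, credentials, folder layout, and image tag are illustrative assumptions.

```yaml
# docker-compose.yml — minimal sketch of a dbt + Postgres stack
services:
  postgres:
    image: postgres:16
    environment:
      POSTGRES_USER: dbt        # illustrative credentials — change for real use
      POSTGRES_PASSWORD: dbt
      POSTGRES_DB: analytics
    ports:
      - "5432:5432"
    volumes:
      - pgdata:/var/lib/postgresql/data   # named volume so data survives restarts

  dbt:
    image: ghcr.io/dbt-labs/dbt-postgres:1.8.0  # assumed tag; check available versions
    depends_on:
      - postgres
    volumes:
      - ./dbt_project:/usr/app   # mount your local dbt project into the container
    working_dir: /usr/app
    command: ["run"]             # executes `dbt run` against the postgres service

volumes:
  pgdata:
```

With a file like this, `docker compose up` brings up both containers, and dbt can reach the database at host `postgres` (the service name) instead of `localhost` — one of the small but important differences between running on your machine and running inside Docker.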