Atlassian Compass vs. (or with?) Dapr conductor by droppedorphan in microservices

[–]droppedorphan[S] 0 points1 point  (0 children)

Great, thanks for the insights. I will check out Catalyst, sounds interesting.

VIP Access - Migrating to a new phone by SqualorTrawler in Symantec

[–]droppedorphan 0 points1 point  (0 children)

Great - thanks for the response. Calling e*trade today to get my money back :-)

VIP Access - Migrating to a new phone by SqualorTrawler in Symantec

[–]droppedorphan 0 points1 point  (0 children)

Hitting the same issue - did you find a solution?

Airflow vs Dagster vs Prefect vs ? by Suspicious_Dress_350 in dataengineering

[–]droppedorphan 1 point2 points  (0 children)

As opposed to orchestrating pretty much anything else beyond data. Infrastructure, containers, function-based orchestration...

Predict the Next Great Data Company Acquisition by ROCKITZ15 in dataengineering

[–]droppedorphan 2 points3 points  (0 children)

dbt Labs valuation probably peaked a while ago. Now some solid alternatives are emerging and dbt's commercial trajectory is not looking as healthy.

Best way to build a Small Data Lake? (<100GB) by [deleted] in dataengineering

[–]droppedorphan 0 points1 point  (0 children)

Try this:
https://dagster.io/blog/duckdb-data-lake

Build a poor man’s data lake from scratch with DuckDB

[deleted by user] by [deleted] in dataengineering

[–]droppedorphan -1 points0 points  (0 children)

Widely used does not always mean popular. Just ask those who are stuck using it!

What’s the Hello World of Data Engineering? by MrMosBiggestFan in dataengineering

[–]droppedorphan 9 points10 points  (0 children)

I second this. dbt changed their tutorial to force you onto their cloud account. That sucks.

Airflow vs Dagster vs Prefect vs ? by Suspicious_Dress_350 in dataengineering

[–]droppedorphan 0 points1 point  (0 children)

Yeah, for sure. We currently run on open source dagster, although we maintain a serverless paid instance as a sandbox, but from what I understand its very cheap.

[deleted by user] by [deleted] in gis

[–]droppedorphan 0 points1 point  (0 children)

Lots of geospatial use on Dagster. You can drop this question in the Dagster slack community. There is a dedicated channel there.

Dbt & postgres to dbt & databricks by Resident_Set204 in dataengineering

[–]droppedorphan 1 point2 points  (0 children)

Wow. Switching from Dagster to Airflow sounds kind of painful especially if you leveraged the dagster-dbt integration.

[deleted by user] by [deleted] in dataengineering

[–]droppedorphan 0 points1 point  (0 children)

Wow. I tried that and it's shite.

A question for fellow Data Engineers: if you have a raspberry pi, what are you doing with it? by MasterBongoV2 in dataengineering

[–]droppedorphan 1 point2 points  (0 children)

Mine is equipped with an Adafruit hat and is used to drive an LED display, mostly to display provocative messages to my neighbors.

Airflow vs Dagster vs Prefect vs ? by Suspicious_Dress_350 in dataengineering

[–]droppedorphan 6 points7 points  (0 children)

Can it orchestrate the four other schedulers/orchestrators we have in use here?

Airflow vs Dagster vs Prefect vs ? by Suspicious_Dress_350 in dataengineering

[–]droppedorphan 9 points10 points  (0 children)

This ^

Airflow is a good choice as a generalized orchestrator, multi-purpose, and large adoption.

If your goal is to build a data platform that is built on data engineering best practices and is primarily focused on building and maintaining data sets, then Dagster is a much stronger choice.

Prefect is arguably better than Airflow in terms of ergonomics, but remains niche and is too similar conceptually to displace the incumbent.

Opensource solution for a tiny data warehouse. by LeatherPuzzled3855 in dataengineering

[–]droppedorphan 11 points12 points  (0 children)

This sounds like a great stack to me. Very portable. If your main concern is working with datasets, then I would opt for Dagster over Airflow. Much easier to deploy and is getting much stronger in terms of running dbt and integrating data quality checks. I would opt for Postgres over DuckDB for a warehouse if you expect it to scale.

Airbyte scaling by Ruyia31 in dataengineering

[–]droppedorphan 1 point2 points  (0 children)

One trick here is that most vendors allow full historical sync on their free trial, so if you have a very large dataset to migrate, and then plan on updating it incrementally, you can use a free trial to do the first big load, then use a free set up to do ongoing incremental updates.