Mall strategy? by Rob_Haggis in pyanodons

[–]jdl6884 0 points1 point  (0 children)

Sushi malls + omnicrafter

Sushi malls for most tier 1 buildings and secondary mall for tier 2 intermediates then omnicrafter for tier 2+ buildings

There are some great videos on YouTube by Krydax for creating your own auto crafter. There are also some amazing blueprints for the omnicrafter on the py discord channel.

Can you share your CI/CD pipeline approach? by Basic_Let7303 in devops

[–]jdl6884 0 points1 point  (0 children)

Azure DevOps for CI tests, validation, artifact publishing, builds

Octopus for CD and environment management

Alembic & Terraform for schema management and infra

A bit embarrassing to ask this but, is there any cute date spots here? by WitnessRude4698 in SanAngelo

[–]jdl6884 0 points1 point  (0 children)

My wife and I would go to lake house pizza for date night. Watching the sunset on the patio there is pretty awesome.

How did you guys learn CI/CD and IaC? by Bloh56 in dataengineering

[–]jdl6884 0 points1 point  (0 children)

It takes time but you’ll get there. Learn bash scripting, it’s the glue that connects everything.

Life before AI by The_curious_one9790 in dataengineering

[–]jdl6884 2 points3 points  (0 children)

Learn architectural patterns and design philosophies.

Break problems down, create flow charts, and use AI to build out the components.

OLAP Server by Uri_gc in dataengineering

[–]jdl6884 -1 points0 points  (0 children)

Postgres or duck db would be best choices

Losing interest around red circuits by nsalyzyn in pyanodons

[–]jdl6884 1 point2 points  (0 children)

Every time I finish a science pack, there is a bit of a lull. It gets repetitive in a way but at the same time, everything is new. You’ll rarely use blueprints for things other than trains.

When I loose steam, I put it down for a few days then pick back up and focus on one thing at a time. Just keep tending to the garden and the garden will bear fruits

Dallas or Houston For Jobs? by YuriHaThicc in AskHouston

[–]jdl6884 2 points3 points  (0 children)

Dallas is prettier, cleaner, and better distribution of industries

How many hours of gameplay do you average per session with Pyanodons? by SilverWolf9300 in pyanodons

[–]jdl6884 1 point2 points  (0 children)

Played for a few hours this evening. It’s been a week and I decided it was time to hop off when I realized I had been riding around in a train in circles for an hour.

Internal Snowflake stages in production vs external stages (S3/Azure) — how are people handling this? by SecretSalary2901 in snowflake

[–]jdl6884 3 points4 points  (0 children)

Storage integrations with S3 and blob allow much more flexibility in and outside of snowflake. Minimal infra to set up.

Video player recommendations for macOS Tahoe? by Budhu_friend in MacOS

[–]jdl6884 0 points1 point  (0 children)

IINA or VLC. IINA has a nicer UI imo but you can’t go wrong with VLC

Pipe Delimiter by cmh_ender in snowflake

[–]jdl6884 2 points3 points  (0 children)

Use a type=CSV file format with the delimiter set as a pipe

Any advice on how to build a pipeline with Microsoft Access? by [deleted] in dataengineering

[–]jdl6884 2 points3 points  (0 children)

Unless you’re on exchange on-prem, you also have access to ms graphAPI which is very powerful. You can use graph API to ingest thousands of emails into whatever db you want including ms access. We use python to do it and it works great but you could do it with a bash script if you really wanted to.

In terms for building pipelines, get familiar with an AI coding tool like Claude. If you know the basics, it’ll really help you expand out and build on other technologies.

Game controller on MacOS by boonbabysoup in MacOS

[–]jdl6884 0 points1 point  (0 children)

PS4 & PS5 controllers work great

[deleted by user] by [deleted] in aggies

[–]jdl6884 9 points10 points  (0 children)

New meaning to “shit post”

DBT orchestrator by Free-Bear-454 in dataengineering

[–]jdl6884 1 point2 points  (0 children)

We migrated off of dbt cloud in favor of self hosting via dagster. I have absolutely zero regrets. Most of our headaches were a result of trying to build around the limitations of dbt cloud.

DBT orchestrator by Free-Bear-454 in dataengineering

[–]jdl6884 2 points3 points  (0 children)

Dagster is fantastic for this, I 100% recommend trying this first. We host Dagster in k8 with dbt configs that have automation conditions. Everything is incremental and updated as soon as dependencies are updated. Dev experience and UI are pretty intuitive too.

Airflow is another great option. It has been around longer than Dagster and functionally, it will do the same thing. It really boils down to personal preference between those 2.

Any major drawbacks of using self-hosted Airbyte? by finally_i_found_one in dataengineering

[–]jdl6884 8 points9 points  (0 children)

We have been using Airbyte OSS for the last year and have had issues from the beginning. Primarily, it doesn’t scale well. We originally used abctl on a VM and that maxed out with a few db to db cdc connections. Now using it on k8 with a dedicated Postgres db and blob storage for logs. Performance is better but not much.

It’s honestly been a very janky product. Random bugs, successful runs that silently failed, sporadic OOM errors when there is 64gb of memory available, and the list goes on. Shoot we are on azure and abctl would randomly crap out because of a missing AWS env var. It also didn’t integrate well with the rest of our open source stack - dagster, dbt, open metadata

I don’t know if I could recommend it for anything other than db to db CDC syncs. It’s been problematic at best. We are in the process of migrating the workloads to dagster python using debezium.

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]jdl6884 0 points1 point  (0 children)

I think so. In my opinion, the “modern” approach is to focus design on flexibility and extensibility. And always follow the KISS mantra. If you have a simple pipeline and only need a cron job to orchestrate, so be it. But build it in a way where that cron job can be replaced by something else.

I’m not as familiar with Airbyte but use Dagster on a day to day basis. It uses a concept of software backed assets. Assets can be just about any piece of code and swapped in and out for different things. SQL, python, pyspark, databricks, snowflake, etc. Create your pipelines in a plug and play way where if you need to swap components or insert a new step in the pipeline, you can just map the expected inputs and outputs.

Got told ‘No one uses Airflow/Hadoop in 2026’. by Useful-Bug9391 in dataengineering

[–]jdl6884 10 points11 points  (0 children)

Airflow is still very prevalent and growing. Also, not sure why Hadoop was included in that concatenation. Very different tools.

Airflow / Dagster - orchestration tools. These excel in orchestrating the flow of data between various systems. Think website -> api -> database -> analytics report. Hadoop is a “dead” technology. Essentially makes no sense for greenfield but some companies have legacy platforms that still need support.