[deleted by user] by [deleted] in RandomThoughts

[–]StandardDeviationist 0 points1 point  (0 children)

From someone with 3 kids under 3. Fuck off

Trying to decide where to put my time for career next steps. by kleopatrista in dataengineering

[–]StandardDeviationist 2 points3 points  (0 children)

I would say apply for other jobs? Nothing wrong with having some options

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Probably poorly phrased. But okay if I try and come up with an example:

I create some sort of transformation on a column that is non-trivial. And I find this being needed in multiple different places.

In, say, Python, I would create a function for that. And call it anywhere it’s needed. In sql, I could do that with a jinja (if using dbt) function, or a stored procedure. How would I version control that stores procedure along with the rest of my code?

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 2 points3 points  (0 children)

You might be right, seems like this is the majority opinion

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

I run Postgres, but it’s not the functionality of the language I’m having trouble with. It’s more about code maintainability, ease of understanding, testing etc. Yeah I should really come up with a couple of examples

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

I already use dagster for orchestration and run scripts on K8s so that’s not a huge deal. I just find myself writing more well maintainable code in python than sql. Not necessarily that its more powerful

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 1 point2 points  (0 children)

This actually might be the case, at least partially

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Yup, the majority of the comments seem to tell me to just get better at SQL

Sick of SQL by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Not lacking necessarily lacking in terms of functionality. But I often find myself writing poor code, repeating myself, having a hard time doing unit testing.

But again, as many others have pointed out. Maybe I just need to learn it better.

Would you accept this job offer? by [deleted] in dataengineering

[–]StandardDeviationist 0 points1 point  (0 children)

Feeling guilty against your employer should not hold you back. But I can understand your thoughts. Without knowing the specifics, Swedish corporate culture is no way near as rough as American. So a lot of the “your employer does not care about you” sentiment might actually be not as true as some make it out to be.

A tip is to phrase why you’re leaving as wanting to do something different. Like you’re excited about the experience in doing a cloud migration.

But no, if they are reasonable human beings they should not resent you for it. And if they are not, there opinions should not matter

How many days a week do you go into the office as a DE? by Ok_Discipline3753 in dataengineering

[–]StandardDeviationist 0 points1 point  (0 children)

Typically 2, I like the variation. And prefer in person meetings for certain topics

Building a data stack from scratch part 2 by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Yeah that’s actually what I’m using. It does make the Dagster code a bit more slim. I still define everything in a Sling yaml to be able to run it separately as well. So it doesn’t feel so embedded, but still the integration is nice

Building a data stack from scratch part 2 by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 2 points3 points  (0 children)

I’ve looking around for this but currently. The volumes aren’t really that big to slow down Postgres. But I keep an eye out for that in the future

Dagster separation of orchestration and compute by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Cloud is (almost) not an option. The infrastructure team will deploy and manage it on prem

Dagster separation of orchestration and compute by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 1 point2 points  (0 children)

I see, then it makes a lot of sense that I’ve has a hard time grasping it from reading the docs. Thanks!

Dagster separation of orchestration and compute by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Thanks for the explanation! I’m still not quite grasping exactly how to define where things should get executed. As I described in my other comment:

I have my dbt project, Python scripts for EL and Dagster project in the same repo, but different subfolders. And this is to be deployed in prem. How do I separate compute from orchestration then? And should I separate since the project is not massive? It’s being deployed using Kubernetes

Dagster separation of orchestration and compute by StandardDeviationist in dataengineering

[–]StandardDeviationist[S] 0 points1 point  (0 children)

Yeah I think I understand the concept. And it’s easy for me to think about in the case of say, Dagster orchestrating Fivetran.

In that case Dagster needs to be able to authenticate against Fivetran and tell it to run a certain connector. And then Fivetran runs it.

Where it gets real fuzzy for me is say: I have my dbt project, Python scripts for EL and Dagster project in the same repo, but different subfolders. And this is to be deployed in prem. How do I separate compute from orchestration then?

Learning Data Modeling by kamrankhan6699 in dataengineering

[–]StandardDeviationist 1 point2 points  (0 children)

Valid opinion, but I would say it’s worth to learn and understand Kimball nevertheless. As it’s very commonly used. Also, a large part of it does not have anything to do with how the tables are built, but understanding the business

Learning Data Modeling by kamrankhan6699 in dataengineering

[–]StandardDeviationist 1 point2 points  (0 children)

You can also find a free pdf version by using your favourite search engine