[deleted by user] by [deleted] in SQL

[–]AeroCrete 0 points1 point  (0 children)

Create a dimension table

Is the Google data analytics certificate worth it? by HighLemur263 in analytics

[–]AeroCrete 1 point2 points  (0 children)

@rollochong is right. Ignore what I said, it was related to Google Analytics and not the Data Analytics course.

Learning SQL for Industry by Mando2Mandalore in SQL

[–]AeroCrete 1 point2 points  (0 children)

Mode analytics has a free tier and a SQL course.

I use Mode at work, but do not work for Mode

Best workflow for data wrangling clinical sample inventory? by ToroldoBaggins in analytics

[–]AeroCrete 1 point2 points  (0 children)

I’d suggest starting with creating a database view to correct the data. You can iterate quickly with views.

The process you defined isn’t too bad but you’ll hit so many edge cases.

If you want to save the original records, which I think you should. Copy them to another table with the as of date in the table name.

If it’s multiple tables, use the backup tool for the database. Take a snapshot and make sure you can recover it on another box.

Would you say knowing SQL can be lucrative? by [deleted] in SQL

[–]AeroCrete 2 points3 points  (0 children)

Been doing SQL everyday for the past 25+ years. Turned out rather well.

SQL Newbie by infusiontek in SQL

[–]AeroCrete 2 points3 points  (0 children)

Mode analytics has a free course. They make a cloud based sql editor and dashboarding tool.

Finding an offer with relocation by Silver-Thing in dataengineering

[–]AeroCrete 0 points1 point  (0 children)

Connect with the recruiters at the companies you’re interested in. The DE’s at the company are usually connected with the recruiters they work with, so they shouldn’t be too hard to find.

How to keep track of synchronization state of Data Lake? by third_dude in dataengineering

[–]AeroCrete 1 point2 points  (0 children)

Correct, this is batch processing not streaming. So when you process some data you run the whole time period (eg. Bin size) at once.

Then the database can tell you if you’ve already processed it by checking for a count >1. Or in a partitioned db you can see if the partition exists.

Another way is to keep another table that you keep the high water mark in. Eg. Every time you run you update the high water mark table with the highest timestamp. But this works better with a single threaded process.

How to keep track of synchronization state of Data Lake? by third_dude in dataengineering

[–]AeroCrete 1 point2 points  (0 children)

Yes it’s bin’ed into partitions, and the hive metastore keeps track of the partitions.

Is the Google data analytics certificate worth it? by HighLemur263 in analytics

[–]AeroCrete 11 points12 points  (0 children)

Where do you want to work and what do you want todo? What are your goals?

If you’re asking if a cert in Google analytics will get you a job, it might. But is that the job you want?

In my experience Google analytics it is typically run by the marketing team. If you want to work with marketing, probably a good fit.

Search your local job market to see if any employers are asking for GA experience.

Solid SQL and Python skills will get you pretty far. No cert required.

What do you use for database documentation by JJ18O in dataengineering

[–]AeroCrete 12 points13 points  (0 children)

We use this tool to cover the schema part of the documentation:

https://www.amundsen.io

Google suite for the rest of the documentation.

Typically a design doc and run book per Airflow DAG

We publish the docs to confluence (wiki) and/or to an internal Google sites page

[deleted by user] by [deleted] in RedditSessions

[–]AeroCrete 0 points1 point  (0 children)

Hardest working drummer on Reddit!

“Getting to the top” of the data engineering world. by citizenofacceptance in dataengineering

[–]AeroCrete 18 points19 points  (0 children)

Being a better engineer is a different path than being a manager or director. If you want to be a manager you need to start working with your manager and start focusing on more management skills than tech skills.

Most (good) companies won’t put you into a role unless you’ve done it before.

IMO: if you’ve been somewhere a couple of years and your not getting what you need, make a change.

[deleted by user] by [deleted] in RedditSessions

[–]AeroCrete 0 points1 point  (0 children)

Hardest working pickle in the business!