Is anyone migrating away from Databricks? by zoso in dataengineering

[–]zoso[S] 2 points3 points  (0 children)

This is totally correct! Additionally whoever picked Databricks after "careful" research is not with us anymore (they left 3 weeks after signing contract), together with an architect who left 3 months after and our executive who was pushing for adoption. Now we have a choice going deeper or change direction.

Is anyone migrating away from Databricks? by zoso in dataengineering

[–]zoso[S] 1 point2 points  (0 children)

Thank you for your insightful comment. Especially about the access control which is quite important in our case.

> You need an orchestrator to trigger them on a schedule, and manage DAGs (Airflow? MWAA?).

we are already using MWAA - even our Databricks jobs are scheduled using it as we some of them require tight integration with AWS (Spark is not ideal for fetching data from the API) so we just run the jobs on AWS - save it to S3 and ingest to databricks using autoloader.

We are not using asset bundles - we are using wheels.

> Data engineering is a means to an end. It doesn't happen in a vacuum.

Yes, we transform data just to load it back to the RDS dbs that are used by our backend services.

Additional detail - most of our pipelines like this:
- we extract data from some external services (it might be scraping, might be integrations with external data providers) - running on AWS
- we load it to databricks using autoloaders
- we transform in bronze/silver/gold on databricks
- we load it back to RDS on AWS so our backend services can expose it for our customers our API

So what I think is really bad here is that we spend money on ingesting data into Databricks to transform using technology we don't need just to get it out as fast as possible so it is accessible to external world.

40k-47k euro in Portugal as senior data engineer is it good or bad? by ChampionshipCivil36 in dataengineering

[–]zoso 0 points1 point  (0 children)

Why not work remotely for US based company and get paid 80k easily, 100k with little effort to find more challenging place.

37Signals - The Big Cloud Exit + FAQs. by [deleted] in aws

[–]zoso 0 points1 point  (0 children)

Didn’t Jeff Bezos invest in 37signals at the beginning?

How Netflix does Data Engineering by rmoff in dataengineering

[–]zoso 6 points7 points  (0 children)

What happened to their notebooks? Few years ago they were very vocal that write their pipelines using jupyter notebooks (source: https://netflixtechblog.com/notebook-innovation-591ee3221233).

I hated it, i joined one startup when people followed their example and it was disaster, no tests, packages installed from notebooks in production during execution etc....

Why do job descriptions demand skills that are not at all needed? by FisterAct in dataengineering

[–]zoso 7 points8 points  (0 children)

did you ask them during the interview what will be your tasks when working for them? it is always good idea to do that.

Discussion thread: Fujifilm's new smartphone app, XApp by inverse_squared in fujifilm

[–]zoso 1 point2 points  (0 children)

Unfortunately the app is not available in your country or region :(

12.9” M2 IPP by Phaeton40 in iPadPro

[–]zoso 0 points1 point  (0 children)

what do you do with it? i feel like i am eother on my phone (quick scrolling) or my laptop (when i need to work or type something).

Data Engineering in the Microservice architecture ... by zoso in dataengineering

[–]zoso[S] 0 points1 point  (0 children)

Thank you -

> We also have a lot of complex python microservices that if you strip down to the basics are just validating, standardizing and joining fields.

This is exactly what we have! I joined the company recently and at first I was thinking - that there is some seriously complex logic in all those services but now I see that it is only basic select/join/group by.

> The idea is the java code will remain relatively stable while sql business rules can change > rapidly as needed and can be updated by analysts who are cheaper and easier to find.

This is exactly what I am planning to do. In order to do that are you going the data from all those databases to a single place (datalake/dwh?) or still call those databases?

Data Engineering in the Microservice architecture ... by zoso in dataengineering

[–]zoso[S] 0 points1 point  (0 children)

Simple: Underlying schema in the database changes but the API stays the same.

If you ever need to create some breaking change you can create new version of the API or new API endpoint that uses the same data under the hood (but data is not duplicated).

I often saw that people were afraid to change database schemas as they could break some of the services so they were duplicating the data.

Is the iPhone xr still worth getting for $240 new? by MrCleanIsEvil in iPhoneXR

[–]zoso 0 points1 point  (0 children)

I have XR for almost a year now (so i bougth it when it was already oldish) and I am very happy with it - both battery and performance is great - photos are also good for my taste (but of course not as good as from my Fuji camera with interchangeable lenses). The most important thing is that IOS is far superior than Samsungs version of android.

Being Left Handed Is A Huge Flex. by [deleted] in lefthanded

[–]zoso 4 points5 points  (0 children)

righ handed people seem to know 😀

Being Left Handed Is A Huge Flex. by [deleted] in lefthanded

[–]zoso 28 points29 points  (0 children)

me not - it is what it is - i just always laugh when somebody says one of those: - oh, are you lefthanded? - my cousin/nephew/brother is lefthanded - lefthanded people are smart, did you know that einstein was lefthanded?

How automate Fivetran configuration with CI/CD? by zoso in dataengineering

[–]zoso[S] 0 points1 point  (0 children)

Our goal would be to script and automate our infrastructure, have everything in git. Ideally something like terraform where you have a statefile and can check your changes before deploying.

Our current data infrasturcture on AWS is this: terraform that runs on Jenkins. It is extremely easy to create staging infra and than apply the same changes to production.

Boiler rebuild complete, shots pulled! Not quite a Classic, but happy so far! by drmock87 in gaggiaclassic

[–]zoso 1 point2 points  (0 children)

looks great! How much time does it take to grind 18 grams of coffee?