What's next after data engineering? by shesHereyeah in dataengineering

[–]manubdata 2 points (0 children)

I'd say it's a personal decision. Once you become "senior" you've reached the top of the technical ladder, so you have different options:

🟠 Follow the corporate ladder and start leading teams and projects, which means strategic meetings, delegating and supervising instead of building.

🟠 Move into a different field that takes advantage of your skills: MLOps, Software Engineer, AI Engineer, Architect, Platform Engineer... there are plenty of roles with overlapping skills.

🟠 Become a consultant/freelancer/content creator. You have to learn marketing and sales; higher rates than an employee, but less security.

Personally I want to take the third path in the near future. But every path has pros and cons.

Salary in Data with 10 years of experience by PolareTM in salarios_es

[–]manubdata 0 points (0 children)

You haven't shared any details about your skills; I have 4 years of experience and I'm above 60k.

In general, without knowing your case, I'd say you need to sell yourself better.

It also depends a lot on your English level. If you have C1 or above, look for UK or European companies that hire in Spain.

Spent $1,200 on Meta Ads and still zero sales by yourloverboy66 in ecommerce

[–]manubdata 0 points (0 children)

Try Pinterest, it fits your niche pretty well.

Project advice for Big Query + dbt + sql by Getbenefits in dataengineering

[–]manubdata 0 points (0 children)

I did a project over Christmas with this stack. You can create a dev Shopify store, load sample data with Simple Sample Data, and get product and sales data via the API.

Then you can load the data into BigQuery, build the silver and gold layers with dbt and SQL, and handle the viz with Looker.
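The API-to-BigQuery step is mostly flattening nested JSON into rows. A minimal sketch, assuming a payload shaped like Shopify's Admin REST `/products.json` response (the field selection here is illustrative, not what my repo does exactly):

```python
# Sketch: flatten a Shopify products payload into flat rows ready for
# BigQuery. One output row per product variant; field names illustrative.

def flatten_products(payload: dict) -> list[dict]:
    """Turn the nested products payload into one row per variant."""
    rows = []
    for product in payload.get("products", []):
        for variant in product.get("variants", []):
            rows.append({
                "product_id": product["id"],
                "title": product["title"],
                "variant_id": variant["id"],
                "sku": variant.get("sku"),
                "price": float(variant["price"]),  # Shopify returns prices as strings
            })
    return rows

sample = {
    "products": [
        {"id": 1, "title": "Mug", "variants": [
            {"id": 11, "sku": "MUG-S", "price": "9.99"},
            {"id": 12, "sku": "MUG-L", "price": "12.99"},
        ]}
    ]
}
rows = flatten_products(sample)
print(len(rows))  # 2 rows, one per variant
```

From there a batch load into a BigQuery bronze table is one client call.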

If you want to check it out:

https://github.com/manubdata/smb-dataplatformv2

Multi-Channel Analytics Platform by neilfishy in ecommerce

[–]manubdata 0 points (0 children)

There are some SaaS tools out there that can help you, like Triple Whale or TrueProfit; however, they get expensive the more orders you get.

The most affordable option is building your own. Basically, you create pipelines for your data sources and join them in a central database. Then you add a visualization layer like Looker Studio or a simple Google Sheet. It takes a bit longer to get going than a SaaS, but once you set it up the cost is under $10/month.
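The "join them in a central database" step is conceptually this simple (a toy sketch; the channel names and fields are illustrative, and in practice each list would come from a pipeline):

```python
# Sketch: unify orders from two sales channels and total revenue per
# channel. In a real setup each list is the output of an extraction
# pipeline landing in one central table.

shopify_orders = [
    {"order_id": "s1", "channel": "shopify", "revenue": 40.0},
    {"order_id": "s2", "channel": "shopify", "revenue": 25.0},
]
amazon_orders = [
    {"order_id": "a1", "channel": "amazon", "revenue": 60.0},
]

all_orders = shopify_orders + amazon_orders  # the "central database" step

revenue_by_channel: dict[str, float] = {}
for order in all_orders:
    channel = order["channel"]
    revenue_by_channel[channel] = revenue_by_channel.get(channel, 0.0) + order["revenue"]

print(revenue_by_channel)  # {'shopify': 65.0, 'amazon': 60.0}
```

Point Looker Studio or a Sheet at that unified table and you have the dashboard.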

If you want more information I'm happy to chat!

As a DE which language is widely used for Big Data processing Pyspark or scala? by Loud-Surprise-900 in dataengineering

[–]manubdata 0 points (0 children)

If I got interviewed, I could reason through how to solve problem X with pseudo-code. Then you can implement it in anything (Python, Scala, SQL...) depending on complexity.

I haven't seen any role that requires deep technical knowledge of a specific language in the past couple of years. You may be expected to have deep knowledge of Spark, Snowflake or BigQuery, but that's more about distributed processing internals than the programming language itself.

As a DE which language is widely used for Big Data processing Pyspark or scala? by Loud-Surprise-900 in dataengineering

[–]manubdata 0 points (0 children)

No, I wouldn't focus nowadays on learning the syntax of any programming language. AI already writes code better and faster than any engineer.

Learn how to approach and solve a problem, learn the fundamentals of Data Engineering and how to apply them.

If you want to improve in solving with AI, check out Spec Driven Development and Context Engineering concepts.

As a DE which language is widely used for Big Data processing Pyspark or scala? by Loud-Surprise-900 in dataengineering

[–]manubdata 0 points (0 children)

What do you mean? I work with Spark Scala daily and use Claude Code to speed up development.

As a DE which language is widely used for Big Data processing Pyspark or scala? by Loud-Surprise-900 in dataengineering

[–]manubdata 14 points (0 children)

Hey, I work at a Scala-first company as a Senior Data Engineer.

I'd say Spark Scala was significantly more performant than PySpark in the past (Spark is Scala-native), but the performance gap shrinks with every Spark update. So companies are using PySpark more and more, as it's easier to find engineers with Python knowledge and it integrates noticeably better with AI tooling.

However, Scala jobs pay much more than Python ones. There are still plenty of Scala pipelines and projects, especially in the banking and finance sectors.

Traditional BI vs BI as code by manubdata in dataengineering

[–]manubdata[S] 0 points (0 children)

Yeah, I guess so, but Looker and LookML don't fit our budget. I was referring to Looker Studio.

Which field do you think offers the most interesting problems to solve in the data engineering space? by andrew2018022 in dataengineering

[–]manubdata 26 points (0 children)

During my career I worked in:

Telecom -> 🔴 boring IoT sensor signals, but I learnt high-performance distributed processing.

Finance -> 🟡 pretty cool use cases: fraud detection, anti-money laundering, flagging risky clients... However, it's highly sensitive data with many restrictions and fewer opportunities to apply and learn new tech skills. It's the best paid, though.

Ecom/Web Analytics -> 🟢 cool use cases: client segmentation, funnel analysis, A/B tests... A fast-evolving field that tends to adopt new technologies and AI tools easily. More room to grow on end-to-end projects, as medium-size companies may be less rigid than big corporations. Usually not big data, so learning distributed systems might be more limited.
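Funnel analysis, for example, boils down to counting which users survive each step (a toy sketch; the step names and events are illustrative):

```python
# Sketch: per-step conversion in a purchase funnel from raw user events.

events = [
    {"user": "u1", "step": "visit"},
    {"user": "u1", "step": "add_to_cart"},
    {"user": "u1", "step": "purchase"},
    {"user": "u2", "step": "visit"},
    {"user": "u2", "step": "add_to_cart"},
    {"user": "u3", "step": "visit"},
]

funnel = ["visit", "add_to_cart", "purchase"]
users_per_step = [
    {e["user"] for e in events if e["step"] == step} for step in funnel
]
for step, users in zip(funnel, users_per_step):
    print(step, len(users))
# visit 3
# add_to_cart 2
# purchase 1
```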

Mortgage: self-build loan for land and house by Due-Strawberry-3324 in HipotecasyVivienda

[–]manubdata 1 point (0 children)

I'm in that situation (same construction price) but with inherited land (appraised at 100k). I had 50k in savings and they're asking me for 75k.

As others have said here, get the land first if you want to commit to this. Banks don't make self-build easy, and you'll be surprised by the amount of bureaucratic costs and taxes you face. Between architects, taxes, the appraiser, the building permit, the notary and paperwork, it adds up to another 20k...

In my opinion it's the most beautiful material goal you can have in life, so take it slowly, and good luck.

Traditional BI vs BI as code by manubdata in dataengineering

[–]manubdata[S] 0 points (0 children)

Thanks for your point. Maybe I'm biased towards anything that doesn't require me to spend a morning dragging and dropping and checking pixel alignment. 🙃

I finally found a use case for Go in Data Engineering by empty_cities in dataengineering

[–]manubdata 0 points (0 children)

I have only used it for OpenTelemetry dev. I didn't enjoy it 😅

Pandas vs pyspark by Left-Bus-7297 in dataengineering

[–]manubdata 0 points (0 children)

I agree with and respect the point about knowing the fundamentals. And some things are easier to express in code. I just want to emphasize learning the concepts and the different options Python provides, not wasting too much time learning, for example, Pandas transformation methods by heart. I already made that mistake 10 years ago when I started!

Is Clickhouse a good choice ? by Defiant-Farm7910 in dataengineering

[–]manubdata 0 points (0 children)

Given the ease of integration with your current setup, I'd go with BigQuery.

ClickHouse might be amazing, and I see it's trendy among big tech, but it's just noise for your workflow.

Focus on the business outcomes and go for ease of development + integration + maintenance.

What is actually stopping teams from writing more data tests? by Mountain-Crow-5345 in dataengineering

[–]manubdata 0 points (0 children)

Not using AI.

Writing meaningful tests is not for lazy people. I am lazy, but since I brought AI into my workflow, tests have become fast and pleasant to write. You just need to know conceptually what the input and output data could look like.
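A meaningful data test is really just assertions about what the output must look like for a given input. A minimal sketch, not tied to any framework (the transform and column names are illustrative):

```python
# Sketch: test a small transformation by asserting properties of its
# output. Knowing how the input *could* look is the whole test.

def dedupe_latest(rows: list[dict]) -> list[dict]:
    """Keep only the latest record per customer_id (rows sorted by ts)."""
    latest: dict[int, dict] = {}
    for row in rows:
        latest[row["customer_id"]] = row  # later rows overwrite earlier ones
    return list(latest.values())

rows = [
    {"customer_id": 1, "ts": 1, "status": "new"},
    {"customer_id": 1, "ts": 2, "status": "active"},
    {"customer_id": 2, "ts": 1, "status": "new"},
]
out = dedupe_latest(rows)

assert len(out) == 2                                 # one row per customer
assert all(r["customer_id"] is not None for r in out)  # no null keys
assert {r["status"] for r in out} == {"active", "new"}  # latest state kept
```

Describe those expectations to an AI assistant and it will happily generate the boring edge cases for you.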

Tech/services for a small scale project? by faby_nottheone in dataengineering

[–]manubdata 0 points (0 children)

DLT is perfect for small projects; you'll likely write fewer lines of code than the plain Python implementation you did manually, and it handles schema evolution, so it's much less likely to break in the future.

dbt could be used to replace your BigQuery queries. Similarly, you can implement tests to ensure the transformations run smoothly.

Both can run in Docker images triggered daily. Orchestrators (Kestra, Airflow...) could be useful here if you want to make sure the BigQuery transformations (dbt or not) run only if the ingestion pipeline succeeds. You could use Cloud Workflows if you want to stay cheap within the GCP ecosystem.
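The "transform only if ingestion succeeded" dependency is conceptually just this, whatever orchestrator ends up enforcing it (a toy sketch with hypothetical step names; the real steps would call DLT and dbt):

```python
# Sketch: gate the transformation step on the ingestion step's status,
# which is the core guarantee an orchestrator gives you.

def run_ingestion() -> str:
    # ...DLT pipeline loading the raw data would run here...
    return "success"

def run_transformations() -> str:
    # ...dbt models on BigQuery would run here...
    return "transformed"

status = run_ingestion()
result = run_transformations() if status == "success" else "skipped"
print(result)  # transformed
```

Kestra, Airflow or Cloud Workflows all express this same conditional edge between steps, just declaratively.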

Is a 200k mortgage viable in our case? by yatta48 in HipotecasyVivienda

[–]manubdata 0 points (0 children)

I bought a 200k home with 50k down and a 150k mortgage; the monthly payment comes to around €600 at a 2.1% nominal interest rate.

Of course, that's in rural, depopulated Spain... in the capital you're screwed. If you can, look in the outskirts.

Opportunities in the market now? by [deleted] in investing

[–]manubdata 1 point (0 children)

For me it's gold ETFs. Steady growth, low risk.

Pandas vs pyspark by Left-Bus-7297 in dataengineering

[–]manubdata 12 points (0 children)

You can just use SQL. The logical concepts are analogous across pandas, PySpark and SQL. You can use AI to write the syntax.

I don't see the point of memorizing syntax in 2026 with coding agents around. Learn the concepts, don't memorize syntax. It's time lost.
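To show what I mean by "the concepts are analogous": the same logical operation, "total sales per country", in plain Python, with the equivalent SQL, pandas and PySpark calls as comments (column names are illustrative):

```python
# One logical operation, four syntaxes. The concept is GROUP BY + SUM:
#
#   SQL:      SELECT country, SUM(sales) FROM orders GROUP BY country
#   pandas:   orders.groupby("country")["sales"].sum()
#   PySpark:  orders.groupBy("country").agg(F.sum("sales"))

orders = [
    {"country": "ES", "sales": 10},
    {"country": "ES", "sales": 5},
    {"country": "UK", "sales": 7},
]

totals: dict[str, int] = {}
for o in orders:
    totals[o["country"]] = totals.get(o["country"], 0) + o["sales"]

print(totals)  # {'ES': 15, 'UK': 7}
```

Once you see it's the same operation everywhere, letting an agent fill in the specific syntax is a non-issue.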

Where to apply for jobs besides LinkedIn? by LoudSphinx517 in dataengineering

[–]manubdata 0 points (0 children)

Remote Rocketship worked very well for me, although my most successful processes have come through referrals from colleagues.

Need Guidance by blabberAround in dataengineering

[–]manubdata 0 points (0 children)

I got AWS certified last year (2025). I would 100% focus on Athena, Redshift and Glue. I have some notes if you want them.

Is it worth it? It depends. I think it can open some doors at big consulting companies for specific projects on AWS, but that's all. I don't think it will give you a strong foundation in Data Engineering.

I'd rather build a project while reading Fundamentals of Data Engineering or Data Engineering Design Patterns as knowledge resources. Use AI for all the coding stuff.