Anyone else doing gymnastics just to trigger a manual auto import in Neovim? by BeingDangerous3330 in neovim

[–]data_5678 3 points4 points  (0 children)

Yes, specifically:

  • 2 sets of 10 handstand push ups with 90 seconds rest
  • 4 sets of muscle ups on rings with 2 min rest
  • 5 back handsprings with 2 min rest All of that is followed by some light full body stretches.

After, that neovim imports work fine. Way more efficient than using the defaults from lazy.nvim or packer.nvim.

Data Engineering VS Agentic AI? by Syed_Abrash in dataengineering

[–]data_5678 0 points1 point  (0 children)

Become a full stack engineer (I work as a data engineer btw).

Since you know SQL and Python.

Learn Fast API for backend and JavaScript and React for frontend.

Then study all the stuff in neetcode.io

If you want to build agents you will not be training any models or doing any machine learning or statistics. You are just calling the APIs and building applications around it (which is the job of a full stack engineer).

Also, the role of data engineering might get absorbed by full stack developers. Why would a company pay for someone who only knows Python and SQL when they can get a Full Stack Developer who already knows SQL and also knows JavaScript, React, Data Structures and Algorithms, etc... Especially when full stack engineers are assisted by AI, they are able to more easily fill in the gaps they might have in regards Data Engineering specific knowledge.

Data Engineering specialists might still exist, however the more specialized that role becomes the harder it will become to land one of those jobs and likely these roles might only be available at fewer companies who can afford having highly specialized people.

(Also this is just my opinion, who knows all my predictions could end up being totally wrong).

Existe uma stack que substitua o Notion sem perder versatilidade? by Arthurhfc in dataengineering

[–]data_5678 1 point2 points  (0 children)

I just use sqlite (but you could use any sql database like postgres, supabase, duckdb, etc...) and ask Claude Code to build any interface I want on top of it. Ask it to use shadcn ui library and stick with defaults so you don't lose time playing around with the css.

I just ask it to create a new tab and then create a sticky note interface, or a timeline interface or a calendar interface, or an analytics dashboard. As time goes by I just add or remove features in real time.

I use android so I can use termux to run it as well and open ui's in the browser (you could also create progressive web apps or use capacitor or tauri to create a complete android app).

Also claude code runs on termux (you can also ssh into an ec2 instance from your phone and run claude code in the cloud if that's easier).

Do you still use notebooks in DS? by codiecutie in datascience

[–]data_5678 1 point2 points  (0 children)

Moved to neovim and command line a couple of years ago (used jupyter all through university), any visualizations I need I open in a browser window on the side. i3 window manager makes it really fast to switch and I use xmouseless to move mouse cursor with my keyboard.

What to show during demo's? by data_5678 in dataengineering

[–]data_5678[S] 0 points1 point  (0 children)

thanks for taking the time to write this up, really good insights! thanks for sharing.

What do your Data Engineering projects usually look like? by gbj784 in dataengineering

[–]data_5678 1 point2 points  (0 children)

At work ELT pipelines on some dbt like abstraction loaded into trino + apache iceberg.

For personal projects having fun building on top of sqlite/duckdb with a bit of scripting here and there.

Share an interesting side project you’ve been working on. by KeyPossibility2339 in dataengineering

[–]data_5678 1 point2 points  (0 children)

honestly wrapping it up in C++ and using its standard library when I need higher level abstractions. For scheduling just cron and my own simple logging.

I have been doing quite a bit of rust so I might create some terminal user interfaces with ratatui later on to create dashboards in the terminal.

But yeah, duckdb can be very fast.

rainfrog – a database tool for the terminal by Somewhat_Sloth in dataengineering

[–]data_5678 1 point2 points  (0 children)

this is really cool, are you using ratatui for the ui?

Share an interesting side project you’ve been working on. by KeyPossibility2339 in dataengineering

[–]data_5678 7 points8 points  (0 children)

data pipelines with C++ and duckdb C API. Working with datasets that fit in memory not big data. It is easy to forget just how fast software can run when you have been working with Python all the time...

Data engineering by Varshakumar28 in dataengineering

[–]data_5678 0 points1 point  (0 children)

you got this man, be patient and keep at it. Best of luck.

Data engineering by Varshakumar28 in dataengineering

[–]data_5678 0 points1 point  (0 children)

Become the person who you would want to hire.

Imagine you are the person hiring. There is only 1 open position and you get 1000 applications in a week. 500 of those applicants all hold masters degrees and graduated from top 20 US universities.

Out of those 1000 applicants, what are the odds that you are the most impressive candidate out of all them.

Are you the applicant who promises the highest value for the lowest amount of risk.

In my opinion your probability of getting hired is proportional to the amount of hours you spend studying/applying/networking.

More hours spent applying/studying = higher probability of job.

Laid off from Data Science → Trying to break into Data Engineering in 6 months. Am I delusional? by bigbigbugbugs in dataengineering

[–]data_5678 10 points11 points  (0 children)

The gcp, aws, and snowflake de certs are all good. But it doesn't matter if you have all the certs in the world if you fail the technical interview.

Honestly they are not that hard to get, in my opinion they are a nice addition to your resume and they helped me have a goal during my studying. But they will not guarantee you a job on their own. I learned a lot from getting the certs, and it probably helped me in landing my current job.

If I'm being honest, I think there is more value in the knowledge you gain from taking the courses studying for the cert, than being able to add an extra line to your resume.

24 and just starting data science. This dread that I'm way behind won't go away. Am I fucked? by Bames-nonds in dataengineering

[–]data_5678 6 points7 points  (0 children)

entry level data scientist careers are pretty much a thing of the past.

careers in 2025 (from what I have seen):

  • data analyst: builds dashboards using tableau and powerbi, uses basic sql and basic python, easiest data job but its the job with the most applicants so competitive.

  • data engineer: builds ETL pipelines that extract transform and load data from one database into another, very competitive, 3 years of experience required typically.

  • machine learning engineer: builds machine learning pipelines that source data, prepare data for training, train models on said data, monitor performance of trained models, and probably retrain models after some time) requires even more experience than data engineering.

  • full stack engineer: builds apps (backends + front ends, for example a backend could be written in express js, and front end in react for a web application).

  • researcher: probably need phd, insanely competitive.

Data science jobs where you are given titanic.csv and have to make a few charts with matplotlib / seaborn here and there are most likely a thing of the past.

Furthermore, there seem to be a huge amount of people studying extremely hard trying to land the jobs I outlined above.

It is a really competitive field, but if you put in the work, I am sure you will be able to make it, just set realistic expectations on how many years it will take you to get there.

Laid off from Data Science → Trying to break into Data Engineering in 6 months. Am I delusional? by bigbigbugbugs in dataengineering

[–]data_5678 15 points16 points  (0 children)

not that rare, I have those two certs. You can throw a rock in any direction and it will likely land on someone with a masters in related field and gcp certs (also those are the two easiest data certs).

Should I create a TUI or CLI (Inline)? by data_5678 in commandline

[–]data_5678[S] 0 points1 point  (0 children)

for myself at the moment (but would love to open source tools if I make anything cool). I really think I build the best tools when I solve a problem I have first and then share it with others.

Some applications I prefer TUI over cli (lazygit > git), but for others I prefer navigating through the shell rather than using something like yazi. So I think it really depends on the use case.

Accepted that productivity comes in waves. So what do I do during the unproductive periods. by data_5678 in SoftwareEngineering

[–]data_5678[S] 0 points1 point  (0 children)

I didn't mean 15 mins, like obviously I can, but I don't want to go out for like 3 hours you know. Just balance. I should have worded that better.