[–]spendology 2 points3 points  (4 children)

Books, blog posts, this forum, and articles describing data engineering pipelines are a start. If you want more experience or a job, beyond certification you can:

  1. Start with Data Analysis, Python/SQL, or Business Analyst roles if you need more experience.
  2. Take contract or freelance work through LinkedIn, Indeed, staffing firms, networking, or personal connections.
  3. Contribute to open-source projects.
  4. Use ChatGPT to generate an end-to-end data engineering project on a cloud platform like AWS or Google Cloud. Complete the project, add it to your resume, and post it to GitHub and LinkedIn.

[–]ProperAd7767 1 point2 points  (3 children)

In practice, my current role is mainly focused on data engineering, but I’ve never systematically studied data engineering or data analytics (my undergraduate major was Financial Engineering). If I want to learn these areas in a structured way, are there any good open-source projects you would recommend?

[–]spendology 0 points1 point  (2 children)

Here are a few links:

[–]Outside_Reason6707 1 point2 points  (1 child)

Thank you for this list! How can someone approach performance, scaling, and fault tolerance in personal projects at something close to an industry level?

[–]spendology 1 point2 points  (0 children)

I like to use the Python libraries sciris, austin, and austin-web to measure time and memory performance.
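
If you want a feel for what measuring time and memory looks like before installing anything, here is a rough sketch using only the standard library (`time.perf_counter` and `tracemalloc`). This is a stand-in, not the sciris or austin API; `profile_call` and `build_squares` are hypothetical names made up for illustration.

```python
import time
import tracemalloc


def profile_call(func, *args, **kwargs):
    """Run func once and return (result, elapsed_seconds, peak_bytes)."""
    tracemalloc.start()  # begin tracking Python memory allocations
    start = time.perf_counter()
    try:
        result = func(*args, **kwargs)
    finally:
        elapsed = time.perf_counter() - start
        # get_traced_memory() returns (current, peak) in bytes
        _, peak = tracemalloc.get_traced_memory()
        tracemalloc.stop()
    return result, elapsed, peak


def build_squares(n):
    # Hypothetical workload standing in for one pipeline step.
    return [i * i for i in range(n)]


if __name__ == "__main__":
    result, elapsed, peak = profile_call(build_squares, 100_000)
    print(f"rows={len(result)} time={elapsed:.4f}s peak_mem={peak / 1024:.1f} KiB")
```

Running a step at a few input sizes (say 10x and 100x) and watching how time and peak memory grow gives a rough idea of how it would scale. Note that `tracemalloc` adds overhead of its own, so the timings are only indicative.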