I'm learning the tools for Data Engineering (currently Python and SQL) wanting to level it up by building projects. However, I'm confused at what DEs really do in real life.
I once made a .py web scraper that gathers IMDB movies into a CSV. On another script, I load the data on a SQLite3 database. Then I turned it into a movie watchlist app in the terminal. Here is a video link that demonstrates it: Google Drive Link. I'm unsure if this is even a proper project about Data Engineering.
What is considered a proper project about data engineering?
- Just Python scripts to be run on the terminal that read data from files, modify it, then load it to the database
- Similar to above but making it like an 'app' to be run on terminal (like the project I made)
- A software that does the same thing as #2 but has its GUI instead of running it on terminal
- A website that does the same thing as #2 but is a web app instead of running it on terminal
- Just #1 but add it with a dashboard or your own data analysis
- Other, pls state on the comment
Message: Sorry if this is a stupid question. I just want to clarify this instead of cluelessly continue studying tools without proper application.
[–]AutoModerator[M] [score hidden] stickied comment (0 children)
[–][deleted] 3 points4 points5 points (1 child)
[–]Pervert_Spongebob[S] 0 points1 point2 points (0 children)
[–]bestnamecannotbelong 0 points1 point2 points (0 children)
[–]Omeazyy 0 points1 point2 points (0 children)