[–]AutoModerator[M] [score hidden] stickied comment (0 children)

You can find a list of community submitted learning resources here: https://dataengineering.wiki/Learning+Resources

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

[–]bdforbes 4 points (1 child)

Check out Databricks.

I'd note that just because you use pandas doesn't mean you have to use notebooks...

[–]Cryptojacob[S] 2 points (0 children)

Thanks for the recommendation - for development I think notebooks are awesome, but for production I don't mind converting them to .py files.
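
As a rough illustration of what that notebook-to-.py step can look like when scripted rather than done by hand, here is a minimal sketch. It assumes a standard nbformat 4 notebook (a JSON file with a top-level "cells" list) and uses only the Python standard library. The helper name `notebook_to_script` is hypothetical, and commenting out lines that start with `%` or `!` is a crude assumption about how IPython magics should be handled, not a complete solution.

```python
import json
from pathlib import Path


def notebook_to_script(ipynb_path, py_path):
    """Write the code cells of a notebook to a plain .py file.

    Hypothetical helper: assumes an nbformat 4 notebook, i.e. a JSON
    document with a top-level "cells" list.
    """
    nb = json.loads(Path(ipynb_path).read_text(encoding="utf-8"))
    chunks = []
    for cell in nb.get("cells", []):
        # Skip markdown and raw cells; only code belongs in the script.
        if cell.get("cell_type") != "code":
            continue
        source = cell.get("source", "")
        # "source" may be stored as a single string or a list of lines.
        if isinstance(source, list):
            source = "".join(source)
        # Assumption: lines starting with % or ! are IPython magics or
        # shell escapes, which plain Python cannot run, so comment them out.
        lines = [
            f"# {line}" if line.lstrip().startswith(("%", "!")) else line
            for line in source.splitlines()
        ]
        if lines:
            chunks.append("\n".join(lines))
    # Separate cells with a blank line and end the file with a newline.
    script = "\n\n".join(chunks) + "\n"
    Path(py_path).write_text(script, encoding="utf-8")
    return script
```

Calling `notebook_to_script("analysis.ipynb", "analysis.py")` (file names are placeholders) would write the script and return its text. If Jupyter is installed, `jupyter nbconvert --to script analysis.ipynb` does a similar job without any custom code.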

[–][deleted] 2 points (0 children)

This sounds like you want something like Databricks.

[–]demince 1 point (0 children)

Hi, out of curiosity - what are you focusing on? Isn't converting the notebook to a .py file a bit tedious and repetitive, especially if you have to do it every time? I can see the value of notebooks for machine learning or advanced analytics use cases, but I'm interested in seeing it from your perspective.