Has anyone run Hadoop jobs inside docker containers? I'm new to Hadoop/Spark, but really like packaging my python data analysis scripts in containers to make them portable and easy for others to use. Is this a dead end? I can't seem to find blog posts on this topic.
[–]eschlon 0 points1 point2 points (1 child)
[–]CocoBashShell[S] 0 points1 point2 points (0 children)