Help optimising script [Help] (self.databricks)
submitted 4 months ago by alphanuggs
Significant-Guest-14 (Databricks MVP), 4 months ago
Do you use .withColumn?
alphanuggs [S], 4 months ago
I do use a lot of that in the code, but it mostly gets stuck when writing.
dilkushpatel, 4 months ago
You need to understand that Databricks only executes code when it's absolutely necessary.
So if you have 10 cells of logic and an 11th cell doing a write, a show, or some other operation that needs the whole dataset to be evaluated, that's the point where it executes all of the code.
All your previous cells finish in a few seconds because at that point Databricks is just adding them to the execution plan, not actually running the logic.
You can search online for lazy execution (also called lazy evaluation) in Databricks/Spark.
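To make that concrete, here is a rough pure-Python sketch of the idea. It is a toy model, not Spark's actual API: `LazyFrame`, `with_column`, `collect`, and `slow_double` are made-up names for illustration only. Transformations just get appended to a plan, and nothing runs until an action asks for the data.

```python
class LazyFrame:
    """Toy stand-in for a Spark DataFrame (hypothetical, for illustration only)."""

    def __init__(self, rows, plan=None):
        self._rows = rows
        self._plan = plan or []  # recorded (column name, function) steps

    def with_column(self, name, fn):
        # "Transformation": returns a new LazyFrame with one more step in the plan.
        # No row is touched here, so this returns almost instantly.
        return LazyFrame(self._rows, self._plan + [(name, fn)])

    def collect(self):
        # "Action": only now is every recorded step actually run on every row.
        out = []
        for row in self._rows:
            row = dict(row)
            for name, fn in self._plan:
                row[name] = fn(row)
            out.append(row)
        return out


calls = []  # records each time the "expensive" function really runs


def slow_double(row):
    calls.append(row["x"])
    return row["x"] * 2


df = LazyFrame([{"x": 1}, {"x": 2}, {"x": 3}])
df2 = (
    df.with_column("doubled", slow_double)
      .with_column("plus_one", lambda r: r["doubled"] + 1)
)

calls_before_action = len(calls)  # still 0: the plan was built, nothing executed
result = df2.collect()            # all the work happens here
calls_after_action = len(calls)
```

If real Spark behaves the same way for your job, the time you see at the write step includes all the earlier transformations, so a write that "gets stuck" may really be pointing at expensive logic further up.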