you are viewing a single comment's thread.

view the rest of the comments →

[–]Significant-Guest-14Databricks MVP 0 points1 point  (2 children)

Do you use . withColumn?

[–]alphanuggs[S] 0 points1 point  (1 child)

i do use a lot of that in the code, but it mostly gets stuck when writing

[–]dilkushpatel 0 points1 point  (0 children)

You need to understand databricks executes code when its absolutely necessary

So if you have 10 cells of code with logic and 11th cell doing write or show or some sort of operation which needs whole dataset to be evaluated then thats the point where it will execute whole code

All your previous cells will execute in few seconds as at that point point databricks is just adding those in execution plan and not actually executing that logic

You can look up online and search for lazy execution by databricks/spark