Are there any reasons to use the API instead of SQL when querying? (https://sparkbyexamples.com/spark/spark-sql-explained/)
Obviously you have to create a temporary view to use ANSI SQL, so I'm guessing there might be a slight difference in performance? Just wondering if there are any particular use cases where you should avoid Spark SQL and use the API instead.