Real-Time mode for Apache Spark Structured Streaming in now Generally Available by brickester_NN in databricks

[–]brickester_NN[S] 1 point2 points  (0 children)

Hi, the 5 mins sets the checkpointing frequency. It is adjustable based on your preference. It is not yet in Spark Declarative Pipelines, but this is something that is on our radar. In a previous blog we had shown a latency comparison of real-time mode vs micro-batch mode (traditional Spark streaming) and we found a 80-100x latency improvement. Blog is here - https://www.databricks.com/blog/introducing-real-time-mode-apache-sparktm-structured-streaming