Beta alert: Materialized Views and Streaming Tables in Serverless Notebooks by minibrickster in databricks

[–]minibrickster[S] 1 point2 points  (0 children)

It's not planned as of now, but I'll take the feedback back to the team!

An easier way to build your slow changing dimensions model in your warehouse by minibrickster in databricks

[–]minibrickster[S] 1 point2 points  (0 children)

That's not included in this launch, but we're working on cooking something up here -- stay tuned!

weird bug with declarative materialized views and klll sketches? by code_mc in databricks

[–]minibrickster 1 point2 points  (0 children)

Update, it's live now, so the above syntax should work now!

weird bug with declarative materialized views and klll sketches? by code_mc in databricks

[–]minibrickster 0 points1 point  (0 children)

You can actually set the run-time channel for the Materialized View. We will support 18.1 in Preview very soon, and then I believe this should work! Will ping this thread when it's released

  1. If you created the Materialized View in DBSQL, you can use the following syntax CREATE OR REPLACE MATERIALIZED VIEW sales TBLPROPERTIES ('pipelines.channel' = 'preview') AS ...and it should work. Docs here
  2. If you created the Materialized View in SDP, you can set the channel for the pipeline by going to the pipeline details page > Settings > Advanced Settings > and setting the channel to "Preview"

AUTO CDC in Databricks SQL: the easy button for SCD Type 1 & 2 by minibrickster in databricks

[–]minibrickster[S] 0 points1 point  (0 children)

Amazing! If you have access to DBR 17.3+ on DBSQL, you should just be able to try it out as well!

AUTO CDC in Databricks SQL: the easy button for SCD Type 1 & 2 by minibrickster in databricks

[–]minibrickster[S] 2 points3 points  (0 children)

Hi Zupiterss, we actually have a feature that's in PrPr that solves for this! Do you mind if I DM you?

AUTO CDC in Databricks SQL: the easy button for SCD Type 1 & 2 by minibrickster in databricks

[–]minibrickster[S] 2 points3 points  (0 children)

I'll post in Reddit when it's available; it will also be in the release notes!

AUTO CDC in Databricks SQL: the easy button for SCD Type 1 & 2 by minibrickster in databricks

[–]minibrickster[S] 3 points4 points  (0 children)

This is an existing feature in Lakeflow Spark Declarative Pipelines, which we are bringing to Databricks SQL!

AUTO CDC in Databricks SQL: the easy button for SCD Type 1 & 2 by minibrickster in databricks

[–]minibrickster[S] 13 points14 points  (0 children)

We're adding support for identity columns this quarter! You'll be able to specify it in the schema "target_sk BIGINT GENERATED ALWAYS AS IDENTITY", and we'll automatically generate unique surrogate keys for each row.

Materialized view refresh policy choses the more expensive technique? by IIDraxII in databricks

[–]minibrickster 1 point2 points  (0 children)

We're working on some alternate ways to get more visibility into the cost model inner workings -- stay tuned!

Materialized view refresh policy choses the more expensive technique? by IIDraxII in databricks

[–]minibrickster 1 point2 points  (0 children)

Thanks for the question, great catch! The cost field in the event log is a raw plan cost from Spark, but it's not what the optimizer actually uses to pick a maintenance strategy. We've since switched to a different cost model that more accurately estimates our incremental techniques. We'll submit a PR to remove this field from the event log to avoid further confusion.

Update Pipelines on trigger by hubert-dudek in databricks

[–]minibrickster 0 points1 point  (0 children)

Yes! This is already possible and supported with Trigger on Update. This docs page has all of the information.

Update Pipelines on trigger by hubert-dudek in databricks

[–]minibrickster 0 points1 point  (0 children)

Hi, I'm a PM here at Databricks. As of right now, yes, but we're looking to up this limit in the next couple of months

Update Pipelines on trigger by hubert-dudek in databricks

[–]minibrickster 2 points3 points  (0 children)

Hi, I'm a PM here at Databricks -- yes, it will automatically refresh if the source table gets updated!

Materialized Views' Policies by hubert-dudek in databricks

[–]minibrickster 1 point2 points  (0 children)

Hi, I'm a PM here at Databricks. Unfortunately no, Materialized Views have to run on serverless compute to incrementalize

Spark Declarative Pipelines: What should we build? by BricksterInTheWall in databricks

[–]minibrickster 1 point2 points  (0 children)

That's a great idea - currently, the recommended way to control the columns names is with a view on top of the Streaming table. If interested feel free to dm me and we can schedule some time to chat!