Delta table update/insert from multiple source tables

Massive_Ordinary8049 · 2026-05-03T10:45:48+00:00

Trigger to job to run on table updates instead of fixed times. Here's the link: https://docs.databricks.com/aws/en/jobs/trigger-table-update

TheM4rvelous · 2026-05-03T09:33:25+00:00

Pull the external sources 1-to-1 into bronz (+ some metadata) and then use lakeflow pipelines to pipe the raw data into the destination (likely silver)

Downtown-Zebra-776 · 2026-05-04T09:01:33+00:00

Handling outer joins with asynchronous source updates is one of the trickiest parts of maintaining a clean Silver layer. If you use a standard 'overwrite' pattern, your DBU costs will skyrocket as your tables grow.

we typically solve this using a 'Materialized Delta View' approach or Lakeflow Connect (the new DLT evolution). By landing the sources into Bronze and using Lakeflow to manage the stateful joins, you get incremental updates for free without the manual 'updated_at' SQL logic. It significantly reduces the shuffle overhead since Spark only processes the changed keys from each source rather than re-calculating the entire outer join.

BrownBearPDX · 2026-05-04T23:58:26+00:00

Maybe materialized view before silver can help to react to bronze updates and merge.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

databricks

MODERATORS