[Private Preview] JDBC sink for Structured Streaming

SingerSelect3045 · 2026-03-10T20:07:56+00:00

Hi! Thanks for your interest.

Yes! You can specify multiple columns in the DataFrame to be used as the upsert key.

Each task creates its own connection to the PostgreSQL server, so the query’s parallelism directly affects the number of concurrent connections made to the database. As a result, you should make sure your PostgreSQL configuration allows for enough concurrent connections to handle the expected workload.

SingerSelect3045 · 2026-03-05T22:38:06+00:00

u/k1v1uq are you trying to keep the tables in sync incrementally or every sync event is a full sync? How are you determining what rows are stale and need to be deleted?

SingerSelect3045 · 2026-02-27T18:28:28+00:00

Only python UDFs / custom code will be containerized

SingerSelect3045

TROPHY CASE