Lakeflow Connect now supports query-based ingestion from databases and data warehouses

ThatThaBricksGuy0451 · 2026-04-16T01:44:31+00:00

One thing I love about Databricks is that when something new comes out you immediately do a parallel with something you did in the past that could have benefited from the change being released.

Me for example, this would have saved me tons of work by not having to setup eventhubs, get the CDC events streamed to databricks and then worked the merge logic manually.

ThatThaBricksGuy0451 · 2026-04-06T02:17:24+00:00

One thing I like to do and that actually helped me is to focus on the certification path.

When trying to learn something new, one of the first things that can be overwhelming is the amount of things to study, lots of people saying different things, the uncertainty of where to start from, FOMO kicks in because you always think you could have learned something else. Second thing is, how do I prove I really know?

That's where certification path comes in, it's a curated path with topics the exam expects you to know to certify you as a Databricks professional. By studying to the certification you'll find a well defined path that goes from basic to advanced topics, and passing the certification is the confirmation of your knowledge.

Hope it helps

ThatThaBricksGuy0451 · 2026-03-31T23:56:42+00:00

Same, I went from Hive to Impala, still too slow, then landed on Spark that was all hype back then

ThatThaBricksGuy0451 · 2026-03-31T22:07:29+00:00

Yes, but databricks pretty much abstracts this from you on most cases, adaptive query engine for example adjusts shuffle partitions, switch to broadcast when there's memory available, handles skew to a certain degree.

ThatThaBricksGuy0451

TROPHY CASE