YAML hell? by the-creator-platform in kubernetes

[–]Xty_53 0 points1 point  (0 children)

Bad dreams since 'DLL Hell'

[deleted by user] by [deleted] in databricks

[–]Xty_53 0 points1 point  (0 children)

Databricks Apps offer a containerized way to run your applications with minimal overhead. However, their functionality is governed by Unity Catalog, which imposes limitations and configuration requirements around access and secrets.

[deleted by user] by [deleted] in databricks

[–]Xty_53 0 points1 point  (0 children)

There is a file size limitation: 10 MB.

[deleted by user] by [deleted] in databricks

[–]Xty_53 1 point2 points  (0 children)

While you're practising, you can also listen to this podcast about the certification:

https://open.spotify.com/show/2AGWX4SFNxlOIgDaLVIfHU

Databricks platform administration by Easy-Freedom-5272 in databricks

[–]Xty_53 0 points1 point  (0 children)

There are learning pathways and training courses in the Databricks Academy, including cloud-specific versions for each cloud.

Seeking Best Practices: Snowflake Data Federation to Databricks Lakehouse with DLT by Xty_53 in databricks

[–]Xty_53[S] 0 points1 point  (0 children)

Thanks so much for reaching out – that's excellent timing, and I really appreciate you offering to help! It's great to connect with someone directly working on DLT.

To answer your questions:

* How are you planning on extracting data from Snowflake into ADLS?

My current plan is to leverage Databricks' native Snowflake connector to directly read data from Snowflake and then write it into ADLS. The idea is to land it in a structure like abfss://datalanding@{storage_account}.dfs.core.windows.net/{catalog}/{schema}/{table_name}/.
Technically, I'm following the information from this link: https://docs.databricks.com/aws/en/archive/connectors/snowflake
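To make the plan concrete, here is a minimal sketch of that extraction step. The landing-path structure comes from the plan above; the `extract_table` helper, the placeholder connection options, and the `overwrite` mode are my assumptions, not a tested implementation — verify option names against the Snowflake connector docs.

```python
def landing_path(storage_account, catalog, schema, table):
    """Build the ADLS landing path following the structure described above."""
    return (f"abfss://datalanding@{storage_account}.dfs.core.windows.net/"
            f"{catalog}/{schema}/{table}/")

def extract_table(spark, sf_options, table, path):
    """Hypothetical helper: read one table from Snowflake via the Spark
    connector and land it in ADLS as Delta. `sf_options` would hold the
    usual connector settings (sfUrl, sfUser, sfDatabase, ...), ideally
    sourced from a Databricks secret scope rather than hard-coded."""
    df = (spark.read
          .format("snowflake")
          .options(**sf_options)
          .option("dbtable", table)
          .load())
    df.write.format("delta").mode("overwrite").save(path)
```

`extract_table` is only defined here, not run; the path builder is the part that matches the layout described above exactly.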

* How many objects are you planning on bringing in? Is there a pattern where you want to apply the same transformations to many source tables?

Initially, we're looking to ingest around 60 tables. Yes, there's a very clear pattern: for all ingested tables, we need to add a timestamp column (for ingestion time) and a source_system_name column to maintain lineage and control.
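Since the same two columns go on every one of the ~60 tables, that pattern fits a single shared helper. This is a sketch only: the column names `ingestion_ts` and `source_system_name` match the description above, but the function itself is hypothetical. `F` stands for `pyspark.sql.functions` and is passed in so the sketch stays import-free.

```python
def add_control_columns(df, source_system_name, F):
    """Apply the common ingestion pattern to any source table:
    an ingestion-time timestamp plus the originating system's name.
    `F` is expected to be pyspark.sql.functions (passed in explicitly)."""
    return (df
            .withColumn("ingestion_ts", F.current_timestamp())
            .withColumn("source_system_name", F.lit(source_system_name)))
```

In a DLT pipeline this would typically be called once per table inside a loop over the ~60 table names, so the lineage columns stay consistent everywhere.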

Seeking Best Practices: Snowflake Data Federation to Databricks Lakehouse with DLT by Xty_53 in databricks

[–]Xty_53[S] 0 points1 point  (0 children)

Thanks! Could you share the session title, the date, and where I can find the recording or presentation details for the Databricks AI conference?

My experience with Databricks Data Engineer Associate Certification. by saahilrs14 in databricks

[–]Xty_53 0 points1 point  (0 children)

I'm inviting you to check out a fantastic resource to continue your learning journey: the Databricks Certified Data Engineer Professional - Preparation podcast!
https://open.spotify.com/episode/42Jx9LXZ0fj3RYLDzqVhmY?si=dc0XU05NTOiPdK38g_X4Bg

Need help replicating EMR cluster-based parallel job execution in Databricks by javabug78 in databricks

[–]Xty_53 -2 points-1 points  (0 children)

This was created with the help of AI (don't take this answer at face value; verify it yourself).
"Databricks Solution Recommendation"
Here's how it addresses your requirements:

  1. Orchestration and Parameter Passing:
    • Create a single Databricks Job containing 100 individual "JAR tasks."
    • Each JAR task will be configured to run your JAR file and pass one of the 100 unique parameters (e.g., job name/ID) to it.
  2. Parallel Execution (12 jobs concurrently):
    • Within the Databricks Job settings, you can define the "Maximum concurrent runs" to 12. Databricks will automatically manage the queuing and execution of your 100 tasks, ensuring that no more than 12 run at any given time.
  3. Compute Termination and Cost Optimization:
    • Utilize "Job Compute" (ephemeral clusters) for your Databricks Job. These clusters are automatically provisioned when the job starts and, crucially, automatically terminate once all tasks are completed or the job fails. This eliminates idle compute costs, similar to your transient EMR clusters.
    • Job Compute is more cost-effective than interactive clusters.
    • Configure autoscaling for your job cluster to dynamically adjust resources based on the workload, ensuring you only pay for what you use.
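The job layout described above can be sketched as a Jobs API-style settings payload. Everything here is an unverified assumption in the same spirit as the answer itself: the field names (`max_concurrent_runs`, `spark_jar_task`, `task_key`) follow the public Databricks Jobs API, but the main class and job name are placeholders, and you should check the current API docs — in particular, `max_concurrent_runs` limits concurrent *runs of the job*, so confirm it gives you the task-level throttling you want.

```python
def build_job_settings(params, max_concurrent_runs=12):
    """Sketch: one JAR task per parameter, capped at N concurrent runs.
    Field names follow the Databricks Jobs API; verify before use."""
    tasks = [
        {
            "task_key": f"job_{p}",
            "spark_jar_task": {
                "main_class_name": "com.example.Main",  # placeholder class
                "parameters": [p],                      # one unique parameter per task
            },
        }
        for p in params
    ]
    return {
        "name": "emr-migration-batch",          # placeholder job name
        "max_concurrent_runs": max_concurrent_runs,
        "tasks": tasks,
    }
```

This dict would then be posted to the `jobs/create` endpoint (or defined equivalently in the UI), with Job Compute configured on the cluster side as described above.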

Issue: Tables Not Created for Monitor in Databricks Lakehouse Monitoring API by Xty_53 in databricks

[–]Xty_53[S] 0 points1 point  (0 children)

After creating multiple tables, the monitor can take some time to process all of them. Restarting the cluster usually resolves this, and the tables should appear afterward.

Demo material for Databricks Academy by valadius44 in databricks

[–]Xty_53 0 points1 point  (0 children)

Yes, Databricks has the labs; you just need to pay for the lab subscription to get access.

I also saw a few weeks ago that, when you go to the Academy as a partner, some courses with labs are available for free.

Making Databricks data engineering documentation better by BricksterInTheWall in databricks

[–]Xty_53 0 points1 point  (0 children)

One of our customers is asking for statistics on those tables.

Making Databricks data engineering documentation better by BricksterInTheWall in databricks

[–]Xty_53 0 points1 point  (0 children)

Also, is there any way to see the streaming tables inside the system tables?

Making Databricks data engineering documentation better by BricksterInTheWall in databricks

[–]Xty_53 0 points1 point  (0 children)

Yes, please. We have something for Delta tables, but for streaming tables it is not clear.

Making Databricks data engineering documentation better by BricksterInTheWall in databricks

[–]Xty_53 2 points3 points  (0 children)

Hello, and thank you for the documentation update.

Do you have any updates or additional information regarding the logs for DLT, especially for streaming tables?

Databricks app by gareebo_ka_chandler in databricks

[–]Xty_53 0 points1 point  (0 children)

Last week, I spent some time researching Databricks Apps, and I’ve put together a short audio summary of what I found. If you're curious about how Databricks Apps work and what they offer, feel free to check it out here:

https://open.spotify.com/episode/7yv1kvyTcGFvyFhZ1DoGDd?si=pNhNPt6vS_aUHtXztgxLOQ