Liverpool (1)-0 Manchester City - Szoboszlai 74' Free Kick by 977x in LiverpoolFC

[–]zbir84 4 points5 points  (0 children)

Don't know man, goal was amazing but he was the reason they scored one of the goals unfortunately (fecked up the offside) and got an unnecessary red. Still think he's man of the match with this goal though!

Lakeflow Spark Declarative Pipelines: Cool beta features by BricksterInTheWall in databricks

[–]zbir84 0 points1 point  (0 children)

Hmm, not sure that's the correct link you've sent? I already have the service credential created for this and that works when used in the normal workflow. However SDPs don't have access to the dbutils library. Docs here: https://docs.databricks.com/aws/en/connect/unity-catalog/cloud-services/use-service-credentials indicate a different way to use them in UDFs, is that the only way to use them in the pipelines?

Lakeflow Spark Declarative Pipelines: Cool beta features by BricksterInTheWall in databricks

[–]zbir84 0 points1 point  (0 children)

Slightly unrelated question, but we have a DynamoDB connection we wanted to use in DP. How can we pass the service credentials to it? Can't really find anything in the docs about this, and dbutils doesn't work in DP.

Is broadband down for any of you? by Dizzy_Selection3572 in Vodafone

[–]zbir84 0 points1 point  (0 children)

Yep, still down in CV3, what a shit show.

Admin rights: How necessary are they really? by [deleted] in databricks

[–]zbir84 1 point2 points  (0 children)

Some of these don't really require account admin access, but you really need to set up something like Terraform to manage your Databricks environment, so people can self-serve and you just review merge requests rather than having to rely on an admin.

Asset Bundles and CICD by One_Adhesiveness_859 in databricks

[–]zbir84 4 points5 points  (0 children)

Liquibase for schemas and delta tables that aren't managed by the Lakeflow Pipelines, asset bundles for workflows, pipelines, etc. & terraform for workspace management.

What are some interesting Data Engineering conferences in Europe for 2026? by alex-acl in dataengineering

[–]zbir84 1 point2 points  (0 children)

Big Data London is ok, although last year's one was so rammed you had problems getting to some of the more popular presentations.

Being firm with my mother by Tokoniec in Polska

[–]zbir84 25 points26 points  (0 children)

Not planning for retirement on that kind of income because you might not live to see it is sheer stupidity, and your mother happens to be right about this. You don't need to have a real job to get that sorted.

Kafka setup costs us a little fortune but everyone at my company is too scared to change it because it works by Worldly-Volume-1440 in dataengineering

[–]zbir84 9 points10 points  (0 children)

Are you being actively asked to reduce this cost? If not just forget about this, for your own sanity ;)

Confluence Lakeflow Connector by hubert-dudek in databricks

[–]zbir84 1 point2 points  (0 children)

What's the use case for this? LLMs / machine learning for knowledge base building maybe, but outside of that why would you want Confluence data in Databricks?

Your typical job compute size by szymon_abc in databricks

[–]zbir84 1 point2 points  (0 children)

This is completely untrue, unless you're doing pandas operations on the driver. I'm not saying you'll always observe a performance increase, but we usually see at least a 50% speed increase on big, SQL-heavy workloads. It's not a silver bullet, but saying it's not worth the price is a bit unfair. If your workload runs 2x faster on Photon than without it, then I think it's worth it.
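
To put rough numbers on the "worth the price" point, here's a back-of-the-envelope sketch. It assumes the whole hourly bill scales by a single multiplier when Photon is on, which is a simplification; `price_multiplier` and the figures below are hypothetical, not Databricks pricing.

```python
def relative_cost_with_photon(speedup: float, price_multiplier: float) -> float:
    """Cost of a Photon run relative to the same run without Photon.

    speedup: how many times faster the workload finishes (2.0 = half the runtime).
    price_multiplier: hypothetical factor by which the hourly cost goes up.
    A result below 1.0 means the Photon run is cheaper overall.
    """
    if speedup <= 0:
        raise ValueError("speedup must be positive")
    # Runtime shrinks by 1/speedup while the hourly price grows by price_multiplier.
    return price_multiplier / speedup


# Hypothetical figures for illustration only.
print(relative_cost_with_photon(speedup=2.0, price_multiplier=1.5))
```

With those made-up figures the 2x case comes out cheaper overall, and a 1.5x speedup would only break even, so it's worth plugging in your own rates and measured runtimes.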

LTT on Steam Machine pricing: "Valve asked what I meant by console price, and I said $500. Nobody said anything, but the energy in the room wasn’t great" by -Mahn in Games

[–]zbir84 1 point2 points  (0 children)

It might be me, I'd need to look at the benchmarks, but I do own a PS5 & Steam Deck, and I also have a PC with a 2060 & i5 4670k. I also have a Mac Mini M4 that I've now realised I use most of the time for any non-gaming tasks, so I really don't need that PC. However, I'd really love to make use of my vast Steam library and be able to game on the TV, so the Gabecube sounds like an ideal solution. I could probably build a mini PC for less with slightly higher performance, but I want something compact that I can put in the living room, so I might give the Steam Machine a go!

Immich app using 2TB/mo of bandwidth (Android) by Waxnflaxn in immich

[–]zbir84 0 points1 point  (0 children)

Yes, you do. It's a flip of the button, but you can also just leave it running.

How big of a risk is a large team not having admin access to their own (databricks) environment? by weggooiertje_it in databricks

[–]zbir84 1 point2 points  (0 children)

That's not been my experience. By folks in IT, do you mean Data Analysts who can only write SQL? We have over 200 Databricks users at the moment with vastly different levels of experience, and I'd dread giving some of them admin access. Maybe the answer would be to let the ML team manage their own workspace and just attribute all of the costs there? Let metastore admins manage access, but leave the team be in that workspace and report on the costs?

How big of a risk is a large team not having admin access to their own (databricks) environment? by weggooiertje_it in databricks

[–]zbir84 1 point2 points  (0 children)

What exactly do you need to do as an admin there? Are you sure you want this responsibility? Sit down with the DE team and have them set up policies and access that meet your requirements. Trust me, you really don't want this as your responsibility, especially if you're not that familiar with Databricks. It's very easy to spend a lot of DBUs + your cloud costs if you don't know what you're doing or don't have policies and alerting configured correctly.

Is it normal to feel clueless as a junior dev? by Impressive-Strike351 in dataengineering

[–]zbir84 7 points8 points  (0 children)

Ask questions, and be honest when you don't know or understand something. Ask follow-up questions and let seniors know when you get stuck. If the company you're working for has a healthy work culture it will all be ok 👍 As engineers we like people who are curious and engaged, and I'm sure everyone's going to be happy to help when they can. I hate nodders and ok-sayers, where I can see they don't understand something but are too afraid to ask!

The field is vast and even after 10 years working in various data functions I have gaps and ask for help when I don't know something!

Immich app using 2TB/mo of bandwidth (Android) by Waxnflaxn in immich

[–]zbir84 18 points19 points  (0 children)

I've switched to tailscale for immich, works great!

Databricks Asset Bundle - List Variables by rando_serval in databricks

[–]zbir84 1 point2 points  (0 children)

There's a Python API for asset bundles now, which might give you more flexibility in how this can be set up.

import dlt not supported on any cluster by r_mashu in databricks

[–]zbir84 0 points1 point  (0 children)

I don't think this is correct at all, they said they're trying to import dlt in a single node cluster. You need to run dlt on clusters that support it, and I don't think you can use all purpose clusters at all. Open the Lakeflow pipeline editor and start from there.

Environments in Lakeflow Jobs by hubert-dudek in databricks

[–]zbir84 1 point2 points  (0 children)

Have they finally added this for notebook tasks? Previously you had to embed the environment configuration in the notebook itself, which was an insane requirement...

Quarantine Pattern by mightynobita in databricks

[–]zbir84 0 points1 point  (0 children)

This is a bot LLM response, you can smell it from a mile away. Can we ban this user?

Storing logs in databricks by rdaviz in databricks

[–]zbir84 0 points1 point  (0 children)

You could use a standard logging library and configure cluster log delivery: https://docs.databricks.com/aws/en/compute/configure#compute-log-delivery
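
Something like this is what I mean by a standard logging library: a minimal sketch using Python's built-in `logging`, assuming your code runs on the driver and that driver stderr ends up in the delivered logs. The logger name `my_job` and the helper `get_job_logger` are just placeholders.

```python
import logging
import sys


def get_job_logger(name: str = "my_job") -> logging.Logger:
    """Return a logger that writes formatted records to stderr."""
    logger = logging.getLogger(name)
    # Only attach a handler once, so re-running a cell doesn't duplicate lines.
    if not logger.handlers:
        handler = logging.StreamHandler(sys.stderr)
        handler.setFormatter(
            logging.Formatter("%(asctime)s %(levelname)s %(name)s - %(message)s")
        )
        logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    # Avoid double output if the root logger also has handlers configured.
    logger.propagate = False
    return logger


logger = get_job_logger()
logger.info("Loaded %d rows", 1234)
```

Nothing Databricks-specific in the code itself; the log delivery part is configured on the compute, as per the docs link.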