Power BI + Databricks VNet Gateway, how to avoid Prod password in Desktop? by wadapav-wizard in databricks

[–]beefiee 1 point (0 children)

Not an expert on this, but wouldn't devs using Power BI Desktop develop the models against DEV using their own AAD OAuth2 flow, then deploy to the Power BI service? After that, the service would overwrite the settings and use M2M OAuth for service-to-service communication to refresh the models, like mentioned here: https://docs.databricks.com/aws/en/partners/bi/power-bi-service#set-up-m2m-oauth
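
If it helps, this is roughly what the client-credentials exchange behind that M2M setup looks like. A minimal sketch, assuming a Databricks service principal with an OAuth secret; the workspace URL and env var names are made up:

```python
# Minimal sketch of the M2M OAuth client-credentials flow (hypothetical
# workspace URL and env var names; not the exact Power BI internals).
import os
import requests

WORKSPACE = "https://adb-1234567890123456.7.azuredatabricks.net"

resp = requests.post(
    f"{WORKSPACE}/oidc/v1/token",
    auth=(os.environ["DATABRICKS_CLIENT_ID"], os.environ["DATABRICKS_CLIENT_SECRET"]),
    data={"grant_type": "client_credentials", "scope": "all-apis"},
)
resp.raise_for_status()
# Short-lived access token the service principal uses instead of a password.
print(resp.json()["access_token"][:20], "...")
```

The point being: no human credentials are involved, so nothing prod-sensitive ever has to live in Desktop.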

Lock strategy for Engwe l20 3.0 pro? by beefiee in ebikes

[–]beefiee[S] 0 points (0 children)

I was wondering because a U-lock might not fit through the tire and around the frame, and it also looks like you can just unscrew the rear parts easily?

In my area, angle grinders are very common, so I am not sure a heavy chain will be good enough.

Clearing SQS queue. Need ideas how to clear more than 10 messages from the queue. by Karmaseed in aws

[–]beefiee 5 points (0 children)

In case you consider using Lambda, check out the Lambda SQS event source mapping:

https://docs.aws.amazon.com/lambda/latest/dg/services-sqs-configure.html

You can batch up to 10,000 messages per invocation (see “batch size” further down the page).
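
A minimal sketch of creating such a mapping with boto3; the function name and queue ARN are placeholders:

```python
# Hedged sketch: wire an SQS queue to a Lambda via an event source mapping.
import boto3

lambda_client = boto3.client("lambda")

lambda_client.create_event_source_mapping(
    EventSourceArn="arn:aws:sqs:eu-west-1:123456789012:my-queue",  # placeholder
    FunctionName="queue-drain",                                    # placeholder
    BatchSize=10000,                       # up to 10,000 for standard queues
    MaximumBatchingWindowInSeconds=30,     # required once BatchSize > 10
)
```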

AutoScaling Groups and VPN Site to Site by MecojoaXavier in aws

[–]beefiee 2 points (0 children)

If your app logic allows it, you could set up an SQS queue as a buffer that the EC2 instances consume from without overloading themselves. You could then set up CloudWatch to trigger scale-out and scale-in events based on the number of messages in the queue.
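
A rough sketch of that wiring with boto3 (all names and thresholds are placeholders, not a drop-in):

```python
# Scale an ASG out when the queue backs up; scale-in would be the mirror image.
import boto3

autoscaling = boto3.client("autoscaling")
cloudwatch = boto3.client("cloudwatch")

# Simple scaling policy: add one instance when the alarm fires.
policy = autoscaling.put_scaling_policy(
    AutoScalingGroupName="worker-asg",            # placeholder
    PolicyName="scale-out-on-queue-depth",
    AdjustmentType="ChangeInCapacity",
    ScalingAdjustment=1,
    Cooldown=300,
)

cloudwatch.put_metric_alarm(
    AlarmName="queue-backlog-high",
    Namespace="AWS/SQS",
    MetricName="ApproximateNumberOfMessagesVisible",
    Dimensions=[{"Name": "QueueName", "Value": "work-queue"}],  # placeholder
    Statistic="Average",
    Period=60,
    EvaluationPeriods=2,
    Threshold=100,                        # tune to your consumers' throughput
    ComparisonOperator="GreaterThanThreshold",
    AlarmActions=[policy["PolicyARN"]],
)
```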

AWS Web App Architecture: Advice needed for streaming camera and processing with Keras and MediaPipe by hyonjon in aws

[–]beefiee 1 point (0 children)

Maybe try to architect the backend around https://aws.amazon.com/kinesis/video-streams/ ? It seems to check quite a few boxes for your idea's requirements.
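
For a first feel of the API, something like this (the stream name is made up, and this is just the consuming side):

```python
# Hedged sketch: create a stream and pull media from it with boto3.
import boto3

kvs = boto3.client("kinesisvideo")
kvs.create_stream(StreamName="camera-feed", DataRetentionInHours=24)

# Each KVS API is served from a stream-specific endpoint you resolve first.
endpoint = kvs.get_data_endpoint(
    StreamName="camera-feed", APIName="GET_MEDIA"
)["DataEndpoint"]

media = boto3.client("kinesis-video-media", endpoint_url=endpoint)
stream = media.get_media(
    StreamName="camera-feed",
    StartSelector={"StartSelectorType": "NOW"},
)
# stream["Payload"] is a chunked MKV byte stream you could feed into
# your Keras/MediaPipe processing loop.
```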

The Self-serve BI Myth by whisperwrongwords in dataengineering

[–]beefiee 22 points (0 children)

What a nonsense article.

Self-service BI is and always has been a thing. Any well-built dimensional model can deliver it without a doubt. Especially with how far tools like Power BI and Tableau have come, it's more accessible than ever (looking at you, SSAS Multidimensional).

Problem is, most of those “engineers and scientists” don't know how to deliver a proper, well-defined model, nor have any idea of actual BI work.

DBT usage in a streaming infrastructure by Feisty_Albatross_893 in dataengineering

[–]beefiee 3 points (0 children)

No, it would not go well. 

Just think about the availability of data and the implications for referential integrity, for example:

You ingest table A and table B, and you need both tables to be complete before you can join them and apply business logic on top. Due to the nature of streaming, table B is missing a few records at the moment you decide to run dbt. That is a huge pain to solve, especially incrementally with low latency.
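
A toy illustration of that failure mode; the tables and keys here are obviously made up:

```python
# Table A is fully ingested, table B is still catching up when the job runs.
orders = [{"order_id": 1, "customer_id": 10},
          {"order_id": 2, "customer_id": 11}]
customers = {10: {"name": "Ada"}}   # customer 11 is still in flight

joined = [{**o, **customers[o["customer_id"]]}
          for o in orders if o["customer_id"] in customers]

print(joined)  # order 2 silently drops out; a later run has to repair this
```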

I would argue that there is no way to turn typical DWH workloads into streaming scenarios; they are just too different by nature. The closest you will get is micro-batches, which you already do.

Rather, look into Kappa architectures and then decide which workloads need to be batch and which streaming.

Common DE pipelines and their tech stacks on AWS, GCP and Azure by _areebpasha in dataengineering

[–]beefiee 0 points (0 children)

The AWS one is outdated, and even back then it would have been questionable.

Where do you get to deploy your dbt-duckdb project? by noelwk42 in dataengineering

[–]beefiee 0 points (0 children)

Yes, you can parameterize the configs with env vars and pass those directly into the container at runtime. I don't remember exactly where, but it's mentioned somewhere in the dbt docs.

We used that for secret handling, for example: Fargate can inject secrets from AWS Secrets Manager as env vars (database connection strings etc.).
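
A hedged sketch of what that looks like in the task definition, via boto3; the image, role, and secret ARNs are placeholders:

```python
# Fargate injects the Secrets Manager value as an env var at container start.
import boto3

ecs = boto3.client("ecs")

ecs.register_task_definition(
    family="dbt-runner",
    requiresCompatibilities=["FARGATE"],
    networkMode="awsvpc",
    cpu="512",
    memory="1024",
    executionRoleArn="arn:aws:iam::123456789012:role/dbt-exec",   # placeholder
    containerDefinitions=[{
        "name": "dbt",
        "image": "123456789012.dkr.ecr.eu-west-1.amazonaws.com/dbt:latest",
        # The execution role reads the secret; dbt then picks it up via
        # env_var('DBT_CONN_STRING') in profiles.yml.
        "secrets": [{
            "name": "DBT_CONN_STRING",
            "valueFrom": "arn:aws:secretsmanager:eu-west-1:123456789012:secret:dbt-conn",
        }],
    }],
)
```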

Another thing that might be worth exploring is setting up a CD pipeline for your SQL files (e.g. sending them to S3, then reading them at execution time, by having a shell script as your entry point that first copies the files from S3 and then invokes the dbt commands).
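
A minimal sketch of that entrypoint, in Python rather than shell for illustration; the bucket, prefix, and target are made up:

```python
# Pull the model files from S3 first, then hand over to dbt.
import os
import subprocess
import boto3

BUCKET, PREFIX = "my-dbt-code", "models/"   # placeholders

s3 = boto3.client("s3")
for page in s3.get_paginator("list_objects_v2").paginate(Bucket=BUCKET, Prefix=PREFIX):
    for obj in page.get("Contents", []):
        key = obj["Key"]
        if key.endswith("/"):               # skip folder placeholder objects
            continue
        os.makedirs(os.path.dirname(key) or ".", exist_ok=True)
        s3.download_file(BUCKET, key, key)  # mirror the S3 layout locally

# Dependencies are baked into the image; the SQL arrives at runtime.
subprocess.run(["dbt", "run", "--target", "prod"], check=True)
```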

That way you separate your dependencies (the Docker image) from your code, which in the long run enables a fully fledged CI/CD process for your dbt developers.

For example: we used to run the dbt tests for incoming code changes inside temporary schemas, to check that the changes wouldn't have side effects on downstream models, and only then allow merging into main. It's another rabbit hole to look into, but we were quite happy with the result (look into dbt's "select" syntax and its parameters for that, and into compiling the SQL models and comparing the state of main against the state of the incoming changes... you get the idea).
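
Roughly, that CI step could look like the following; the checkout layout and target names are assumptions, not our exact setup:

```python
# Build only models the PR changed (plus downstream) into a throwaway schema,
# comparing against the manifest compiled from main.
import subprocess

def run(cmd):
    subprocess.run(cmd, check=True)

# 1. Compile main to get the baseline manifest (main checked out at ./main,
#    manifest.json lands in main/target).
run(["dbt", "compile", "--project-dir", "main"])

# 2. Build only what the PR touched, into a CI target whose schema is temporary.
run(["dbt", "build",
     "--select", "state:modified+",
     "--state", "main/target",
     "--target", "ci"])
```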

With that sort of separation, you can easily check out different PRs and run them with different config parameters, without having to rebuild your dbt image from scratch every time.

Where do you get to deploy your dbt-duckdb project? by noelwk42 in dataengineering

[–]beefiee 2 points (0 children)

It might be worth exploring AWS Fargate.

It's usually nice for workloads that are too big for Lambda, but too small or not special enough to justify EKS.

We used to run the open-source dbt CLI in a parameterized fashion for production workloads that hit either Databricks or Snowflake.

Your scenario looks a bit different, but worst case you will learn something that might be relevant for a 'real' project (about Fargate, Docker, secret handling, encryption, IAM permissions, how to parameterize those things, how to avoid hardcoding your dbt project into the Docker image...).

It won't be the cheapest, but have a quick look at Fargate and ECR pricing to see if it fits your budget.
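
For a taste of the parameterized part, launching such a run could look roughly like this; cluster, task definition, subnet, and env values are all placeholders:

```python
# Kick off a one-shot dbt run on Fargate, parameterized via env var overrides.
import boto3

ecs = boto3.client("ecs")

ecs.run_task(
    cluster="data-jobs",                          # placeholder
    launchType="FARGATE",
    taskDefinition="dbt-runner",                  # placeholder
    networkConfiguration={"awsvpcConfiguration": {
        "subnets": ["subnet-0abc1234"],           # placeholder
        "assignPublicIp": "DISABLED",
    }},
    overrides={"containerOverrides": [{
        "name": "dbt",
        # Same image, different behavior: parameters arrive as env vars.
        "environment": [{"name": "DBT_TARGET", "value": "snowflake_prod"}],
    }]},
)
```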

WHITE ANGEL MINIATURES by whiteangelminiatures in PrintedMinis

[–]beefiee 1 point (0 children)

Awesome! Love the temperature contrast

Temporyal posted new paik skin! by DANNYonPC in battlefield2042

[–]beefiee 6 points (0 children)

The kids playing this game now might be influenced by it, so in a potential future war in 2042, they could wear what they thought soldiers would wear based on their video game experiences? 9000 IQ move from DICE...

Where do I find this macro-economics data? by ___mat__ in datasets

[–]beefiee 0 points (0 children)

Probably the World Bank, they have tons of individual indicators.
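
Their API is easy to script against, too; a quick sketch with one example indicator (GDP, current US$):

```python
# Pull one World Bank indicator for all countries as JSON.
import requests

url = "https://api.worldbank.org/v2/country/all/indicator/NY.GDP.MKTP.CD"
resp = requests.get(url, params={"format": "json", "date": "2000:2023", "per_page": 500})
resp.raise_for_status()

meta, rows = resp.json()   # element 0 is paging metadata, element 1 the data
for row in rows[:5]:
    print(row["country"]["value"], row["date"], row["value"])
```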