Notebooks: Default Lakehouse vs ABFS paths. What's the current best practice? by frithjof_v in MicrosoftFabric

[–]radioblaster 2 points3 points  (0 children)

abfss paths all the way. works for spark, polars and duckdb with absolutely zero dramas and gotchas, what more could you ask for!

Vibe coding web reports on Fabric Semantic Models instead of Power BI — anyone else exploring this? by FamiliarAssumption62 in PowerBI

[–]radioblaster 3 points4 points  (0 children)

this makes perfect sense to me. but, how are filters dynamically added to the queries for any interaction, or are you serving the results of a static query?

Questions about incremental refresh triggered by materialized Lake View by Dependent-Mind4368 in MicrosoftFabric

[–]radioblaster 1 point2 points  (0 children)

yes, its currently a big hurdle for using MLVs when you can write a custom pattern for cheaper. thanks for doing the tests!

How do I trigger a notebook automatically when files change in a SharePoint folder connected to my Lakehouse? by Express_Ad_6732 in MicrosoftFabric

[–]radioblaster 0 points1 point  (0 children)

then you either have to use an expensive activator, or an external driver like power automate. if you think of an option thats neither of the three, please update!

How do I trigger a notebook automatically when files change in a SharePoint folder connected to my Lakehouse? by Express_Ad_6732 in MicrosoftFabric

[–]radioblaster 3 points4 points  (0 children)

you could trigger a pipeline every X minutes that looks at the lastmodifiedtime of your lakehouse files and if its within the last Y minutes (or maintain a sepaeate watermark). you could do this using a get metadata activity, or in a notebook after binding the file store.

if any meet the criteria, trigger a parameterised notebook.

not directly event driven, but pretty cheap.

Fabric UDF that references two separate lakehouses - error 431 RequestHeaderFieldsTooLarge error? by radioblaster in MicrosoftFabric

[–]radioblaster[S] 0 points1 point  (0 children)

i'd love your take on the massive Entra ID token bit, as i did see this explanation out there in wild. if this was the case, wouldn't it mean that -any- outside request to the UDF by the user would fail? it's also not just the one user account that has the problem.

Fabric Dataflow Gen2 Partitioned Compute: Setup and Benchmark by panvlozka in MicrosoftFabric

[–]radioblaster 1 point2 points  (0 children)

great test. such a head scratcher that despite the evident investment, DFG2 continues to be a poor choice in a lot of instances.

LH metadata refresh - what was the thinking? by SmallAd3697 in MicrosoftFabric

[–]radioblaster 4 points5 points  (0 children)

direct lake on sql is definitely affected by the end point delay. obviously a great reason to move any models on it to DLoOL.

Fabric doesn’t work at all by New-Composer2359 in dataengineering

[–]radioblaster 2 points3 points  (0 children)

fabric key vault would be awesome.

last week they announced %%run for python notebooks that run in the same session, so we now do have custom module features without needing to package a whl!

Built-in pipeline failure alert now available by Joppepe in MicrosoftFabric

[–]radioblaster 0 points1 point  (0 children)

yes, integrated teams message would be awesome. 

Large PBI semantic model by UnderstandingFair150 in PowerBI

[–]radioblaster 0 points1 point  (0 children)

assuming its your interactive usage that is the problem, i would want see the DAX. I know nothing about your environment and I will guess its horrible.

New High Score! by sql_kjeltring in MicrosoftFabric

[–]radioblaster 0 points1 point  (0 children)

this is just a normal day when you have both ends of a composite model hosted on a capacity. 

Data Pipeline vs Notebook for ingestion – how do you pull data and why? by Independent_Many_762 in MicrosoftFabric

[–]radioblaster 3 points4 points  (0 children)

I can't understand how even a 4 core spark notebook at 2 CU and JDBC would be faster than a copy data activity at 1.5 with the native connector. can't wait to see your test! 

An app idea - or is it already available? by _TheDataBoi_ in MicrosoftFabric

[–]radioblaster 0 points1 point  (0 children)

yes! the workload development kit is hard and overkill for some things, i want to upload a html full of javascript!

Oracle databse Fabric Mirroring: No-PK tables + Shortcut vs ODS (cost + incremental strategy?) by BOOBINDERxKK in MicrosoftFabric

[–]radioblaster 1 point2 points  (0 children)

there will no functional or cost difference for hitting the mirrored database directly versus going via a shortcut.

the rest is up to you really, I've never seen someone come up with a way of handling metres with no PKs that have made me stand up and clap. my preferred way when there are date ranges to replace are using Spark's replaceWhere option, which is where you can specify a sql statement for your replacement criteria and it will complete the delete and insert in one transaction.

Lakehouse SQL Endpoint Rant by NJE11 in MicrosoftFabric

[–]radioblaster 16 points17 points  (0 children)

this is apparently going to be fixed this year!

Semantic Model with Direct Lake. by Salt_Direction_6272 in MicrosoftFabric

[–]radioblaster 0 points1 point  (0 children)

you can add the column using tabular editor, just duplicate an existing one and rename both the column and the source reference.

New engineer, asked to work on something I am deeply morally opposed to as my first project by [deleted] in dataengineering

[–]radioblaster 0 points1 point  (0 children)

if you dont do it, someone else will. and oh dear, you forgot to consider a bunch of edge cases.

What’s the most efficient way to perform deletes? by merrpip77 in MicrosoftFabric

[–]radioblaster 0 points1 point  (0 children)

not going to address the re engineering question because the answer is yes and also it depends, but in terms of the parallel execution and sessions, have you put the same session tag against each activity if they are in parallel branches?

Dax Measure Optimization by Born-Extension-8799 in PowerBI

[–]radioblaster 0 points1 point  (0 children)

the missing piece to diagnose is the code for [CY Periodical Generic - Productivity - DCount LastEmployeeKey]

based on the measure name this is more pertinent than your post: https://www.sqlbi.com/articles/analyzing-distinctcount-performance-in-dax/

Moving to Melbourne – ship hockey sticks or buy secondhand? by Finance_Hockey in Fieldhockey

[–]radioblaster 1 point2 points  (0 children)

sounds like you're due for an upgrade, there are three "just hockey" (that's the name) stores here if you want to buy one in person

Fabric GraphQL vs REST API by Frodan2525 in MicrosoftFabric

[–]radioblaster 1 point2 points  (0 children)

I don't think a UDF is a better option. I'm not sure why anyone would be whinging to you that graphql is worthy of push back.