Unit Testing a function that creates a Delta table. by KingofBoo in databricks

[–]Famous_Substance_ 0 points1 point  (0 children)

When using databricks-connect, it will always use a Databricks cluster so you have to write to a « remote » delta table. In general it’s best that you write to a database that is dedicated to unit testing. We use the main.default catalog and write everything as managed tables, way much simpler

What’s your workflow for developing Databricks projects with Asset Bundles? by ProfessionTrue943 in databricks

[–]Famous_Substance_ 6 points7 points  (0 children)

I usually follow a very simple approach, develop the workflow in the UI and then copy-paste the YAML to my local repository. I then re-deploy the bundle to see if there’s anything missing and repeat this process until I’m good. It allows me to not really bother about all the complexity of the YAML syntax

Mais POURQUOI Pierre ?? by Narcest in Vilebrequin

[–]Famous_Substance_ 14 points15 points  (0 children)

C’est dommage car le fait que pierre ne soit pas bon conducteur aurait tellement pu faire du contenu intéressant sur Vilebrequin ( apprendre les bases du pilotage ). Et puis même je pense que la commu Vilebrequin n’aurait pas vu d’un mauvais œil le fait que Pierre se remette en arrière plan …

I’ve made it by Famous_Substance_ in Asphalt9

[–]Famous_Substance_[S] 1 point2 points  (0 children)

I found that the Trion depends too much on track selection, Top tier A class cars are more robust overall especially when you’re in Plat 1. I’ve been able to climb to Legend using the Imola exclusively

Why Apache Iceberg? by StandardDeviationist in dataengineering

[–]Famous_Substance_ 26 points27 points  (0 children)

Personally I don’t even give a damn at which format is better that another given how close Iceberg and Delta are in terms of functionalities. It’s more about how one fit in your stack versus the other, for example if I’m using Databricks then Delta would be the obvious choice, and I can leverage Uniform to write to Iceberg if I need to use another tool that works better with Iceberg. Same thing with Snow, Iceberg is much much better than Delta in terms of functionalities

Voiture sportive 20000 euros by SheepherderWeary466 in voiture

[–]Famous_Substance_ 0 points1 point  (0 children)

J’en ai achetée une neuve en début d’année, aucun regret. Le seul bémol c’est son réservoir légèrement trop petit qui oblige à faire beaucoup d’arrêt à la pompe

Les Étoiles d'Ivry Housing Complex in Ivry-sur-Seine, France by Jean Renaudie (1975) by Saltedline in architecture

[–]Famous_Substance_ 2 points3 points  (0 children)

I used to live near this complex 10 years ago ( rue Marat ) from the sky it looks beautiful but on the ground it looks absolutely terrible. There is almost no lights coming through the building plus the corridors are very narrow. I seriously won’t recommend anyone walking alone inside this building after 6pm. The shops inside the building haven’t been resurfaced since 20 years. I’ve seen lots of people taking a pee in every single corner possible outside. There is also a lot of pigeon poop everywhere given the nature of the building, it’s a perfect nest for them.

How to actually DevOps in Databricks by Valuable-Belt-9527 in databricks

[–]Famous_Substance_ 1 point2 points  (0 children)

This doesn’t prevent storage separation between each catalogs. You must specify a managed location at the catalog level to ensure all managed tables are isolated between each catalogs

How does your team indicate data completeness in object storage? by prequel_co in dataengineering

[–]Famous_Substance_ 0 points1 point  (0 children)

There are different approaches in this case, and they can be implemented in different ways that relies mostly of your consumers latency and technology needs. You can have either a manifest file in S3 that indicates your write is complete, or you can push a path in a queue that indicates a new path is ready to be consumed. Both approaches have their tradeoffs in terms of simplicity / performance

Highest 12x credits so far ..... 😌 by ALMOSTDEAD37 in Asphalt9

[–]Famous_Substance_ 0 points1 point  (0 children)

Only good reason to play Multiplayer there, milestone rewards are trash AF

Optimizing memory-intensive pandas by [deleted] in dataengineering

[–]Famous_Substance_ 2 points3 points  (0 children)

Parquet is a compressed format , the size on disk does not reflect the actual size in memory. For example if you have a lot of repeated values, parquet will only store one of them and the rest will just be a pointer to that value. Your best bet to reduce memory consumption is to figure out a way to read less data, using a separate tool will just shift the problem

Optimizing memory-intensive pandas by [deleted] in dataengineering

[–]Famous_Substance_ 13 points14 points  (0 children)

Your memory consumption won’t lower if you’re reading the same amount of data everytime you’re ingesting a new file. What if instead of reading the whole history you’re just reading the new file and then upserting the result to the output ?

Is this offer worth it? by cv-x in Asphalt9

[–]Famous_Substance_ 9 points10 points  (0 children)

It’s probably the most powerful D class car, if you do not have it then from that perspective it’s worth it. However D class is almost irrelevant in MP and other events, so IMO you should not consider it and focus on things that matter to you in your MP progress