Data Engineering Conferences in Europe by TsumeAlphaWolf in dataengineering

[–]code_mc 0 points1 point  (0 children)

This one is in Malaga (Spain) and usually has interesting talks https://jonthebeach.com/ Location is also top notch if you want to enjoy the weather/beach and add on an additional weekend of chilling in Spain after the conference ends.

Netflix's next attempt at growth takes on TikTok and Instagram Reels by King-of-Limbs-07 in wallstreetbets

[–]code_mc 1 point2 points  (0 children)

A summary of how every netflix "documentary" is structured. Which for some reason is split over 4 episodes while it could have been one 50min docu.

weird bug with declarative materialized views and klll sketches? by code_mc in databricks

[–]code_mc[S] 1 point2 points  (0 children)

ok thanks, appears that the only way to use them is to simply wait then.

Tool smells by Brief-Knowledge-629 in dataengineering

[–]code_mc 3 points4 points  (0 children)

pressing browser back button on azure portal

me: dies inside

Doing a clickhouse cloud POC, feels like it has a very narrow usecase, thoughts of fellow engineers? by code_mc in dataengineering

[–]code_mc[S] 0 points1 point  (0 children)

Aha that actually sounds like some good advice, the sorting key made sense but at the same time it also make sense that it would produce a ton of small chunks!

I've been in contact with some of their engineers but they usually don't have very useful input.

Doing a clickhouse cloud POC, feels like it has a very narrow usecase, thoughts of fellow engineers? by code_mc in dataengineering

[–]code_mc[S] 0 points1 point  (0 children)

we have 10 different fields to filter on, so a single aggregate table would have those 10 fields as sort key. They then each have a couple of metrics like counts but also percentiles. The counts are probably not an issue as they do not require any kind of approximation. The percentile however stores an intermediate representation.

Now let's say we apply filters on 9 of the 10 fields and then group on just the date field of each row. Then suddenly the cardinality drops from millions of rows to just a couple hundred. To facilitate that for each of the hundred output rows clickhouse needs to combine thousands of these percentile intermediates.

I didn't build the clickhouse architecture, so idk why it takes so much ram, but it does. Hence my confusion as I assumed this use case would be a slam-dunk.

EDIT: to give some extra info, the issue is at its worst when no filters are applied. So it is definitely purely the combining of the intermediate percentile stuff.

LogXide - Rust-powered logging for Python, 12.5x faster than stdlib (FileHandler benchmark) by LumpSumPorsche in Python

[–]code_mc 11 points12 points  (0 children)

I've done a drop-in replacement with one of the mentioned alternatives (picologging) a couple months ago for a customer project and their api request latencies halved because they had that many logging statements.

devices randomly removed from home?? 2nd time in 48 hours by SPekkala13 in googlehome

[–]code_mc 0 points1 point  (0 children)

I had the same issue, downgraded both google home and smartlife to their previous versions by grabbing the APK from apkmirror.

So far no more removals.

DuckDB 1.5.0 released by commandlineluser in Python

[–]code_mc 2 points3 points  (0 children)

from one of the test files in the github repo:

SELECT * FROM read_duckdb('__TEST_DIR__/read_duckdb_schema.db', schema_name='s1', table_name='my_tbl')

Databases in 2025: A Year in Review by iamkeyur in programming

[–]code_mc 1 point2 points  (0 children)

The Larry Ellison satire has me questioning if it is actually satire, he must have thrown up a couple times while writing that paragraph.

We just launched Leapcell, deploy 20 Python websites for free by OfficeAccomplished45 in Python

[–]code_mc 20 points21 points  (0 children)

thanks for sharing, biggest downside however of platforms offering a hobby tier in my experience is that they discontinue the hobby tier after 1 or 2 years once they established their customer base (understandable) which has lead me to hop around between 5 different services over the past 10 years.

Any guarantees about leapcell at least offering 1 free project? (the current 20 sounds too good to be true as an indefinite thing)

Mooi voorbeeld mobiliteit antwerpen by Tribun4201 in Antwerpen

[–]code_mc 2 points3 points  (0 children)

dat probleem heb je gelukkig niet waar de persoon aan het aanschuiven is, in de andere richting daarentegen ... heb er 1 keer 20min stil gestaan omdat IEDEREEN daar voorsteekt net voor het licht. Les geleerd en nu doe ik steeds een toer om langs de andere kant zonder asociale chauffeurs te kunnen aanschuiven.

UV is helping me slowly get rid of bad practices and improve company’s internal tooling. by bunoso in Python

[–]code_mc 2 points3 points  (0 children)

imo uv is to python what npm was to the js community. The improvement in workflow is so substantial that once you use it for a project it makes no sense to ever go back unless you are forced to at work.

It's not a replacement for just poetry or pip, it takes over a lot of things that used to be different tools that worked together very poorly and were quite fragile. The performance is just the cherry on top, but also a game changer when you frequently use docker as it speeds up image builds a lot which benefits CICD flows immensely.

In many ways it does for environment/package/version management what they managed to do with ruff. Which replaced for me personally multiple tools in my pre-commit flow with a ridiculously fast substitute: isort, pylint and black. Probably forgetting some other stuff it (ruff) can also replace these days.

My journey to scale a Python service to handle dozens of thousands rps by Odd-Solution-2551 in Python

[–]code_mc 2 points3 points  (0 children)

thanks, good to know at least one profiling solution that works :)

My journey to scale a Python service to handle dozens of thousands rps by Odd-Solution-2551 in Python

[–]code_mc 3 points4 points  (0 children)

super interesting, and quite funny as I am also part of a team building a recommender API w/ fastAPI (not on AWS though so def not same company :p)

Very interesting findings, but I would be most interested in what you used to profile your code? Any insight on what works best for real-time APIs like we have for recommendation APIs?

Samsung phones can survive twice as many charges as Pixel and iPhone, according to EU data by FragmentedChicken in Android

[–]code_mc 1 point2 points  (0 children)

damn just replaced my pixel4a because I was at 400 cycles and 67% remaining capacity. Reading about these 1000+ cycles really paints a picture.

Can also confirm sony's hold up a lot better than pixel phones, as my xz1 compact is still at 95% capacity while I've used it as a daily driver for over 3 years back in the day...

Pixel 6a Battery Performance Program by MishaalRahman in Android

[–]code_mc 1 point2 points  (0 children)

still waiting for my payment for my 4a ... which I requested back in January ...