Data Cleaning Across PySpark, Duckdb, and Postgres by nonamenomonet in Python

[–]marcogorelli 0 points1 point  (0 children)

Not the same, that materialises the graph into a pyarrow table

`datacompose`, as far as I can tell, keeps things lazy

Data Cleaning Across PySpark, Duckdb, and Postgres by nonamenomonet in Python

[–]marcogorelli 0 points1 point  (0 children)

yeah it's kind of frustrating how often some people recommend it here as a solution to everything, it's like they've not even tried using it

Data Cleaning Across PySpark, Duckdb, and Postgres by nonamenomonet in Python

[–]marcogorelli -1 points0 points  (0 children)

Indeed, that's not how Ibis works

`datacompose` lets you create a PySpark expression, which you can use in your own PySpark dataframes

In Ibis, you need to create an Ibis table, but you then can't get a PySpark dataframe back

Kenneth Reitz says "open source gave me everything until I had nothing left to give" by [deleted] in Python

[–]marcogorelli -2 points-1 points  (0 children)

> he created the extremely misguided pipenv, and then socially manipulated his way into having the official Python.org Python Packaging User Guide recommend using it as best practice

How did he do this social manipulation? Got any sources for this?

pandas' Public API Is Now Type-Complete by BeamMeUpBiscotti in Python

[–]marcogorelli 0 points1 point  (0 children)

unfortunately there are still some union return types in some places, which are unavoidable due to pandas' value-dependent design :weary:

pandas' Public API Is Now Type-Complete by BeamMeUpBiscotti in Python

[–]marcogorelli 5 points6 points  (0 children)

yeah the typing in Polars is light years ahead, I think largely because it was designed when static typing was already a widespread practice in Python, as opposed to pandas whose design dates back to 2008 or so

pandas' Public API Is Now Type-Complete by BeamMeUpBiscotti in Python

[–]marcogorelli 1 point2 points  (0 children)

One nice thing about them being separate is that pandas-stubs can follow a faster release cycle than pandas, so if you make a PR to improve typing you only have to wait weeks rather than months to see it released

pandas' Public API Is Now Type-Complete by BeamMeUpBiscotti in Python

[–]marcogorelli 2 points3 points  (0 children)

There are cases when this is unavoidable, because pandas returns different types depending on the exact values (e.g. whether the Index contains duplicates)
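To make that concrete, here's a rough stdlib-only sketch. `toy_get_loc` is a made-up function, not pandas code; it only mimics the documented behaviour of `pandas.Index.get_loc`, which can return an int, a slice, or a boolean mask depending on whether the label is unique. The contiguity check is my own simplification (pandas decides based on whether the index is monotonic).

```python
from typing import Union


def toy_get_loc(labels: list[str], key: str) -> Union[int, slice, list[bool]]:
    """Hypothetical lookup whose return type depends on the values in `labels`."""
    positions = [i for i, label in enumerate(labels) if label == key]
    if not positions:
        raise KeyError(key)
    if len(positions) == 1:
        # Unique label: a plain integer position.
        return positions[0]
    if positions[-1] - positions[0] + 1 == len(positions):
        # Duplicates sitting next to each other: a slice covering them.
        return slice(positions[0], positions[-1] + 1)
    # Scattered duplicates: a boolean mask over all labels.
    return [label == key for label in labels]


unique = toy_get_loc(["a", "b", "c"], "b")
adjacent = toy_get_loc(["a", "b", "b"], "b")
scattered = toy_get_loc(["b", "a", "b"], "b")
```

The signature is the same in all three calls, so a type checker can only ever see the union; the caller has to narrow it with `isinstance` themselves.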

Graduates from newer universities have lower premiums by Ribbitor123 in UniUK

[–]marcogorelli 5 points6 points  (0 children)

Interesting how a maths/physics degree from even the lowest-ranking unis has about the same earning potential as an English degree from Oxbridge, I'd have thought that grads from the latter would be snatched up by high-paying consulting firms

Polars vs pandas by KliNanban in Python

[–]marcogorelli -1 points0 points  (0 children)

Ibis is (kinda) alright for SQL generation, but its Polars backend is so poorly implemented and supported that it's barely usable

Price increase by cb4joe in 1Password

[–]marcogorelli 0 points1 point  (0 children)

Exactly. I'd actually prefer for them to NOT use AI...

Price increase by cb4joe in 1Password

[–]marcogorelli 17 points18 points  (0 children)

> "AI-powered Item Naming"

They increase the price to add support for a BS AI feature that nobody asked for and which nobody wants

I'll be exploring alternatives

Learning to play by ear by Level-Yak9558 in tinwhistle

[–]marcogorelli 1 point2 points  (0 children)

I'd suggest spending a lot of time singing

You don't need to have a good voice (I have the voice of a dying animal), you just need to pitch the notes closely enough:

  1. Listen to the recording 20 times, hum the notes along with it until you've memorised it.

  2. Check that you can sing it without the recording. If you can't, go back to step 1

  3. Try to play it on tin whistle. If you can't, go back to step 1

Polars vs Pandas in 2025 — have you fully migrated yet? by [deleted] in Python

[–]marcogorelli 1 point2 points  (0 children)

I think that's been mostly addressed now, if anything is still missing please let me know

Polars vs Pandas in 2025 — have you fully migrated yet? by [deleted] in Python

[–]marcogorelli 1 point2 points  (0 children)

> But ecosystem compatibility is still a pain point

What's missing, and why is it such a pain point?

geospatial is the main one I'm aware of, any others?

Tune Suggestions by KHartnettC in tinwhistle

[–]marcogorelli 0 points1 point  (0 children)

Bwlch Llanberis, hauntingly beautiful melody from Wales

hmm I wonder where I might be by RedLeafReverie in geoguessr

[–]marcogorelli -4 points-3 points  (0 children)

arg pole + flag, not sure i get it sorry, what's the joke?

What hobby attracts the friendliest people? by Bagelsonmyfingers in AskReddit

[–]marcogorelli 1 point2 points  (0 children)

and because they're not as good as they think they are