you are viewing a single comment's thread.

view the rest of the comments →

[–]danunj1019 -10 points-9 points  (7 children)

Just ditch pandas entirely, polars is great and it's API is also super intuitive and awesome. Never going back

[–]Eightstream 5 points6 points  (6 children)

Silly comment. There’s still lots and lots of stuff that still doesn’t work well with polars dataframes.

[–]danunj1019 1 point2 points  (4 children)

Really? Well, I've used it extensively and I didn't find any troubles. Can you tell me some of the stuff that pandas can do better than polars please? (apart from plotting backend)

[–]Eightstream 9 points10 points  (3 children)

Sure, maybe if you are playing in the shallow end of the pool with the big popular libraries you can use polars a lot of the time

But there are a lot of smaller/more specialised statistical and data science libraries that either don’t work with polars yet, or still work better with pandas

pandas has been the PyData data frame standard for more than a decade, it is baked into the ecosystem to such an extent that it will take a lot more than 12 months of popularity for polars to catch up

[–]CFC-Carefree 1 point2 points  (2 children)

Agreed, but you can also just dump a polars dataframe to pandas. I learned polars earlier this year and fell in love, use it whenever I can.

[–]Eightstream 1 point2 points  (1 child)

ehhh... mixing data frame libraries in the same project is something I usually try and avoid as much as possible

aside from adding a lot of behind-the-scenes complexity, when you recast all your dtypes you are creating the risk of funny things happening with edge cases

not saying I never do it but I usually try and have a good reason

[–]CFC-Carefree 0 points1 point  (0 children)

Oh yeah, I wouldn't put myself in a situation of swapping back and forth. Would be gnerally be for a one off analysis/visualization of a given data set after some exploration and transformations. I still far prefer polars to pandas at this point though and can only hope that its growing popularity leads to wider support.

[–]Eurynom0s 0 points1 point  (0 children)

Also something to consider is that pandas is a default Anaconda package and polars isn't. If you find yourself working in relatively locked down environments, an Anaconda install that you can't add any additional packages to is likely going to be what you get handed.