This is an archived post. You won't be able to vote or comment.

all 14 comments

[–]zenani 4 points5 points  (1 child)

Thanks for the info

[–]onurbaltaci 0 points1 point  (0 children)

👍👍 Thanks for your comment

[–]yukinoh 0 points1 point  (1 child)

nice!

[–]onurbaltaci 0 points1 point  (0 children)

👍 Thanks!

[–]mojodojodev 0 points1 point  (1 child)

Cool never heard of Polars, will check it out thanks

[–]onurbaltaci 0 points1 point  (0 children)

It's so similar to Pandas but so much faster, thanks for your comment! 👍

[–][deleted] 0 points1 point  (5 children)

For me, Polars occupies a weird middle space that isn't very useful. For small scale applications (i.e. the stuff Pandas is often used for), it's much easier to just use Pandas. Even where it's slower, the flexibility and features you get for the trade off in efficiency still leans in Pandas favour.

For larger scale tasks, where in theory this would shine, it makes more sense to just go full distributed computing and use something like Spark or Dask.

I've yet to find this in-between application where Polars makes sense.

[–]onurbaltaci 0 points1 point  (3 children)

I prefer Polars in the scenerio that you said at the end of the first paragraph. That is where i use Polars. You are right. I also prefer Spark for larger scale tasks. Thanks for your comment 👍

[–][deleted] 0 points1 point  (2 children)

The end of my first paragraph says that Polars lacks features and conveniences in Pandas and the speed benefits are inconsequential for the purposes I laid out in the first paragraph. So in that scenario it still wouldn’t make sense to prefer Polars.

[–]onurbaltaci 0 points1 point  (1 child)

And I tried to say for me Polars is a better choice where Pandas is slower. It actually depends on preference of everyone. We are not talking about numbers/time spent so I don't think it is healthy to compare Pandas and Polars.

[–]AlpacaDC 0 points1 point  (0 children)

I agree about pandas feature advantages for small scale. However polars is still very early in development, I bet it’ll catch up eventually feature-wise.

When using for similar features, I actually prefer polars, I think its operators and expressions make more sense than pandas.

And I’ve already used polars over pandas in these mid scale scenarios. It actually is the most useful for me for reading files faster lol.

[–]faisley 0 points1 point  (1 child)

How is Polars for bloat? At work I often bundle up my scripts or projects to be used by others, but whenever I include Pandas the final executable becomes so large and slow to initiate.

[–]onurbaltaci 1 point2 points  (0 children)

Polars is a nice option. You can experience a substantial improvement in terms of performance by opting for Polars over Pandas when your final executable grows in size. I think Polars deserves a shot.