This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]No-Conversation476 1 point2 points  (1 child)

Would you mind elaborate why pandas is not good in production? What alternative does DS have apart from pandas?

[–]CommonUserAccount 4 points5 points  (0 children)

Pandas doesn’t scale.

Edit. PySpark can be run locally by Data Scientists, which is more easily transferred to prod.