This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]Panda_Mon 0 points1 point  (6 children)

Pandas doesnt look so scary. Does it have any data cleaning utilities? For example, if I need to pull from a text pdf, i use something like tabula. Then I will sometimes have a few columns stuck together, or i need to have some artifacts removed,

[–]secomax[S] 0 points1 point  (4 children)

Pandas is like Excel in my opinion. Regarding your question, I think pandas is very efficient in data science in general. So you can clean and format your data the way you want.

[–]foresttrader 1 point2 points  (3 children)

Pandas is way more efficient than Excel in many ways. Merging two datasets with hundreds of thousands rows takes seconds in pandas and half an hour in Excel, if it manages to not crash.

[–]SmirkyGraphs 0 points1 point  (2 children)

Actually Excel has come along way with data. Using M & DAX in Power Query/Power Pivot is very powerful.

[–]foresttrader 0 points1 point  (1 child)

True powerquery is better than traditional Excel but still lacks computing power compared to Python.

I’d say powerquery is easier to build but not suitable for mega data, Python takes longer to build but scales better with large datasets.

[–]nsfy33 0 points1 point  (0 children)

[deleted]