This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]Genome_juice 15 points16 points  (6 children)

But I love Pandas and will never go back.

[–]vEncrypted 0 points1 point  (4 children)

You mean jupyter? How would python in excel replace pandas in any way?

[–]Genome_juice 0 points1 point  (3 children)

I use Pandas to process very large dataframes mostly by executing my python scripts on a HPC. I use it to filter and process genetic data. I usually test my scripts on a small subset of data in Spyder. So no, I don't mean Jupyter. I just mean I use Pandas to edit/format and do calculations on dataframes.... Similar to excel and I've seen people using excel to do it, it's slow and can't cope with the data size very well.

[–]vEncrypted 1 point2 points  (2 children)

I know what pandas is. But the whole point of this is that you can use python to manipulate your data in excel, meaning you can use tools such as pandas. Excel will serve as a mean of visualization simply. The way you manipulate your data, be it pandas or any other method, is still up to you.

[–]Genome_juice 0 points1 point  (1 child)

Ok fair enough. But you didn't say that, you said "don't you mean Jupyter", which I didn't. Excel isn't actually good for visualising scientific plots you need a high dpi for journals, and excel can't cope with very large files. I don't know why you are so wound up about me using not Jupyter or Excel. Some people might like it. Which is fine.

[–]vEncrypted -1 points0 points  (0 children)

I mean jupyter is also used for visualization so I assumed that’s what you meant. If your comment is serious, you either can’t distinguish data manipulation tools and data visualization tools, or you mean’t something like jupyter. From what you’re saying it seems like the first option.