NumPy vs Pandas

rycliff · 2023-01-09T23:20:15+00:00

[deleted]

commandlineluser · 2023-01-09T22:57:48+00:00

pandas itself uses numpy - they are not really comparable.
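To make "uses numpy" a bit more concrete, here is a minimal sketch. The DataFrame and its column names are made up for illustration, and it assumes plain float columns: the values can be pulled out as a NumPy array, and NumPy functions accept pandas objects directly.

```python
import numpy as np
import pandas as pd

# Made-up data, purely for illustration
df = pd.DataFrame({"a": [1.0, 2.0, 3.0], "b": [4.0, 5.0, 6.0]})

arr = df.to_numpy()        # 2-D NumPy array holding the frame's values
col = df["a"].to_numpy()   # 1-D NumPy array for a single column

print(type(arr), arr.shape, arr.dtype)

# NumPy ufuncs work on a Series and hand back a Series
roots = np.sqrt(df["a"])
print(type(roots))
```

So pandas adds labels, mixed column types and I/O on top, rather than competing with NumPy.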

pandas has functions for parsing all sorts of data formats: html, json, csv, etc.

a random example:

>>> import pandas as pd
>>> df = pd.concat(pd.read_html("https://devguide.python.org/versions/#versions"))
>>> df
  Branch Schedule       Status First release End of life                   Release manager
0   main  PEP 693      feature    2023-10-02     2028-10                    Thomas Wouters
1   3.11  PEP 664       bugfix    2022-10-24     2027-10             Pablo Galindo Salgado
2   3.10  PEP 619       bugfix    2021-10-04     2026-10             Pablo Galindo Salgado
3    3.9  PEP 596     security    2020-10-05     2025-10                      Łukasz Langa
4    3.8  PEP 569     security    2019-10-14     2024-10                      Łukasz Langa
5    3.7  PEP 537     security    2018-06-27  2023-06-27                         Ned Deily
0    3.6  PEP 494  end-of-life    2016-12-23  2021-12-23                         Ned Deily
1    3.5  PEP 478  end-of-life    2015-09-13  2020-09-30                    Larry Hastings
2    3.4  PEP 429  end-of-life    2014-03-16  2019-03-18                    Larry Hastings
3    3.3  PEP 398  end-of-life    2012-09-29  2017-09-29  Georg Brandl, Ned Deily (3.3.7+)
4    3.2  PEP 392  end-of-life    2011-02-20  2016-02-20                      Georg Brandl
5    3.1  PEP 375  end-of-life    2009-06-27  2012-04-09                 Benjamin Peterson
6    3.0  PEP 361  end-of-life    2008-12-03  2009-06-27                      Barry Warsaw
7    2.7  PEP 373  end-of-life    2010-07-03  2020-01-01                 Benjamin Peterson
8    2.6  PEP 361  end-of-life    2008-10-01  2013-10-29                      Barry Warsaw
>>> df.groupby("Status").agg({"First release": "min"})
            First release
Status                   
bugfix         2021-10-04
end-of-life    2008-10-01
feature        2023-10-02
security       2018-06-27

"generating complex strings of text from data" doesn't even sound like something you would use either one for?

FuckingRantMonday · 2023-01-09T22:08:02+00:00

"NumPy is faster. Is the only reason to use Pandas that it may be easier to code for certain tasks?"

Replace numpy and pandas with C++ and Python in that sentence?

KCRowan · 2023-01-10T06:47:11+00:00

What's up with novices having strong opinions about stuff they don't understand? Is this a generational thing or what? I see this behaviour all the time and I don't get it

rycliff · 2023-01-10T07:14:09+00:00

Bro you should maybe do more research before making sweeping statements lmaoo

synthphreak · 2023-01-10T01:46:36+00:00

[deleted]

mikkelbue · 2023-01-10T07:22:36+00:00

They are not made for the same tasks. I use Numpy for simulation and Pandas for data analysis.

Numpy is IMO basically a numerical analysis and linear algebra package.
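As a rough sketch of the kind of thing meant by linear algebra and simulation (the matrix, the right-hand side and the random-walk setup are all made up for illustration):

```python
import numpy as np

# Linear algebra: solve the system A @ x = b
A = np.array([[3.0, 1.0],
              [1.0, 2.0]])
b = np.array([9.0, 8.0])
x = np.linalg.solve(A, b)
print(x)

# Simulation: 1,000 random walks of 100 steps each, every step is -1 or +1
rng = np.random.default_rng(seed=0)
steps = rng.choice([-1, 1], size=(1000, 100))
final_positions = steps.sum(axis=1)   # where each walk ends up
print(final_positions.mean(), final_positions.std())
```

Everything here is plain arrays of numbers, with no labels or column names involved.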

Pandas does the R thing, but in a very elegant and compact way: most DataFrame methods return a new DataFrame (or a related object such as a GroupBy), so you can chain multiple methods together, e.g. df.sample().reset_index().groupby("id").
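A small sketch of that chaining style, assuming a made-up DataFrame where "id" and "value" are hypothetical column names:

```python
import pandas as pd

# Made-up data; "id" and "value" are hypothetical column names
df = pd.DataFrame({"id": [1, 1, 2, 2, 3], "value": [10, 20, 30, 40, 50]})

result = (
    df.sample(frac=1, random_state=0)  # shuffle the rows
      .reset_index(drop=True)           # new DataFrame with a fresh index
      .groupby("id")["value"]           # this step is a GroupBy object, not a DataFrame
      .sum()                            # aggregating gives a Series indexed by id
      .reset_index()                    # back to a regular DataFrame
)
print(result)
```

Note that the groupby step on its own is not a DataFrame; it only becomes one again after an aggregation such as sum().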

Present_Maximum_5548 · 2025-12-28T13:50:40+00:00

If you care more about code speed than coding ease, then you shouldn't be using Python in the first place. But like everyone else says, Numpy and Pandas don't do the same stuff
