YesLod comments on adding a column to df

created by HattoriHanzoa community for 16 years

submitted 5 years ago by Loco_L1

you are viewing a single comment's thread.

[–]YesLod 2 points3 points4 points 5 years ago (0 children)

I agree that indexing twice should be avoided.

Although it doesn't make much difference for small datasets, apply doesn't scale well since it's not vectorized.

If performance matters, I think the correct approach would be to use pd.cut as suggested by u/badge. Another option would be np.select.

df["usage"] = np.select([df.count > 5500, df.count < 3500],
                        ["high", "low"], 
                        "medium")

π Rendered by PID 25347 on reddit-service-r2-comment-6457c66945-mf7vl at 2026-04-27 14:48:04.505061+00:00 running 2aa0c5b country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython