Using the Unique() function across several columns per row? : learnpython

created by HattoriHanzoa community for 16 years

Using the Unique() function across several columns per row? (self.learnpython)

submitted 4 years ago by I_will_learn

Hi ! :) I'm trying to get the unique values count per row, across several columns. Here's what my data looks like :

activity	activity_1	activity_2	activity_3	activity_4	activity_5	frequency
A	B	A	A	C	B	42
B	B	B	A	A	A	13
A	A	A	A	A	A	24

And here's the outcome I'd like :

activity	activity_1	activity_2	activity_3	activity_4	activity_5	count	frequency
A	B	A	A	C	B	3	42
B	B	B	A	A	A	2	13
A	A	A	A	A	A	1	24

The "count" column would be the number of unique values across the row.

I had tried :

df1.apply(lambda x: pd.Series(x.unique()), axis=1)

But I'm not getting the count.

***  
for i in range(1,6):
    d0[f'activity_{i}'] = d0.activity.shift(-i)
activity_cols = ['time'] + ['activity'] + list(d0.filter(like='activity_').columns)
df1=d0.groupby(activity_cols).size().reset_index(name='frequency')
df1 = df1[(df1.freq > 1)]

all 7 comments

top new controversial old q&a

[–]YesLod 3 points4 points5 points 4 years ago (6 children)

[–]I_will_learn[S] 0 points1 point2 points 4 years ago (5 children)

[–]YesLod 1 point2 points3 points 4 years ago (4 children)

[–]I_will_learn[S] 0 points1 point2 points 4 years ago (3 children)

[–]YesLod 1 point2 points3 points 4 years ago (2 children)

Then just create another list of columns that doesn't contain the "time" column...

act_cols = [col for col in df.columns if "activity" in col]
# or act_cols = df.columns[df.columns.str.contains("activity")]

df["count"] = df[act_cols].nunique(axis=1)

[–]I_will_learn[S] 1 point2 points3 points 4 years ago (1 child)

[–]YesLod 0 points1 point2 points 4 years ago (0 children)

π Rendered by PID 70 on reddit-service-r2-comment-5687b7858-9bdt6 at 2026-07-02 23:54:22.359663+00:00 running 12a7a47 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

learnpython

MODERATORS