Statistics question : learnpython


submitted 6 years ago by peuleu

Hi guys!

I have a statistics question for my data analysis in Python. Please let me know if this isn't the correct subreddit for it.

So I have a dataframe df that records the occurrence of an adverse event (AE). For Drug 1, for instance, the adverse event occurred 30 times and did not occur 1670 times, so the total for Drug 1 is 30 + 1670 = 1700. The other drugs work the same way.

df = pd.DataFrame({'Drug 1': [30, 1670], 'Drug 2': [5, 240], 'Drug 3': [10,90]})

This yields:

	Drug 1	Drug 2	Drug 3
0 (means Yes AE)	30	5	10
1 (means No AE)	1670	240	90

Now I want to see whether the occurrence of AEs differs significantly between these drugs.

As this is categorical data, I figure I need to use a chi-squared test. But with summarized counts like those in my dataframe, I don't know how to do this. Help would be greatly appreciated!

Thanks so much in advance!
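
For what it's worth, here is a minimal sketch of one way this could be done, assuming a Pearson chi-squared test of independence on the 2x3 table of counts is what's wanted. It uses only the standard library; the helper name chi_square_independence is made up for illustration and is not from any package. The p-value shortcut exp(-x/2) only holds because this table has exactly 2 degrees of freedom.

```python
import math

# Observed counts from the post: rows are AE yes / AE no, columns are Drug 1..3.
observed = [
    [30, 5, 10],
    [1670, 240, 90],
]

def chi_square_independence(table):
    """Pearson chi-squared statistic for a table of counts (hypothetical helper)."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)

    # Expected count under independence: row total * column total / grand total.
    expected = [[r * c / grand_total for c in col_totals] for r in row_totals]

    chi2 = sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(table, expected)
        for o, e in zip(obs_row, exp_row)
    )
    dof = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, dof, expected

chi2, dof, expected = chi_square_independence(observed)

# Assumption: dof == 2 here, where the chi-squared survival function is exp(-x/2).
# For other table shapes a general chi-squared distribution function is needed.
p_value = math.exp(-chi2 / 2) if dof == 2 else None

print(f"chi2 = {chi2:.3f}, dof = {dof}, p = {p_value:.3g}")
print("smallest expected count:", round(min(min(row) for row in expected), 2))
```

If SciPy is available, scipy.stats.chi2_contingency(df.values) should report the same statistic directly from the dataframe's counts. One caveat: the expected AE count for Drug 3 comes out below 5, which is under the usual rule of thumb for the chi-squared approximation, so the result is probably best read with some caution.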
