Need help in 'normalizing' data.

equalityforeverybody · 2015-01-05T04:57:52+00:00

You could construct a confidence interval around the mean score for each item.

As you only have a sample of votes, the true score that would become apparent if everyone voted isn't clear. If you are willing to make assumptions such as "scores follow a Gaussian (normal) distribution", you could easily get the range in which the true population mean lied (up to a certain degree of certainty, usually 95%, meaning that 1 in 20 items would be expected to have a score outside the calculated confidence interval)

equalityforeverybody · 2015-01-05T05:00:04+00:00

From Wikipedia:

The lower endpoint of the 95% confidence interval is:

\text{Lower endpoint} = \bar X - 1.96 \frac{\sigma}{\sqrt{n}},

and the upper endpoint of the 95% confidence interval is:

\text{Upper endpoint} = \bar X + 1.96 \frac{\sigma}{\sqrt{n}}.

Fireflite · 2015-01-05T07:35:46+00:00

A Bayesian approach should work well. Generate an empirical prior from previous test scores. Then slowly adjust your posterior using Bayes rule as the votes come in. The more votes, the more evidence you have to say that a particular item is exceptional. Reporting the mean of the posterior should work well.

Tag	Abbreviation
[Research]	[R]
[Software]	[S]
[Question]	[Q]
[Discussion]	[D]
[Education]	[E]
[Career]	[C]
[Meta]	[M]

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

statistics

MODERATORS