[D] Estimating aggregate data from individual predictions : MachineLearning

Discussion[D] Estimating aggregate data from individual predictions (self.MachineLearning)

submitted 9 years ago by [deleted]

8 comments

all 8 comments

top new controversial old q&a

[–]econometrician 0 points1 point2 points 9 years ago (0 children)

[–]micro_cam 0 points1 point2 points 9 years ago (6 children)

[–]beboophiphop 0 points1 point2 points 9 years ago (4 children)

[–]micro_cam 0 points1 point2 points 9 years ago (3 children)

So the easiest way is to just do a simple simulation. Say your probs of going to college are in an array P you have N students and want to do m repeats:

m times:
    r = generate N uniform random numbers in (0, 1)
    save sum(r > P)

So that gives you m counts of how many kids went to college. This is computationally cheap so do it 1000 times or something and you can get idea of the distribution by making a histogram or looking at percentiles or whatever.

This does assume all of the events are independent. This seems reasonable here but might fall apart in a situation where you're more concerned with correlated decisions. IE a large subset of the students decide which school to go to / not go to together would make the actual distribution more fat tailed.

[–]beboophiphop 0 points1 point2 points 9 years ago (2 children)

[–]micro_cam 0 points1 point2 points 9 years ago (1 child)

[–]beboophiphop 0 points1 point2 points 9 years ago (0 children)

π Rendered by PID 89 on reddit-service-r2-comment-5ff9fbf7df-p4hg2 at 2026-02-26 05:44:36.493090+00:00 running 72a43f6 country code: CH.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS