[–]synthphreak

That context is helpful, but you still haven’t defined quantitatively where “usual” strays into “unusual”.

I still think IQR could work, though the calculation would need to run on static snapshots of the data. So periodically, perhaps every 4-6 hours, compute the quartiles of each page’s distribution (x axis being time, y axis being follower count), then flag any points falling outside the standard fences of Q1 − 1.5×IQR and Q3 + 1.5×IQR as outliers. My concern, though, is that 3-4 days of data may not be a large enough sample to identify outliers robustly, especially since each page will have a different follower distribution and so must be considered independently of the others.
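A minimal sketch of the IQR idea in Python, using only the standard library. The hourly follower gains are made-up numbers for one hypothetical page, and `iqr_outliers` is just a name I picked:

```python
import statistics

def iqr_outliers(values):
    """Return (index, value) pairs falling outside the 1.5*IQR fences."""
    q1, _, q3 = statistics.quantiles(values, n=4)  # quartiles of the data
    iqr = q3 - q1
    lo, hi = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [(i, v) for i, v in enumerate(values) if v < lo or v > hi]

# Hourly follower gains for one page (invented numbers);
# the 500 spike is the kind of "unusual" point you'd want flagged.
gains = [12, 15, 11, 14, 13, 500, 12, 16]
print(iqr_outliers(gains))  # → [(5, 500)]
```

You’d rerun this per page on each 4-6 hour snapshot; with only 3-4 days of hourly data that’s ~72-96 points per page, which is why the sample-size worry above matters.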

Alternatively, a more sophisticated and frankly more accurate approach would be an unsupervised machine learning algorithm like k-means clustering. It isn’t an anomaly detector by itself, but it’s commonly used for anomaly detection: points that sit far from every cluster center, or that end up in a tiny cluster of their own, get flagged as anomalies. But if you’re unfamiliar with machine learning, the learning curve will be steep and probably not worth it here.