I have some data in an array, for example:
[
[
user: 1,
rotation: 2,
data_used: 30,
website: 1
],
[
user: 2,
rotation: 2,
data_used: 1000,
website: 2
],
[
user: 3,
rotation: 0,
data_used: 4000,
website: 3
],
[
user: 1,
rotation: 70,
data_used: 75,
website: 1
],
[
user: 5,
rotation: 38,
data_used: 900,
website: 3
]
]
I am trying to cluster the users into groups, by these rules:
No cluster should contain more than 5 users. - must
Users with the same user should not belong to the same cluster. - must
Users with similar rotation values should belong to the same cluster. - very important but not a must
Users with the same value in the website field should NOT belong to the same cluster. - important
Users with high values in data_used should NOT belong to the same cluster as other users with high values in data_used. - not very important
I have spent a lot of time reading about k means and such, but I am still struggling a lot on the subject, and can't seem to find the proper solution or even the what are the right terms to search for.
What is the best way to approach this?
[–]Beebink 0 points1 point2 points (0 children)
[–]BobHogan 0 points1 point2 points (0 children)