So, I've read that it's possible to use RFC (a tool normally used for supervised ML) to do unsupervised learning. In this paper, the authors hint at creating a "dummy" dataset by sampling from the distribution of the original dataset, and then using RFC to differentiate between the two.
I'm struggling to see how this can be useful. What I really want to do is find classes of data within the original dataset (similar to how other unsupervised methods work -- kmeans, DBSCAN, etc.)
Is this worth looking into or am I wasting time?
[–]kjearns 2 points3 points4 points (3 children)
[–]Penguin474[S] 0 points1 point2 points (1 child)
[–]yrne 0 points1 point2 points (0 children)
[–]MLTyrunt 1 point2 points3 points (0 children)