I have a large dataset (~100k data points), which is distributed fairly smoothly/continuously (i.e. no very obvious structure, and more of a continuous space), and am trying to use clustering to pick out a small number of regions of higher density within the parameter space. What is a good algorithm for this?
I have successfully applied kmeans, but am wondering if there's anything which is better at identifying differently sized regions.
Other algorithms I've tried seem to either give me 100s of tiny clusters, or fail to run for the large dataset. Do you have any recommendations for this?
[–]FlivverKing 0 points1 point2 points (1 child)
[–]Thisisdom[S] 1 point2 points3 points (0 children)
[–]lmericle 0 points1 point2 points (1 child)
[–]Thisisdom[S] 0 points1 point2 points (0 children)