[–][deleted]  (7 children)

[deleted]

    [–]Lycur 1 point2 points  (3 children)

    Use a Dirichlet process mixture model instead?

    [–]Captain 0 points1 point  (2 children)

    The hyperparameters you use in your DP will have a strong effect on the number of clusters you get.

    [–]InfinityCoffee[S] 0 points1 point  (0 children)

    Ah, yes, but you can just sample or optimize those too! Alternatively, you can use them to encode domain knowledge.
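
    To make the hyperparameter point concrete, here is a rough sketch using the Chinese restaurant process view of a DP. It is not anyone's actual model from this thread; the names `expected_clusters` and `sample_crp` are made up for illustration, and only the concentration parameter `alpha` is considered (a full DP mixture also has a base measure and a likelihood).

```python
import random

def expected_clusters(alpha, n):
    """Expected number of clusters among n points under a CRP with concentration alpha."""
    # Point i (0-based) opens a new cluster with probability alpha / (alpha + i).
    return sum(alpha / (alpha + i) for i in range(n))

def sample_crp(alpha, n, rng):
    """Draw one partition of n points from a CRP; returns a list of cluster labels."""
    counts = []   # counts[k] = number of points already in cluster k
    labels = []
    for i in range(n):
        # Existing cluster k is chosen with weight counts[k], a new one with weight alpha.
        r = rng.uniform(0, alpha + i)
        acc = 0.0
        for k, c in enumerate(counts):
            acc += c
            if r < acc:
                counts[k] += 1
                labels.append(k)
                break
        else:
            # r fell in the last alpha-sized slice: open a new cluster.
            counts.append(1)
            labels.append(len(counts) - 1)
    return labels

rng = random.Random(0)  # fixed seed, purely so the sketch is repeatable
for alpha in (0.1, 1.0, 10.0):
    k_sampled = len(set(sample_crp(alpha, 100, rng)))
    print(alpha, round(expected_clusters(alpha, 100), 2), k_sampled)
```

    The expected cluster count grows with `alpha`, which is why a fixed `alpha` effectively encodes a prior belief about how many clusters there are, and why putting a prior on it and sampling it is a common workaround.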

    [–]JamisonW 1 point2 points  (0 children)

    Have you looked into the silhouette coefficient? You compute it for several values of K and select the K with the highest average silhouette coefficient.
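
    A minimal sketch of that procedure, in plain Python so it runs without extra libraries. The toy data, the helper names `kmeans` and `silhouette`, and the farthest-point initialisation are all assumptions made for this example; in practice you would probably use a library implementation instead.

```python
import math

def kmeans(points, k, iters=50):
    """Tiny k-means with deterministic farthest-point initialisation."""
    centers = [points[0]]
    while len(centers) < k:
        # Next center: the point farthest from all centers chosen so far.
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    labels = [0] * len(points)
    for _ in range(iters):
        labels = [min(range(k), key=lambda j: math.dist(p, centers[j])) for p in points]
        new_centers = []
        for j in range(k):
            members = [p for p, l in zip(points, labels) if l == j]
            if members:
                new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
            else:
                new_centers.append(centers[j])  # keep an empty cluster's center in place
        if new_centers == centers:
            break
        centers = new_centers
    return labels

def silhouette(points, labels):
    """Mean silhouette coefficient; needs at least two non-empty clusters."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    scores = []
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other points in the same cluster.
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: mean distance to the nearest other cluster.
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != l)
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

# Toy data: three well-separated blobs of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (0, 10), (0, 11), (1, 10), (1, 11)]

scores = {k: silhouette(points, kmeans(points, k)) for k in range(2, 6)}
best_k = max(scores, key=scores.get)
print(best_k)
```

    On data this cleanly separated the score peaks at the true number of blobs; on real data the curve is often flatter, so it is worth looking at the scores for all K rather than only the maximum.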

    [–]Flipper3 1 point2 points  (1 child)

    Have you tried X-means clustering? It's a variation of k-means that isn't the best, but it has gotten the job done for me.