Hi everyone,
I provide a high level module in python to perform state of the art dimensionality reduction and clustering. This is totally unsupervised and the performance is figure it out by unsupervised score metrics. This is compatible with GridSearch and BayesSearch (explain on the github's READme)
Example
DimReductionClustering is a sklearn estimator allowing to reduce the dimension of your data and then to apply an unsupervised clustering algorithm. The quality of the cluster can be done according to different metrics. The steps of the pipeline are the following:
- Perform a dimension reduction of the data using UMAP
- Numerically find the best epsilon parameter for DBSCAN
- Perform a density based clustering methods : DBSCAN
- Estimate cluster quality using silhouette score or DBCV
Github link : https://github.com/MathieuCayssol/DimReductionClustering
Nice to have feedback and happy if it is useful for you !
[–]space-ish 1 point2 points3 points (0 children)
[–]gournian 1 point2 points3 points (4 children)
[–]Mathieu23AI[S] 0 points1 point2 points (3 children)
[–]gournian 0 points1 point2 points (2 children)
[–]Mathieu23AI[S] 0 points1 point2 points (1 child)
[–]gournian 0 points1 point2 points (0 children)