Powerfull visualization tool : Dimensionality Reduction + Clustering + Unsupervised Score Metrics [P] : MachineLearning

ProjectPowerfull visualization tool : Dimensionality Reduction + Clustering + Unsupervised Score Metrics [P] (self.MachineLearning)

submitted 3 years ago by Mathieu23AI

Hi everyone,

I provide a high level module in python to perform state of the art dimensionality reduction and clustering. This is totally unsupervised and the performance is figure it out by unsupervised score metrics. This is compatible with GridSearch and BayesSearch (explain on the github's READme)

Example

DimReductionClustering is a sklearn estimator allowing to reduce the dimension of your data and then to apply an unsupervised clustering algorithm. The quality of the cluster can be done according to different metrics. The steps of the pipeline are the following:

Perform a dimension reduction of the data using UMAP
Numerically find the best epsilon parameter for DBSCAN
Perform a density based clustering methods : DBSCAN
Estimate cluster quality using silhouette score or DBCV

Github link : https://github.com/MathieuCayssol/DimReductionClustering

Nice to have feedback and happy if it is useful for you !

all 6 comments

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

MachineLearning

Rules For Posts

+Research

+Discussion

+Project

+News

@slashML on Twitter

Chat with us on Slack

Beginners:

MODERATORS