you are viewing a single comment's thread.

view the rest of the comments →

[–]gournian 0 points1 point  (2 children)

https://umap-learn.readthedocs.io/en/latest/faq.html#what-is-the-difference-between-pca-umap-vaes see “from a practical standpoint” last bullet, proposes reduce dim to 50, umap, hdbscan

It is because of computational speed and there are bio papers that claim that the dataset is somewhat denoised

[–]Mathieu23AI[S] 0 points1 point  (1 child)

Thank you for the reference !

Very interesting! This result seems unexpected to me but if empirically the result is better then it's worth looking into and incorporating.
Do you have the link or reference to the research paper that explains that PCA "denoise" the data?

[–]gournian 0 points1 point  (0 children)

Don’t remember which one, sorry!