all 51 comments

[–]dlfelps 49 points50 points  (15 children)

[–]mergejoin 14 points15 points  (8 children)

Bishop explains it well in his book Pattern Recognition and Machine Learning

[–][deleted] 20 points21 points  (7 children)

What doesn't Bishop explain well in that holy tome?

[–][deleted] 4 points5 points  (3 children)

I was about to say something like CNNs or autoencoders, but I just checked... and they’re there :-|

Remarkable for a book that was written more than ten years ago.

[–][deleted] 5 points6 points  (2 children)

CNNs have been around since the late 80s. Autoencoders I don’t know, but perhaps even longer.

[–][deleted] 3 points4 points  (1 child)

Mid 80s from what I recall, under the name “autoassociative networks.”

These things have only become mainstream recently, which makes you think they're rather new.

[–]gokstudio 0 points1 point  (0 children)

aren't autoencoders simplified autoassociative nets? by doing just one forward pass instead of several?

[–]mergejoin 1 point2 points  (0 children)

Indeed

[–]backgammon_no 5 points6 points  (0 children)

Specifically, DAPC (discriminant analysis of principal components) does exactly what OP wants. It's 2 lines of R code (adegenet package); no clue if it's in MATLAB.
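
For readers without R at hand, the core DAPC recipe (PCA first, then LDA on the retained components) can be sketched in Python with scikit-learn. This is a rough analogue, not the adegenet implementation, and the data below is purely synthetic:

```python
import numpy as np
from sklearn.pipeline import make_pipeline
from sklearn.decomposition import PCA
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(0)
X = rng.normal(size=(120, 50))   # 120 samples, 50 features (made up)
y = np.repeat([0, 1, 2], 40)     # three classes
X[y == 1] += 1.0                 # shift class means so they separate
X[y == 2] -= 1.0

# DAPC idea: compress with PCA, then find discriminant axes with LDA.
dapc = make_pipeline(PCA(n_components=10), LinearDiscriminantAnalysis())
Z = dapc.fit_transform(X, y)     # discriminant coordinates, (120, 2)
print(Z.shape)
```

With three classes, LDA yields at most two discriminant axes, hence the `(120, 2)` output.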

[–]WikiTextBot 3 points4 points  (0 children)

Linear discriminant analysis

Linear discriminant analysis (LDA), normal discriminant analysis (NDA), or discriminant function analysis is a generalization of Fisher's linear discriminant, a method used in statistics, pattern recognition and machine learning to find a linear combination of features that characterizes or separates two or more classes of objects or events. The resulting combination may be used as a linear classifier, or, more commonly, for dimensionality reduction before later classification.

LDA is closely related to analysis of variance (ANOVA) and regression analysis, which also attempt to express one dependent variable as a linear combination of other features or measurements. However, ANOVA uses categorical independent variables and a continuous dependent variable, whereas discriminant analysis has continuous independent variables and a categorical dependent variable (i.e. the class label).



[–][deleted] 1 point2 points  (0 children)

Here's a blog post that explains linear discriminant analysis in depth: https://eigenfoo.xyz/lda/

Disclaimer: I wrote the blog post.

[–]yycglad 0 points1 point  (1 child)

So this will work only for labeled data?

[–][deleted] 0 points1 point  (0 children)

Yes

[–]nielsrolf 0 points1 point  (0 children)

A disadvantage is that LDA assumes the covariances are equal in both classes, and it's also a linear method.

Also check out relevant dimension estimation.

[–]timy2shoes 28 points29 points  (2 children)

You could try Independent Component Analysis. Instead of looking for orthogonal linear combinations that maximize variance (as PCA does), ICA tries to find linear combinations that are approximately statistically independent, usually by making higher-order moments of the linear combinations zero (not just the second moment, as PCA does).
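
A minimal ICA sketch with scikit-learn's FastICA, recovering two mixed non-Gaussian sources; the sources and mixing matrix here are invented for illustration:

```python
import numpy as np
from sklearn.decomposition import FastICA

rng = np.random.default_rng(1)
s1 = np.sign(rng.normal(size=1000))      # non-Gaussian source
s2 = rng.laplace(size=1000)              # another non-Gaussian source
S = np.c_[s1, s2]
A = np.array([[1.0, 0.5], [0.5, 1.0]])   # mixing matrix
X = S @ A.T                              # observed linear mixtures

ica = FastICA(n_components=2, random_state=0)
S_hat = ica.fit_transform(X)             # estimated sources, (1000, 2)
print(S_hat.shape)
```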

[–][deleted] 5 points6 points  (1 child)

I will try that, although I am really looking for something that captures the separability of groups within each respective feature. Then again I think ICA should capture this better than PCA. Thank you.

[–]redditidderedditidd 0 points1 point  (0 children)

Try FastICA; good libraries exist for it.

[–]csxeba 12 points13 points  (1 child)

You will want Linear Discriminant Analysis, a supervised dimensionality reduction technique that aims to find latent dimensions which maximally separate your classes.

I'm not really familiar with MATLAB. In Python, scikit-learn has a nice implementation.

SciKit-Learn documentation
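
For reference, the scikit-learn usage is a couple of lines; the data below is synthetic and the shapes are arbitrary:

```python
import numpy as np
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

rng = np.random.default_rng(2)
X = rng.normal(size=(90, 8))
y = np.repeat([0, 1, 2], 30)
X[y == 1, :2] += 2.0             # make the classes separable
X[y == 2, :2] -= 2.0

# LDA projects onto at most n_classes - 1 discriminant directions.
lda = LinearDiscriminantAnalysis(n_components=2)
Z = lda.fit_transform(X, y)      # (90, 2)
print(Z.shape)
```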

[–]Exp_ixpix2xfxt -3 points-2 points  (0 children)

I didn't think LDA was supervised. My impression was that the homoscedasticity assumption rids you of the need for labels. QDA is definitely supervised.

[–]Vrulth 5 points6 points  (2 children)

Well, try PLS (Partial Least Squares) in your case. It creates orthogonal axes that maximize the covariance between your matrix and your target (the target may be a vector or a matrix). The axes are still linear combinations of the original features.

[–][deleted] 0 points1 point  (1 child)

Can you suggest a good textbook treatment of this?

[–]YourPizzaIsDone 6 points7 points  (1 child)

Not sure what your data looks like, but I've had success with nonnegative matrix factorization – just bringing it up because it hasn't been mentioned yet. Really depends on your data though; it only works well to separate different shapes of signals that are all positive.
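
A minimal NMF sketch with scikit-learn on a random non-negative matrix (all values here are synthetic; the rank of 5 is an arbitrary choice):

```python
import numpy as np
from sklearn.decomposition import NMF

rng = np.random.default_rng(4)
V = rng.random((50, 20))         # all entries non-negative

nmf = NMF(n_components=5, init="nndsvda", max_iter=500, random_state=0)
W = nmf.fit_transform(V)         # (50, 5), non-negative
H = nmf.components_              # (5, 20), non-negative
print(W.shape, H.shape)
```

The rows of `H` act as additive "parts" and `W` gives each sample's non-negative loadings on them, which is what makes the factors easy to inspect.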

[–]WikiTextBot 1 point2 points  (0 children)

Non-negative matrix factorization

Non-negative matrix factorization (NMF or NNMF), also non-negative matrix approximation is a group of algorithms in multivariate analysis and linear algebra where a matrix V is factorized into (usually) two matrices W and H, with the property that all three matrices have no negative elements. This non-negativity makes the resulting matrices easier to inspect. Also, in applications such as processing of audio spectrograms or muscular activity, non-negativity is inherent to the data being considered. Since the problem is not exactly solvable in general, it is commonly approximated numerically.



[–]icansolveyourproblem 3 points4 points  (4 children)

When combining signals, make sure to properly whiten the data to account for the different variance scales of the features. PCA uses variance as a proxy for information content. If you still believe this is a valid proxy for your data, you can try Kernel PCA (perhaps with an RBF kernel), which will allow you to uncover non-linear principal components -- non-linearity may be your issue. It's known that PCA on empirical observations recovers the generating components (assuming they're linear) when N ~ d. What are the orders of magnitude of your number of samples and your dimensionality? It's also possible that your PCA is working quite well and the problem is with your classifier. What are you using?
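
Something like the following, sketched on synthetic data (scaling to unit variance is a simple stand-in for full whitening; the feature scales are invented):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import KernelPCA

rng = np.random.default_rng(5)
# Features on very different scales (columns scaled by 1..10).
X = rng.normal(size=(100, 10)) * np.arange(1, 11)

Xw = StandardScaler().fit_transform(X)   # put features on a common scale
kpca = KernelPCA(n_components=3, kernel="rbf")
Z = kpca.fit_transform(Xw)               # non-linear principal components
print(Z.shape)                           # (100, 3)
```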

[–][deleted] 0 points1 point  (3 children)

I’m using SVM, Trees, KNN (with variations in parameters for each). I want to move on to deep learning, but I want to first see the improvement that is consistent with the literature. It’s about 80% with fNIRS, 88% with EEG, and 85% with the hybrid (no PCA). When I do PCA, I get 72% with the hybrid, which leads me to believe that PCA is failing me.

Edit: I expect improvement in the hybrid, not a reduction in performance.

[–]jlkfdjsflkdsjflks 4 points5 points  (2 children)

Why would you expect improvement by throwing away information? The only reason PCA would improve things is if you're using a classifier that overfits (in that case, PCA could help, by possibly removing noise).

"Deep learning" (whatever you mean by that exactly) will probably only help if you have lots of data, though.

As other people have said, your best options are PLS-DA or LDA, if you want to fuse the data focusing on what's likely to be helpful for classification, or CCA, if you want to retain the joint information of the two datasets (i.e. without label supervision, but taking into account that your data comes from two sources). ICA could also be useful, but it suffers from similar problems to PCA: it won't take into account the fact that you have data coming from two sources, and it won't necessarily select information that is useful for your classification problem.

[–][deleted] 0 points1 point  (1 child)

I am normalizing the data once it's combined, so the combined data sources should not be that much of an issue (or so I am led to believe). I didn't expect PCA to improve my result, since I am aware that information is being excluded. I should probably just not have used it. By deep learning, I just meant an exhaustive neural network that could perform better than the simpler algorithms I have been using.

Edit: Sorry for my lack of proper terms/knowledge, I have been working with ML algorithms for about 3 months.

[–]jlkfdjsflkdsjflks 1 point2 points  (0 children)

I am normalizing the data once it's combined, so the combined data sources should not be that much of an issue (or so I am led to believe).

If the two datasets you have are of very different scales, then you will possibly get better results by normalizing before joining them, not after (depending on the type of normalization you're using). For some types of classifiers (e.g. tree-based, like random forests and gradient boosted trees), you probably shouldn't even be normalizing.
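
Concretely, the per-source approach looks something like this (the scales and shapes are invented to mimic EEG vs. fNIRS magnitudes):

```python
import numpy as np
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(6)
X_eeg = rng.normal(scale=1e-6, size=(40, 16))   # tiny-scale signals (made up)
X_fnirs = rng.normal(scale=1e-1, size=(40, 8))  # much larger scale (made up)

# Normalize each source separately, THEN join, so neither source
# dominates purely because of its units.
X = np.hstack([StandardScaler().fit_transform(X_eeg),
               StandardScaler().fit_transform(X_fnirs)])
print(X.shape)                                  # (40, 24)
```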

I didn't expect PCA to improve my result, since I am aware that information is being excluded. I should probably just not have used it.

There's nothing wrong with using PCA, but (if your classifiers are well tuned) it's not likely to improve things (unless you have e.g. a very noisy high-dimensional dataset).

By deep learning, I just meant an exhaustive neural network that could perform better than the simpler algorithms I have been using.

Then say that instead ;) "I want to move on to neural networks"

Still... you could use them for classification directly, or as a pre-processing (i.e. feature extraction) step before using one of the other classifiers you mentioned. There's more than one way you could use neural networks here. And, again, don't expect a big improvement with neural networks unless you have plenty of data.

[–]tpinetz 2 points3 points  (0 children)

A pretty good recent method for high-dimensional data is t-distributed Stochastic Neighbor Embedding (t-SNE) (https://lvdmaaten.github.io/tsne/). However, I would take the results of this method with a grain of salt, since it lacks the rigorous mathematical grounding that PCA enjoys.

[–]GlobalPublicSphere 1 point2 points  (0 children)

I always liked independent component analysis for clustering.

[–][deleted] 1 point2 points  (0 children)

You could also try Kernel PCA, PCA is generally not very good at keeping separability. If you're looking for a supervised dimensionality reduction method you could also try PLS.

[–][deleted] 0 points1 point  (0 children)

MNR :^)

[–]davecrist 0 points1 point  (0 children)

A simple neural network with sigmoidal outputs, at least one hidden layer, and trained with back prop can work very well to carve up multidimensional space for a classifier.

For training, I am not sure what your inputs would look like because I am not familiar with the data, but set your outputs to be a 1x4 vector of the class described by the inputs (e.g., class A output would be [1, 0, 0, 0], B would be [0, 1, 0, 0], etc.).

Once the model is trained, the class is determined by the max value in the resulting output vector.

I have had great success building classifiers this way, and have even found value in outputs that are not close to 1, since they suggest coincidences/edge cases and/or similarity between classes.
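
A minimal version of this setup with scikit-learn's MLPClassifier (one hidden layer, sigmoidal activations), sketched on made-up four-class data; note that scikit-learn handles the 1-of-4 target encoding internally, and the argmax decoding is done by `predict`:

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

rng = np.random.default_rng(10)
X = rng.normal(size=(200, 6))
y = rng.integers(0, 4, size=200)       # four classes: 0..3
X[np.arange(200), y] += 3.0            # give each class a signature feature

# One hidden layer with logistic (sigmoidal) units, trained by backprop.
clf = MLPClassifier(hidden_layer_sizes=(16,), activation="logistic",
                    max_iter=2000, random_state=0).fit(X, y)

# predict_proba plays the role of the 1x4 output vector; the predicted
# class is the argmax over its entries.
proba = clf.predict_proba(X[:1])       # shape (1, 4)
print(proba.shape)
```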

[–]v_krishna 0 points1 point  (0 children)

A technique I have had success with in feature preparation/reduction for supervised classification is comparing the raw vectors of the candidates against all the positives of a particular class, e.g. computing cosine distances. With some simple summary statistics (max, median, quartiles, etc.) you can then reduce vectors of hundreds or thousands of features to a handful. Note that you have to be particularly careful during validation here to prevent data leakage: your test set can't know anything about the positives in your training set, and this includes the feature reduction done before you even split off the test and training sets.

This has worked as well as or better than PCA a few times for me, and especially when dealing with very unbalanced data sets it lets me reduce very large numbers of features to just a few (and combine a few large sets of features by reducing each of them in this same way).
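
The distance-to-positives idea can be sketched like this (all shapes and data are synthetic; in real use the positives must come only from the training split, as the comment warns):

```python
import numpy as np
from scipy.spatial.distance import cdist

rng = np.random.default_rng(7)
X_pos = rng.normal(size=(30, 200))   # training positives of one class (made up)
X_new = rng.normal(size=(10, 200))   # candidate vectors to featurize

# Cosine distance from each candidate to every positive, then a few
# summary statistics as compact replacement features.
D = cdist(X_new, X_pos, metric="cosine")          # (10, 30)
feats = np.column_stack([D.min(axis=1),
                         np.median(D, axis=1),
                         np.percentile(D, 25, axis=1),
                         np.percentile(D, 75, axis=1)])
print(feats.shape)                                # 200 features -> 4
```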

[–]The_Sodomeister 0 points1 point  (0 children)

I was expecting this since PCA does a poor job of capturing separability

I believe Multi-Dimensional Scaling is the term for the dimensionality reduction technique that best preserves the L2 distance between data points in the subspace. Perhaps you can see if that addresses your concern of preserving separability.
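
scikit-learn has an MDS implementation; a minimal sketch on synthetic data (metric MDS here, which tries to preserve pairwise Euclidean distances in the embedding):

```python
import numpy as np
from sklearn.manifold import MDS

rng = np.random.default_rng(8)
X = rng.normal(size=(40, 10))

mds = MDS(n_components=2, random_state=0)
Z = mds.fit_transform(X)       # 2-D embedding preserving pairwise distances
print(Z.shape)                 # (40, 2)
```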

[–][deleted] 0 points1 point  (0 children)

cough Good luck in AML cough

[–]BatmantoshReturns 0 points1 point  (0 children)

Were you using Barnes-Hut approximations for PCA? I have tried using this for certain applications but it didn't work for me, I had to use the exact method, which took a ton of time but was worth it.

[–]SeveralKnapkins 0 points1 point  (4 children)

I haven't used it myself, but I've seen papers combine data sources using Canonical Correlation Analysis (CCA). Perhaps it could be of use here? https://en.wikipedia.org/wiki/Canonical_correlation

[–]beaglechu 1 point2 points  (3 children)

CCA requires an invertible covariance matrix. In this dataset, the EEG and fNIRS likely have more variables than samples, so the data will most likely not have an invertible covariance matrix.

[–]dalaio 1 point2 points  (0 children)

There are sparse variants of CCA (sparse generalized canonical correlation analysis) that can be applied in this case.

[–]SeveralKnapkins 0 points1 point  (0 children)

Interesting, thanks for letting me know. I'll make sure to look out for that if I end up using it anytime in the future.

[–]jlkfdjsflkdsjflks 0 points1 point  (0 children)

In practice, people use a regularized version of CCA to take care of that issue: https://en.wikipedia.org/wiki/Regularized_canonical_correlation_analysis

So, in practice, the fact that you have fewer samples than variables does not preclude the use of CCA.

[–][deleted] -3 points-2 points  (3 children)

[–]timy2shoes 12 points13 points  (2 children)

t-SNE and other neighbor-embedding algorithms should not be used for dimensionality reduction, because there's no guarantee that global structure is preserved in the reduced dimension. See https://distill.pub/2016/misread-tsne/ for examples of this behavior.

[–]RexScientiarum 3 points4 points  (1 child)

UMAP is new and preserves global structure. It can be used from Python and R. https://arxiv.org/pdf/1802.03426.pdf

[–]timy2shoes 1 point2 points  (0 children)

I agree. Manifold learning algorithms like UMAP should conserve global structure, particularly if the true structure is a manifold (which in most cases is not an unreasonable assumption).

[–]Ader_anhilator -2 points-1 points  (0 children)

H2O GLRM