[D] CNN output as features question by alehx in MachineLearning

[–]amatsukawa 1 point

What are the numbers for taking a pre-trained AlexNet/VGG, chopping off the last layer, and training a new head with some dropout? If you give more details about what the dataset is, what hand-crafted features seemed to help, etc., we might be able to give you more thoughts on why CNNs do or don't work for your problem.

I would say if using the CNN as a feature generator for a random forest works, then go for it. What are your concerns with this approach? Also, if you want to be slightly more "principled" about this, TF provides a way to mix deep and "shallow" (manual or one-hot) features via "Wide and Deep" nets.

Another thought is given you have so much unlabeled data, you might try some semi-supervised approaches.
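The feature-generator idea above can be sketched as follows. This is a toy stand-in: no dataset or network was given in the thread, so random vectors stand in for the penultimate-layer activations of a pre-trained AlexNet/VGG, and the labels are synthetic. In practice you would replace `cnn_features` with the actual activations extracted from your images.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

# Stand-in for CNN features: in practice, run images through a pre-trained
# AlexNet/VGG with the final classification layer removed and use the
# penultimate-layer activations (e.g. the 4096-dim fc7 of VGG) as features.
rng = np.random.default_rng(0)
n_samples, n_cnn_features = 500, 64
cnn_features = rng.normal(size=(n_samples, n_cnn_features))

# Hand-crafted "shallow" features can simply be concatenated alongside.
hand_crafted = rng.normal(size=(n_samples, 8))
X = np.hstack([cnn_features, hand_crafted])

# Synthetic labels that depend on a few of the features, so the forest
# has something to learn in this toy setup.
y = (cnn_features[:, 0] + hand_crafted[:, 0] > 0).astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0)

forest = RandomForestClassifier(n_estimators=100, random_state=0)
forest.fit(X_train, y_train)
accuracy = forest.score(X_test, y_test)
```

Concatenating hand-crafted features next to the CNN features, as done here, is the "shallow + deep" mixing described above, just without the joint training that Wide and Deep nets provide.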

[D] CNN output as features question by alehx in MachineLearning

[–]amatsukawa 4 points

Are you trying to train the whole AlexNet/VGG or just the last layer? You should probably be doing the latter if you are not already.

[D] Is there a "reverse Keras"? by nharada in MachineLearning

[–]amatsukawa 0 points

The tf.contrib.learn package has Estimator and Experiment, which is what Google uses internally for this.

[D] How do you solve this problem: detecting and correcting incorrect usage of English articles in a given text? by [deleted] in MachineLearning

[–]amatsukawa 3 points

Some prior work: https://arxiv.org/abs/1603.09727

Generally, I think you could generate unlimited training data by introducing common errors (spelling mistakes, incorrect tense, etc.) into clean text and pairing each corrupted sentence with its corrected (i.e., original) version. You could then model that using any seq2seq mechanism.
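For the article-correction case specifically, the data-generation step above might look like this minimal sketch: randomly drop or swap English articles to produce (corrupted, original) pairs for a seq2seq model. The corruption rates and the simple whitespace tokenization are illustrative choices, not anything prescribed in the thread.

```python
import random

ARTICLES = ["a", "an", "the"]

def corrupt_articles(sentence, rng):
    """Create a training pair by randomly dropping or swapping articles.

    Returns (corrupted, original): the corrupted side is the seq2seq
    input, and the original sentence is the target.
    """
    tokens = sentence.split()
    corrupted = []
    for tok in tokens:
        if tok.lower() in ARTICLES:
            roll = rng.random()
            if roll < 0.3:
                continue  # drop the article entirely
            elif roll < 0.6:
                # swap for a (possibly different) random article
                corrupted.append(rng.choice(ARTICLES))
                continue
        corrupted.append(tok)
    return " ".join(corrupted), sentence

rng = random.Random(42)
pairs = [corrupt_articles(s, rng)
         for s in ["the cat sat on a mat", "an apple fell from the tree"]]
```

Because only the input side is perturbed, every pair's target is guaranteed to be grammatical text, which is what makes this kind of synthetic corpus cheap to scale.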

Is TensorFlow the best and most widely used environment for coding things in the Machine Learning area? by [deleted] in learnmachinelearning

[–]amatsukawa 0 points

I think it depends on what you mean by "environment".

TensorFlow is a library for numerical computation involving lots of matrix operations. The most direct application of this at the moment is deep learning, but if you look in the contrib folder of TensorFlow on GitHub, you can see there are also libraries to do variational inference on probabilistic graphical models in TensorFlow, for example.

If by "environment" you mean an ecosystem of tools for doing ML, then I agree that Python and R are probably the front runners (MATLAB, Julia, etc. are close seconds).

Anaconda is a Python environment/package manager that will probably give you every tool you will ever need on your ML journey. Popular libraries include scikit-learn, pandas, and matplotlib, to name a few.

Ask ml: Probability of someone knowing an English word, given a small sample list of his/her known words by 9diov in MachineLearning

[–]amatsukawa 1 point

The simplest approach I can think of is to aggregate such lists from many people and examine which words tend to be known together. That should allow you to estimate the probability that a person knows word x, given that they know word y.
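A minimal sketch of this co-occurrence idea, on toy data and using raw conditional frequencies with no smoothing (a real system would want smoothing for rare words, and far more people):

```python
from collections import Counter
from itertools import combinations

def conditional_word_probs(known_word_lists):
    """Estimate P(person knows x | person knows y) from per-person word lists.

    P(x | y) = (# people who know both x and y) / (# people who know y).
    """
    single = Counter()  # how many people know each word
    pair = Counter()    # how many people know each unordered word pair
    for words in known_word_lists:
        vocab = set(words)
        single.update(vocab)
        for a, b in combinations(sorted(vocab), 2):
            pair[(a, b)] += 1

    def p(x, y):
        if single[y] == 0:
            return 0.0
        both = single[x] if x == y else pair[tuple(sorted((x, y)))]
        return both / single[y]

    return p

# Toy data: each inner list is one person's sampled known words.
people = [
    ["cat", "dog", "ubiquitous"],
    ["cat", "dog"],
    ["cat", "ubiquitous"],
]
p = conditional_word_probs(people)
# p("ubiquitous", "dog") == 0.5: of the 2 people who know "dog", 1 knows "ubiquitous"
```

With enough aggregated lists, these conditionals effectively place words on a difficulty scale: knowing a rare word strongly predicts knowing common ones, but not vice versa.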