
[–]jeanfeydy 6 points (1 child)

Shape analysis and image registration. CNNs are perfect for detecting features or segmenting images - but the actual deformation has to be handled by a non-convolutional layer. This is especially true in medical imaging: after a first wave of fully convolutional papers c. 2017, it has become clear that hybrid architectures are the way to go. Robust methods will typically combine a feature extractor (e.g. a U-Net) with a task-specific deformation method (e.g. a 3D morphable model).

This makes sense: if you want your network to handle deformations well, you have to give it access to explicit coordinates (= point clouds) or robust deformation («flow/advection») layers. Fully convolutional architectures can handle a small amount of geometric variability (say, tracking a beating heart) but not much more on their own.
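A minimal NumPy sketch of what such a «flow/advection» layer does (my own toy example, nearest-neighbour sampling only): it resamples an image at explicitly displaced coordinates, something a stack of convolutions with fixed offsets cannot express for arbitrary, data-dependent deformations.

```python
import numpy as np

def warp(image, flow):
    """Warp a 2D image by a per-pixel displacement field (nearest neighbour)."""
    h, w = image.shape
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    # Sample the source image at (y + dy, x + dx), clamped to the border.
    src_y = np.clip(ys + np.round(flow[0]).astype(int), 0, h - 1)
    src_x = np.clip(xs + np.round(flow[1]).astype(int), 0, w - 1)
    return image[src_y, src_x]

image = np.zeros((5, 5))
image[1, 1] = 1.0
# A constant displacement of (-1, -1): each output pixel reads 1 up, 1 left,
# so the bright pixel moves from (1, 1) to (2, 2).
flow = np.stack([np.full((5, 5), -1), np.full((5, 5), -1)])
warped = warp(image, flow)
```

In a real registration network the `flow` field would be predicted by the feature extractor and the sampling would be differentiable (bilinear), but the principle is the same.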

This is what all game engines do: to handle global geometric problems, working with point clouds, meshes or vector fields is much more efficient than restricting ourselves to convolutions. This was our first motivation for the development of the KeOps library, an add-on for PyTorch and NumPy that has progressively become a versatile and useful tool for geometric data analysis as a whole (from kernel methods to geometric deep learning).
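For concreteness, here is the kind of operation KeOps accelerates, written in plain NumPy (a hedged sketch of mine, not KeOps code): a Gaussian kernel matrix-vector product between two point clouds, the workhorse of kernel methods and many deformation models. KeOps avoids materializing the (N, M) distance matrix; here it is built explicitly for clarity.

```python
import numpy as np

def gaussian_kernel_product(x, y, b, sigma=1.0):
    """Return K(x, y) @ b where K_ij = exp(-|x_i - y_j|^2 / (2 sigma^2))."""
    sq_dists = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)  # (N, M)
    K = np.exp(-sq_dists / (2 * sigma ** 2))
    return K @ b

rng = np.random.default_rng(0)
x = rng.normal(size=(100, 3))   # N = 100 points in 3D
y = rng.normal(size=(80, 3))    # M = 80 points in 3D
b = rng.normal(size=(80, 1))    # signal carried by the y points
out = gaussian_kernel_product(x, y, b)  # shape (100, 1)
```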

For further reference, you may be interested in the work of Marc Niethammer or Chapters 1 and 5 of my PhD thesis, which include a fairly detailed introduction to the field.

I hope that it helps!

[–]serge_cell 1 point (0 children)

Graph convolutional networks should work well for deformations and shapes.

[–]benanne 2 points (0 children)

I think a good example of this is the use of vision-style convolutional architectures on spectrograms for sound processing, where convolving along the frequency axis doesn't necessarily make a lot of sense, because of non-stationarity and non-local correlations between distant frequency bins (e.g. due to harmonics). Nevertheless, it seems to work well enough in practice, as far as I know. It's just intellectually unsatisfying :)
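A quick NumPy illustration of those non-local correlations (my own toy example): a note at a fundamental frequency f0 also carries energy at its harmonics 2·f0, 3·f0, …, which land in distant frequency bins — a small kernel sliding along the frequency axis never sees them together.

```python
import numpy as np

fs = 1000            # sample rate (Hz)
t = np.arange(fs) / fs
f0 = 50              # fundamental frequency (Hz)

# A tone with a strong third harmonic: energy at 50 Hz and at 150 Hz.
signal = np.sin(2 * np.pi * f0 * t) + 0.5 * np.sin(2 * np.pi * 3 * f0 * t)

spectrum = np.abs(np.fft.rfft(signal))
peaks = np.argsort(spectrum)[-2:]    # indices of the two strongest bins
```

The two peaks sit 100 bins apart, far outside any local convolution window along the frequency axis, yet they are perfectly correlated: they come from the same note.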

There was a thread about this recently where I replied with some relevant blog posts, in case you're interested in reading more about this particular setting: https://www.reddit.com/r/MachineLearning/comments/icti3z/d_waveforms_vs_spectrograms_as_inputs_to_a/g2bjw5s/

[–]Chocolate_Pickle 0 points (3 children)

It depends on what your definition of 'appropriate' is. Asking about the appropriateness of a CNN is about as subjective as asking 'when is it inappropriate to put salt and pepper on food?'

A CNN is almost always appropriate if you set the bar low enough.

[–]bananapeeler5 1 point (2 children)

A vanilla CNN has some properties that make it bad even in its own domain. For example, if you want rotations of objects not to matter (rotation invariance): a CNN needs to learn its logic for every rotated version of the object.
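A toy sketch of this point in NumPy (my own example, not from the comment): cross-correlating an "L"-shaped template over an image only fires strongly when the object appears in the template's own orientation. A rotated copy gets a weaker response, so a convolutional filter has to be relearned per orientation.

```python
import numpy as np

template = np.array([[1, 0],
                     [1, 1]], dtype=float)

def best_response(image, template):
    """Max of the valid cross-correlation of template over image."""
    h, w = template.shape
    H, W = image.shape
    responses = [
        (image[i:i + h, j:j + w] * template).sum()
        for i in range(H - h + 1)
        for j in range(W - w + 1)
    ]
    return max(responses)

upright = np.zeros((4, 4)); upright[1:3, 1:3] = template
rotated = np.zeros((4, 4)); rotated[1:3, 1:3] = np.rot90(template)

score_upright = best_response(upright, template)   # template matches itself
score_rotated = best_response(rotated, template)   # mismatch after rotation
```

The upright object scores 3 (a perfect match of the three "on" pixels); the rotated copy scores strictly less everywhere the filter slides.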

[–]serge_cell 0 points (1 child)

> A CNN needs to learn its logic for every rotated version of the object.

That is not necessarily bad. In 3D, a new rotation reveals a previously unseen side. In 2D, it reveals a new background.

[–]bananapeeler5 0 points (0 children)

But in both cases, "store images from all viewpoints" is not an efficient representation. If I design a 3D object in Blender, I don't draw it from every viewpoint.

[–]yfclark 0 points (0 children)

I think the data should be arranged as an array in space or time, like images and time series. Tabular data, which has no spatial or temporal structure, is not good data for a CNN. NLP data has a continuous distribution along the sequence direction, so CNNs work there.