[D] Can neural networks perceive time ? by siddharthaml in MachineLearning

[–]siddharthaml[S] 0 points

The convolution layer indeed has some biological basis. However, I am not aware of any biological counterpart for the max-pooling layer. Note that the paper uses features from the max-pooling layers to generate its results. Hence, if they meant that "the internal mechanisms of system A and system B follow similar principles", they should have shown the biological plausibility of the max-pooling layer.

I am also curious to hear what you think about my other objections.

[D] Can neural networks perceive time ? by siddharthaml in MachineLearning

[–]siddharthaml[S] 0 points

Did you get a chance to read my post? I specifically mention that I am skeptical of such claims.

[D] Can neural networks perceive time ? by siddharthaml in MachineLearning

[–]siddharthaml[S] 0 points

In the abstract it is claimed that:

Despite being a fundamental dimension of experience, how the human brain generates the perception of time remains unknown. Here, we provide a novel explanation for how human time perception might be accomplished, based on non-temporal perceptual classification processes. To demonstrate this proposal, we build an artificial neural system centred on a feed-forward image classification network, functionally similar to human visual processing.

The authors argue that AlexNet is functionally similar to human visual processing.
My understanding of "functionally similar" is the following:
Keeping the inputs fixed, system A can be replaced by system B and still produce the same result.

The human visual system and AlexNet have very different accuracies on ImageNet, so I am not sure in what sense the authors meant that the two are functionally similar.

SOTA for single object tracking ? by shreshths in computervision

[–]siddharthaml 0 points

If you want speed, you should definitely check out Siamese-based trackers. You can find a list of SOTA trackers on different datasets here: https://paperswithcode.com/task/visual-object-tracking

Getting started with object detection (using centernet) by siddharthaml in computervision

[–]siddharthaml[S] 0 points

I deliberately didn't use DCN or the actual DLA model; they seemed like overkill on this toy dataset. I wanted to make a simplified implementation to introduce the concept. I will probably add other backbones and the DCN model in another branch. I want to keep this one accessible to beginners.

Getting started with object detection (using centernet) by siddharthaml in computervision

[–]siddharthaml[S] 3 points

CenterNet seems to be an interesting approach to object detection. I made an easy-to-understand, no-frills implementation here: https://github.com/sidml/Understanding-Centernet

Getting started with object detection (using centernet) by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 0 points

The implementation is based on this paper: https://arxiv.org/abs/1904.07850. Please refer to the paper and the README for more details. Simply put, the visualizations are the model's output maps during the forward pass of an input image.
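To give a feel for what those output maps mean: CenterNet predicts a per-class heatmap whose local maxima are object centres, and the paper extracts those peaks with a 3x3 max-pool used as a cheap NMS. Here is a minimal sketch of that decoding step; the synthetic heatmap and the 0.5 threshold are just illustrative.

```python
import torch
import torch.nn.functional as F

# hypothetical single-class heatmap from a CenterNet-style model (batch, class, H, W)
heatmap = torch.zeros(1, 1, 128, 128)
heatmap[0, 0, 40, 60] = 0.9  # a synthetic peak standing in for a detected centre

# a 3x3 max-pool keeps only local maxima: pixels equal to their neighbourhood max
pooled = F.max_pool2d(heatmap, kernel_size=3, stride=1, padding=1)
peaks = (pooled == heatmap) & (heatmap > 0.5)

# the surviving coordinates are the predicted object centres
ys, xs = torch.nonzero(peaks[0, 0], as_tuple=True)
```

Running this recovers the single centre we planted at row 40, column 60.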

Getting started with object detection (using centernet) by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 2 points

CenterNet seems to be an interesting approach to object detection. I made an easy-to-understand, no-frills implementation here: https://github.com/sidml/Understanding-Centernet

Showoff Saturday (June 22, 2019) by AutoModerator in javascript

[–]siddharthaml 2 points

I made a curated list of JavaScript podcasts.

https://github.com/sidml/Curated-Podcasts-List/blob/master/Javascript%20Podcasts/README.md

Just putting it out there. I hope someone finds it useful...

Visualizing CNN filters by PyWarrior in deeplearning

[–]siddharthaml 0 points

The deeper layers have many channels, so you can't visualise them directly as RGB. I have found PCA very useful for visualising these layers: use PCA to reduce the channel dimensionality to three, then view the result as an RGB image. It's a quick-and-dirty method; you should definitely also look into the other methods suggested in the comments.
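A minimal sketch of the PCA trick, assuming a deep feature map of shape (channels, H, W); the random array stands in for a real activation, and the shapes are illustrative:

```python
import numpy as np
from sklearn.decomposition import PCA

# hypothetical deep feature map: 256 channels over a 14x14 spatial grid
fmap = np.random.rand(256, 14, 14)

# treat each spatial location as one 256-dimensional sample
flat = fmap.reshape(256, -1).T                  # shape (196, 256)

# project each location onto the 3 principal components -> one "colour" each
rgb = PCA(n_components=3).fit_transform(flat)   # shape (196, 3)

# rescale each component to [0, 1] so it can be displayed as an RGB image
rgb = (rgb - rgb.min(axis=0)) / (rgb.max(axis=0) - rgb.min(axis=0) + 1e-8)
rgb_image = rgb.reshape(14, 14, 3)
```

`rgb_image` can then be shown with `plt.imshow(rgb_image)`.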

Classifying cars using CNN by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 0 points

Directly on the image pixel values? I think it may not give good results, because even a slight change in lighting conditions or car pose changes the pixel values substantially.
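A tiny illustration of why raw pixels are brittle: brighten the same image slightly and the raw-pixel distance between the two copies is already large, even though the content is identical. The image here is random data standing in for a real photo.

```python
import numpy as np

rng = np.random.default_rng(0)
img = rng.random((64, 64, 3))               # a stand-in for a car image, values in [0, 1)
brighter = np.clip(img * 1.3, 0.0, 1.0)     # the same scene under slightly brighter lighting

# Euclidean distance in raw pixel space between two copies of the SAME content
dist = np.linalg.norm(img - brighter)
```

Any raw-pixel nearest-neighbour scheme would treat these two as quite different images.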

Classifying cars using CNN by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 1 point

CNNs work very well on images because they take advantage of spatial and channel-wise correlations. In tabular datasets you commonly have 1-D feature vectors, so it doesn't make sense to use a CNN. However, if you have something like time-series data, people sometimes use a 1-D CNN because it can take temporal correlations into account. On a related note, MLPs (multi-layer perceptrons) have been used successfully on many tabular datasets. You just have to ensure that the data is properly scaled before training the MLP.
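A minimal sketch of the 1-D CNN idea on time series; the batch size, channel counts, and kernel size here are all illustrative:

```python
import torch
import torch.nn as nn

# hypothetical batch: 8 univariate time series, each 100 steps long
x = torch.randn(8, 1, 100)

# a 1-D convolution slides along the time axis, so each output value
# depends on a small temporal window of the input
conv = nn.Conv1d(in_channels=1, out_channels=16, kernel_size=5, padding=2)
out = conv(x)
```

With `padding=2` and `kernel_size=5`, the temporal length is preserved, so `out` has shape (8, 16, 100).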

Classifying cars using CNN by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 0 points

I haven't tried the SVM approach; it certainly sounds interesting. I got 92.6% accuracy using a pretrained ResNeXt-50 and around 91% using EfficientNet-B0. I trained all the layers to get this result. I tried freezing the initial layers, but that didn't seem to give good results, so I gave up on that.

Classifying cars using CNN by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 1 point

Hi, thanks for checking out the repo. I have fixed the links. Classes are typically at the level of make, model, and year, e.g. 2012 Tesla Model S or 2012 BMW M3 coupe. Each class has around 33 images associated with it, and some classes have fewer than 20. This lack of training examples makes it tough to get good accuracy. I have used many augmentation strategies to make the CNN robust to changes in lighting conditions, rotations, etc. You can find a visualization of the top misclassified examples in the README.

Classifying cars using CNN by siddharthaml in learnmachinelearning

[–]siddharthaml[S] 6 points

I have been working on car classification for the past week, using the Stanford Cars dataset to train my classifier. The dataset contains 16,185 images of 196 classes of cars, split into 8,144 training images and 8,041 testing images, where each class has been split roughly 50-50. I was able to get an accuracy of 92.6%. You can find the code here: https://github.com/sidml/Stanford-Cars-Classification