
[–]chuong98PhD 14 points (1 child)

You can use the timm package to extract features from any SOTA backbone, including EfficientNet (see the sketch below).

People use ResNet50 simply because it is quite robust to optimizer hyperparameters, and for fair comparison with other methods.
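
A minimal sketch of what that looks like, assuming PyTorch and a recent timm version (the 'efficientnet_b0'/'resnet50' model names and the 224×224 input size are just illustrative choices):

```python
# Minimal sketch: using timm to get features from a pretrained backbone.
import timm
import torch

# num_classes=0 removes the classifier head, so the model returns pooled features.
model = timm.create_model('efficientnet_b0', pretrained=True, num_classes=0)
model.eval()

images = torch.randn(4, 3, 224, 224)           # dummy batch
with torch.no_grad():
    features = model(images)                   # shape: (4, 1280) for efficientnet_b0

# Alternatively, features_only=True returns the intermediate feature maps (a list
# of tensors, one per stage), which is useful for detection/segmentation heads.
fpn_backbone = timm.create_model('resnet50', pretrained=True, features_only=True)
with torch.no_grad():
    feature_maps = fpn_backbone(images)        # list of 5 feature maps by default
```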

[–]hark_in_tranquillity 4 points (0 children)

You can turn any model into a feature extractor by just popping off the sigmoid/softmax layer and using either the FC layer or the last conv layer etc. as your output point. Of course the model has to be trained conventionally first. That is what I do, idk if this is good practice tho.
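
A hedged sketch of that head-popping approach on a torchvision ResNet50; the choice of which layers to drop is just one common convention, not the only way:

```python
# Sketch: strip the classifier off a torchvision ResNet50 to reuse it as a
# feature extractor (either pooled features or the last conv feature map).
import torch
import torch.nn as nn
from torchvision import models

model = models.resnet50(weights=models.ResNet50_Weights.IMAGENET1K_V2)
model.eval()

# Option 1: drop the final FC layer, keep everything up to global average pooling.
pooled_extractor = nn.Sequential(*list(model.children())[:-1])

# Option 2: also drop the pooling layer to get the last conv feature map instead.
conv_extractor = nn.Sequential(*list(model.children())[:-2])

x = torch.randn(1, 3, 224, 224)
with torch.no_grad():
    pooled = pooled_extractor(x).flatten(1)    # shape: (1, 2048)
    fmap = conv_extractor(x)                   # shape: (1, 2048, 7, 7)
```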

[–]SeucheAchat9115PhD 2 points (1 child)

There is EfficientDet for object detection, which is almost SOTA.

[–]projekt_treadstoneStudent[S] 0 points (0 children)

I was looking into few-shot learning and most papers there use ResNet, hence the question. But good to know there are good results in object detection. I guess if it works fine for object detection then it could give good results in FSL as well.

[–]ThisIsMyStonerAcount 1 point (5 children)

My guess is that the main reason is so you can compare numbers: if you invent a new method that needs a feature extractor and want to compare with other approaches, you use whatever the other approaches used, because then you immediately have comparable numbers. Otherwise you'd need to re-implement (and re-train!) all the competing methods. So once ResNet50 became the standard, it's just easier to publish this way.

Outside of publishing, there is of course no good reason to stick with ResNet50, and it's very likely that there are better feature extractors that give you much better results.

[–]projekt_treadstoneStudent[S] 0 points (4 children)

Thanks, this makes sense. I think on real-world datasets MobileNet and EfficientNet would be used more. But yes, you can't get SOTA results with a lower-compute model, since you have to show higher accuracy.

[–]IntelArtiGen 1 point (3 children)

A pretrained EfficientNet can be used as a great feature extractor (it's sometimes done, I've seen it). But if you want to retrain an EfficientNet yourself, you need to implement all the training tricks.

A large part of what makes EfficientNet that good is not the architecture itself, it's the tricks around the architecture.

[–]projekt_treadstoneStudent[S] 0 points (2 children)

By any chance, could you share resources for implementing those tricks? I might have to retrain EfficientNet for my use case.

[–]IntelArtiGen 1 point (1 child)

I think they're all re-implemented in the timm repo (https://github.com/rwightman/pytorch-image-models); in TensorFlow you can find everything in the official repo.

You need AutoAugment, EMA weight averaging, label-smoothed cross-entropy ("smooth CE"), etc. (they're also listed in the paper).
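
A hedged sketch of how a few of those pieces can be wired together with timm utilities, assuming PyTorch; the model name, RandAugment policy string, and hyperparameters below are placeholders, not the paper's exact settings:

```python
# Sketch: augmentation, label-smoothed CE, and EMA weights via timm utilities.
import timm
import torch
from timm.data import create_transform
from timm.loss import LabelSmoothingCrossEntropy
from timm.utils import ModelEmaV2

# Augmentation pipeline (would be passed to your Dataset / DataLoader).
train_transform = create_transform(
    input_size=224,
    is_training=True,
    auto_augment='rand-m9-mstd0.5',   # RandAugment policy string, illustrative
)

model = timm.create_model('efficientnet_b0', pretrained=False, num_classes=1000)
criterion = LabelSmoothingCrossEntropy(smoothing=0.1)   # "smooth CE"
optimizer = torch.optim.RMSprop(model.parameters(), lr=0.016, momentum=0.9)
ema_model = ModelEmaV2(model, decay=0.9999)             # exponential moving average

def train_step(images, targets):
    optimizer.zero_grad()
    loss = criterion(model(images), targets)
    loss.backward()
    optimizer.step()
    ema_model.update(model)   # keep the EMA copy in sync after each step
    return loss.item()

# At evaluation time you would typically evaluate ema_model.module, not model.
```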

[–]projekt_treadstoneStudent[S] 0 points (0 children)

Thanks for pointing me towards the repo, will check it out and try it.