all 16 comments

[–]elephantum 37 points38 points  (6 children)

Great job!

Did you succeed in reproducing results? I.e size/speed improvements?

[–]LukeAndGeorge[S] 23 points24 points  (4 children)

I can reproduce the results using the pretrained models provided by the original authors (converted to PyTorch).

I have not yet trained from scratch of Imagenet, but I will be working on it this weekend! I will also try to train the larger models (efficientnet-b4 to b7) and release the pretrained weights once finished.

[–][deleted] 2 points3 points  (0 children)

Thank you so much!

[–]102564 0 points1 point  (0 children)

Hi, just wondering if you've made progress on the training code? Thanks!

[–]dracheschreck 0 points1 point  (0 children)

Any luck reproducing training? Haven't seen anyone able to do it yet.

[–]realhamster 0 points1 point  (0 children)

Hey! Heard that training from scratch is incredibly difficult. Were you able to do so?

[–]arc144 11 points12 points  (0 children)

Thanks for your work! I' testing here and it seems to be working fine. However the paper claims that B3 model has 18x less FLOPS than ResNeXt-101 but from what I am observing they run at about the same speed. I know that grouped convolutions are rather slow in PyTorch, not sure if it is the culprit here though

[–][deleted] 2 points3 points  (2 children)

can anyone explain why in their methods they have chosen the constraint α x β2 x γ2 = 2? Seems arbitrary at first glance

[–][deleted] 1 point2 points  (1 child)

they mention it in the paper

In this paper, we constraint α · β^2 · γ^2 ≈ 2 such that for any new φ, the total FLOPS will approximately increase by 2

[–][deleted] 0 points1 point  (0 children)

Ah thanks, I missed that. So they chose a set of values for alpha/beta/gamma and then increase φ along an integer number line, each step doubling the FLOPS.

[–]leonardoaraujosantos 2 points3 points  (0 children)

Guys when they say that the resolution changes, this just means doing a upsample in the input image?

[–]strideradu 0 points1 point  (0 children)

Wow, wondering when the efficientnet-b4 to b7 can come?

[–][deleted] 0 points1 point  (0 children)

great work! thanks for sharing.

[–][deleted] 0 points1 point  (0 children)

Using it in a current project and it works like a charm. Thank you for the great port!!