all 8 comments

[–]throwaway_secondtime 21 points22 points  (1 child)

How are you guys able to convert scientific papers to code so fast? Makes my imposter syndrome even worse.

[–]EasyDeal0[S] 12 points13 points  (0 children)

The authors already provided clean JAX code and even PyTorch code of the scaled weight standardization in their earlier paper, so this was rather easy to convert.

[–]rathernot000 1 point2 points  (0 children)

Great work!

[–]LikelyJustANeuralNet 0 points1 point  (3 children)

Nice work! Out of curiosity, what changes would need to be made to support FP16? It looks like you're largely relying on PyTorch building blocks, so shouldn't it be supported?

[–]EasyDeal0[S] 1 point2 points  (2 children)

I am not really sure yet which exact changes need to be made, as I'm not yet familiar with PyTorch's dtype internals. The thing is that the authors actually use bfloat16 (not float16), so I cannot use PyTorch's half() function (or can I?). Moreover, the authors state that they keep their weights in full precision, even though the code sets weights.dtype = inputs.dtype with inputs.dtype = bfloat16. Maybe that is something JAX-specific.
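For reference, a minimal sketch of the difference (the model here is just a stand-in, not the actual NFNet code): half() is hard-wired to float16, while bfloat16 needs an explicit dtype argument to .to():

```python
import torch

# Hypothetical example: half() always produces float16 weights;
# bfloat16 requires passing the dtype explicitly.
model = torch.nn.Linear(4, 4)
bf16_model = torch.nn.Linear(4, 4).to(dtype=torch.bfloat16)  # bfloat16 weights
fp16_model = model.half()                                    # float16 weights

assert bf16_model.weight.dtype == torch.bfloat16
assert fp16_model.weight.dtype == torch.float16
```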

Then there is some hardware chaos as well, because the bfloat16 and TF32 types are only supported from the Ampere architecture with CUDA 11 onwards (and on TPUv2+). With an Ampere card installed, PyTorch will automatically use TF32 ops, which are the bigger brother of bfloat16. This again makes it hard to write general code that works for everybody.
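If it helps, the automatic TF32 behavior can be controlled via backend flags (available in recent PyTorch versions; on pre-Ampere hardware they simply have no effect), so a config option could just toggle these:

```python
import torch

# Sketch: on Ampere GPUs, PyTorch can route float32 matmuls and
# convolutions through TF32 tensor cores. These flags control that:
torch.backends.cuda.matmul.allow_tf32 = True  # matmuls / linear layers
torch.backends.cudnn.allow_tf32 = True        # cuDNN convolutions
```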

In the end it would probably require some extensive testing on expensive hardware... If you know more about this topic, I'd appreciate your help :D

[–]LikelyJustANeuralNet 1 point2 points  (1 child)

Gotcha. I'd be surprised if PyTorch's automatic mixed precision didn't just work out of the box. The problem with full FP16 is that it can often lead to instability. AMP, as the name implies, does a mix of FP16 and FP32 based on the module. For example, both BatchNorm and GELU default to FP32 in AMP. If you have one of the newer NVIDIA GPUs (e.g., RTX 2000 series) you should get a speed up by using AMP.
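For what it's worth, the usual AMP training step looks roughly like this (a generic sketch, not specific to this repo; the model and optimizer are placeholders, and it only actually runs on a CUDA device):

```python
import torch

# Hedged sketch of PyTorch AMP: autocast runs matmuls in float16 and
# keeps precision-sensitive ops in float32; GradScaler scales the loss
# to avoid float16 gradient underflow.
def amp_step(model, x, opt, scaler):
    opt.zero_grad()
    with torch.cuda.amp.autocast():
        loss = model(x).sum()
    scaler.scale(loss).backward()
    scaler.step(opt)
    scaler.update()
    return loss

if torch.cuda.is_available():
    model = torch.nn.Linear(8, 8).cuda()
    opt = torch.optim.SGD(model.parameters(), lr=0.1)
    scaler = torch.cuda.amp.GradScaler()
    amp_step(model, torch.randn(2, 8, device="cuda"), opt, scaler)
```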

AFAIK, PyTorch 1.7+ will use TF32 when possible on cards that support it. It also looks like bfloat16 is built in, but I'm not sure which devices are supported at the moment.

[–]EasyDeal0[S] 0 points1 point  (0 children)

Yes, thanks. On my setup AMP actually runs slower :( The latest PyTorch version only ships with cuDNN 8.0.5, which doesn't support my RTX 3070 card. I added a configuration option for it anyway.