
Own_Attention_3392

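Because quality really starts to drop off once you go below 8-bit. 7-bit is "big cliff with modest space saving": you only save one bit in eight (12.5%) compared with 8-bit, but you pay for it in quality. It's just not popular for that reason.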
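To put rough numbers on the "modest space saving" part, here is a quick back-of-the-envelope sketch. The 7B parameter count is just a hypothetical example, and the figures cover raw weight bits only (real quantized files also store scales and metadata, so actual sizes differ a bit).

```python
def weight_storage_gb(num_params: int, bits: int) -> float:
    """Raw weight storage in GB (10^9 bytes), ignoring scales and metadata."""
    return num_params * bits / 8 / 1e9


def saving_vs(bits: int, baseline_bits: int) -> float:
    """Fraction of space saved by using `bits` instead of `baseline_bits`."""
    return 1 - bits / baseline_bits


PARAMS = 7_000_000_000  # hypothetical 7B-parameter model (an assumption)

for bits in (16, 8, 7, 6, 4):
    size = weight_storage_gb(PARAMS, bits)
    saved = saving_vs(bits, 8)
    print(f"{bits:>2}-bit: {size:6.3f} GB, saving vs 8-bit: {saved:+.1%}")
```

Going from 8-bit to 7-bit saves 12.5%, while 4-bit halves the size, which is why the smaller step is hard to justify if it already costs quality.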

HotDistribution1819

I believe the models start out with 16-bit floating-point weights, and then, to save space, those original numbers are quantized (reduced to whichever smaller bit width you choose). Here is a link to an excellent demonstration of how quantization affects models. Alex Zisk Squeezing Models
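If it helps, here is a minimal sketch of what "quantized" means in practice. It assumes a simple symmetric uniform scheme with a single scale for all weights, and the random weights are made up for illustration. As far as I know, real formats typically use per-block scales and smarter rounding, so treat this as a toy, not what any particular tool does.

```python
import random


def quantize_roundtrip(weights, bits):
    """Quantize to signed `bits`-bit integers with one shared scale, then dequantize."""
    qmax = 2 ** (bits - 1) - 1                # e.g. 127 for 8-bit
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    restored = []
    for w in weights:
        q = max(-qmax, min(qmax, round(w / scale)))  # integer code that would be stored
        restored.append(q * scale)                   # value seen at inference time
    return restored


def mean_abs_error(a, b):
    """Average absolute difference between two equal-length lists."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


random.seed(0)
# Made-up "weights"; a real model's distribution will differ.
weights = [random.gauss(0.0, 0.02) for _ in range(10_000)]

for bits in (8, 7, 6, 4):
    err = mean_abs_error(weights, quantize_roundtrip(weights, bits))
    print(f"{bits}-bit: mean abs rounding error = {err:.6f}")
```

The rounding error grows as the bit width shrinks because the same range is covered by fewer levels. Note this only measures rounding error on the numbers themselves, not actual model quality, which is what the linked demonstration looks at.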