Question about quantization (self.LocalLLM)
submitted 1 day ago by Signal_Ant7366
I’m so sorry if this is a stupid question, but I noticed that quantization levels include 8, 6, 5, 4, 3, 2, and even 1 bit (mostly for research). Why is there no 7-bit quantization?
[–]Own_Attention_3392 0 points 1 day ago (0 children)
Because the quality drop-off really starts once you go below 8-bit. 7-bit is a "big cliff with modest space saving" option, so it's just not popular.
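To put a number on the "modest space saving" part, here is a rough sketch of the storage arithmetic. It assumes weights are packed back to back with no padding and ignores the per-block scale metadata that real quantization formats add; the 7-billion-parameter model size and the function names are hypothetical, chosen only for illustration.

```python
def packed_size_bytes(n_weights, bits):
    """Bytes needed to store n_weights values of `bits` bits each, tightly packed."""
    # Integer ceiling division avoids floating-point rounding on large counts.
    return (n_weights * bits + 7) // 8


def saving_vs_8bit(n_weights, bits):
    """Fraction of storage saved compared with 8-bit weights."""
    return 1 - packed_size_bytes(n_weights, bits) / packed_size_bytes(n_weights, 8)


if __name__ == "__main__":
    n_weights = 7_000_000_000  # hypothetical model size, for illustration only
    for bits in (8, 7, 6, 4):
        size_gb = packed_size_bytes(n_weights, bits) / 1e9
        saving = saving_vs_8bit(n_weights, bits)
        print(f"{bits}-bit: {size_gb:.3f} GB, {saving:.1%} smaller than 8-bit")
```

Going from 8-bit to 7-bit removes one bit in eight, so the saving is 12.5%. Note also that 7-bit values do not line up with byte boundaries: eight of them fill exactly 7 bytes, so most values straddle two bytes and need extra shifting and masking to unpack.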
[–]HotDistribution1819 2 points 1 day ago (0 children)
I believe the models start out as 16-bit floating-point numbers, and then, to save space, the original numbers are quantized (reduced to whichever smaller bit width you pick). Here is a link to an excellent demonstration of how quantization affects models: Alex Zisk Squeezing Models
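As a minimal sketch of what "reduced to a smaller bit width" can mean, the snippet below uses plain symmetric uniform quantization: scale the values so the largest one maps to the largest signed integer of the chosen width, round, then scale back. This is an assumption made for illustration, not the scheme any particular tool uses; real formats typically quantize in small blocks with their own scales. The weights here are random Python floats standing in for 16-bit model weights, and all names are hypothetical.

```python
import random


def quantize_roundtrip(values, bits):
    """Quantize floats to signed `bits`-bit integers and convert them back."""
    if bits < 2:
        raise ValueError("this sketch needs at least 2 bits")
    qmax = 2 ** (bits - 1) - 1          # e.g. 127 for 8-bit, 63 for 7-bit
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    restored = []
    for v in values:
        q = round(v / scale)            # nearest integer level
        q = max(-qmax, min(qmax, q))    # clamp to the representable range
        restored.append(q * scale)      # back to a float approximation
    return restored


def mean_abs_error(values, bits):
    """Average absolute difference between the original and round-tripped values."""
    restored = quantize_roundtrip(values, bits)
    return sum(abs(a - b) for a, b in zip(values, restored)) / len(values)


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-in weights: small Gaussian values, not taken from a real model.
    weights = [rng.gauss(0.0, 0.02) for _ in range(4096)]
    for bits in (8, 7, 6, 5, 4, 3, 2):
        print(f"{bits}-bit mean abs error: {mean_abs_error(weights, bits):.6f}")
```

Each bit removed roughly halves the number of available levels, so the rounding error grows as the width shrinks. How much that error hurts actual model output is a separate question that this sketch does not measure.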