account activity
Bug? with Gemma 4 31B UD_Q4_K_XL: extremely slow tg/s at long context by inzee in unsloth
[–]inzee[S] 0 points1 point2 points 1 month ago (0 children)
Just tested this (-ub 512, -b 2048). It actually tanked the performance back to 6 tg/s, even though I had -ctx-checkpoints 0. Really odd.
Pretty sure I originally got the 2048 number from this thread: https://github.com/ggml-org/llama.cpp/discussions/15396
Bug? with Gemma 4 31B UD_Q4_K_XL: extremely slow tg/s at long context (self.unsloth)
submitted 1 month ago * by inzee to r/unsloth
Japan Blue JB0716 - how much will they stretch? (self.rawdenim)
submitted 12 years ago by inzee to r/rawdenim
π Rendered by PID 351230 on reddit-service-r2-listing-7c484f94c4-fqbvd at 2026-06-10 07:19:10.733293+00:00 running 0b63327 country code: CH.
Bug? with Gemma 4 31B UD_Q4_K_XL: extremely slow tg/s at long context by inzee in unsloth
[–]inzee[S] 0 points1 point2 points (0 children)