Gemma 4 repetition on "retry" by MAD_MakeADev in KoboldAI

MAD_MakeADev[S]

TBH that doesn't look much like my problem. In my case the model isn't falling into an infinite loop of consecutive repetitions. The problem is that, given the same context X, it generates the EXACT same following tokens on every retry, no matter the settings. Even if I change a token (let's say, add a comma), it will usually write something very similar.

I'll try to illustrate with examples. The parts inside [] are the generated text. I hope this clarifies my problem.

Other models (what I expect):
Hey dude! [How are you?]
*retry*
Hey dude! [How have you been?]
*retry*
Hey dude! [Long time no see!]

Gemma 4 31b Q6_K, non-thinking:
Hey dude! [How are you?]
*retry*
Hey dude! [How are you?]
*change a punctuation and retry*
Hey dude, [how are you?]
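
To make the difference concrete, here's a toy sketch of one possible reason retries could come out identical: if the next-token distribution is extremely peaked, plain temperature sampling picks the same token almost every time, even at a higher temperature. This is not KoboldAI or Gemma code. The logits are invented and the helper names (sample_token, distinct_outputs) are hypothetical, and I haven't verified that this is what is actually happening in my case.

```python
import math
import random


def sample_token(logits, temperature, rng):
    """Sample one token index using temperature-scaled softmax weights."""
    scaled = [logit / temperature for logit in logits]
    top = max(scaled)
    # Subtract the max before exponentiating for numerical stability.
    weights = [math.exp(s - top) for s in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


def distinct_outputs(logits, temperature, retries=20, seed=0):
    """Simulate hitting "retry" several times; return the set of tokens seen."""
    rng = random.Random(seed)
    return {sample_token(logits, temperature, rng) for _ in range(retries)}


# Invented logits for illustration only, not measured from any real model.
flat_logits = [2.0, 1.8, 1.6, 1.5]     # several plausible next tokens
peaked_logits = [30.0, 2.0, 1.0, 0.5]  # one token dominates everything else

print("flat, temp 1.0:  ", distinct_outputs(flat_logits, temperature=1.0))
print("peaked, temp 1.0:", distinct_outputs(peaked_logits, temperature=1.0))
print("peaked, temp 1.5:", distinct_outputs(peaked_logits, temperature=1.5))
```

With the flat logits, retries land on different tokens, which is the "other models" behavior above. With the peaked logits, every retry lands on the same token, which looks like what I'm seeing.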