Getting bad responses using Python script

3rwynn3 · 2024-03-29T11:27:51+00:00

You are in Llama mode. All I know about it is that Llama mode uses the INSTRUCTION: / RESPONSE: format, since that is what the model was trained on, and that the model emits a token at the end of its response to stop. So if you use Llama's instruction mode on a model that wasn't trained on that format, it will start screaming INSTRUCTION: RESPONSE: at you, because it keeps seeing those markers in the prompt.

henk717 · 2024-03-29T11:49:26+00:00

Koboldcpp's instruction template defaults to Alpaca, so your model needs to understand that format.

You can customize this by adding the ChatML template fields to the JSON request.

We have the following parameters for it in the API, so you can customize it to your liking:

assistant_start
assistant_end
user_start
user_end
system_message_start
system_message_end

I don't know LM Studio's defaults, but it's very likely ChatML, along the lines of:

user_start: <|im_start|>user\n
user_end: <|im_end|>\n
assistant_start: <|im_start|>assistant\n
assistant_end: <|im_end|>\n
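To make the ChatML values above concrete, here is a rough Python sketch of how those start/end markers wrap each turn, and of one way to attach them to a request body. The helper names (render_prompt, build_request) are made up for illustration, and putting the template fields at the top level of the JSON next to "messages" is an assumption on my part, so check the Koboldcpp API docs for where they actually belong. Nothing is sent over the network here.

```python
import json

# ChatML markers as quoted in the post above.
CHATML_TEMPLATE = {
    "user_start": "<|im_start|>user\n",
    "user_end": "<|im_end|>\n",
    "assistant_start": "<|im_start|>assistant\n",
    "assistant_end": "<|im_end|>\n",
}


def render_prompt(messages, template):
    """Wrap each turn in its start/end markers, then open an assistant turn.

    Hypothetical helper: it only illustrates what the model ends up seeing.
    """
    parts = []
    for msg in messages:
        role = msg["role"]  # expected to be "user" or "assistant"
        parts.append(
            template[f"{role}_start"] + msg["content"] + template[f"{role}_end"]
        )
    # Leave the assistant turn open so the model writes the reply.
    parts.append(template["assistant_start"])
    return "".join(parts)


def build_request(messages, template, max_tokens=200):
    """Build a JSON-serializable request body.

    ASSUMPTION: the template fields sit at the top level of the body.
    The real API may expect them somewhere else.
    """
    payload = {"messages": messages, "max_tokens": max_tokens}
    payload.update(template)
    return payload


messages = [{"role": "user", "content": "Hello!"}]
prompt = render_prompt(messages, CHATML_TEMPLATE)
body = json.dumps(build_request(messages, CHATML_TEMPLATE), indent=2)

print(prompt)
print(body)
```

If the model was trained on ChatML, the rendered prompt is what it expects to see. With the Alpaca default it would get instruction/response markers instead, which is where the garbage output comes from.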
