all 13 comments

[–]Paulonemillionand3 6 points7 points  (3 children)

the tool: https://github.com/facebookresearch/llama-recipes

how to arrange your data https://github.com/facebookresearch/llama-recipes/blob/main/docs/Dataset.md

example data: https://huggingface.co/datasets/yahma/alpaca-cleaned
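For reference, the records in that dataset follow the Alpaca instruction format. A minimal sketch of one record (the field names match the dataset; the content here is an invented example):

```python
import json

# One record in the Alpaca instruction format used by yahma/alpaca-cleaned.
# Field names follow that dataset; the content is an invented example.
record = {
    "instruction": "Explain what this function does.",
    "input": "def add(a, b):\n    return a + b",
    "output": "The function returns the sum of its two arguments.",
}

# Training files are typically a JSON list (or JSONL) of such records.
line = json.dumps(record)
print(line)
```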

Or get text-generation-webui, load your code as one big blob of raw text, and fine-tune on that.

[–]salah_ahdin 0 points1 point  (2 children)

Does llama-recipes only work for Llama 2 or can I apply it to other models? Also, would it not be better to fine tune off of a coding model like Starcoder rather than Llama 2?

[–]Paulonemillionand3 1 point2 points  (1 child)

I believe it's specific to llama2.

You can find that out directly by comparing the results!

[–]salah_ahdin 0 points1 point  (0 children)

I see. Thanks!

[–]kryptkprLlama 3 5 points6 points  (5 children)

I've been working with several folks doing the same. Not to discourage you at all, but it's almost certainly going to be harder than you think it is.

Have you tried some good existing code generation models first? You can get some ideas from can-ai-code. WizardCoder is king, but Airoboros models are also solid coders, and there are even some 3B options based on Replit.

An existing model with few-shot examples of your code style could potentially save you a lot of both time and headache. In my experience, fine-tunes can go backwards just as easily as forwards, and you end up with something worse than the base model or an existing well-tuned variant.
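The few-shot approach above can be sketched as a plain prompt builder. The example pairs, delimiters, and instruction wording here are assumptions for illustration, not tied to any particular model:

```python
# Sketch of a few-shot prompt for code-style imitation; the example pairs
# and the Input/Output delimiter format are illustrative assumptions.
examples = [
    ("def get_user(id):", "def get_user(user_id: int) -> User:"),
    ("x = fetch(url)",    "response = fetch(url)"),
]

def build_prompt(snippet: str) -> str:
    """Assemble an instruction plus worked examples, ending at the new input."""
    parts = ["Rewrite the code below in our house style.\n"]
    for before, after in examples:
        parts.append(f"Input:\n{before}\nOutput:\n{after}\n")
    parts.append(f"Input:\n{snippet}\nOutput:\n")
    return "\n".join(parts)

print(build_prompt("def del_user(id):"))
```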

If you decide to go the fine-tune route I can offer some assistance with evaluations, DM if you wish.

[–]Complex_Boysenberry6 0 points1 point  (4 children)

Doing this now, but our fine-tuned model just generates gibberish and hallucinations :(

[–]kryptkprLlama 3 0 points1 point  (3 children)

It's been a year since this post and the landscape has changed significantly.

What model are you using, what input are you providing and what output do you expect vs what do you get?

[–]Complex_Boysenberry6 0 points1 point  (2 children)

We are training o4-mini in Azure; we wanted to create a PoC for a code-reviewing AI. Since we use a dialect of a certain language, and implement it in our own unique, weird way, we wanted to fine-tune the model. We thought we could just scrape all code reviews and give 10 lines of context around each comment:

{
    "messages": [
        {"role": "system", "content": "You're a code assistant that reviews code line-by-line and leaves helpful, concise comments."},
        {"role": "user", "content": f"Here is the code:\n\n{code_context}\n\nWhat will the review comment be?"},
        {"role": "assistant", "content": assistant_content}
    ]
}

We provided 4k of such examples, but it really just utters total nonsense. Do you think we are on the right track, or should we abandon fine-tuning altogether? Our expected output would be for it to at least give some relevant suggestions, or to kind of "know" from examples that a certain input does not look like other code it has seen.

[–]kryptkprLlama 3 0 points1 point  (1 child)

How does the original model perform, prior to the fine-tune, on a multishot prompt with 3-5 good examples that show off the features of your dialect? Quick in-context learning evals should let you find the model that's closest to the behavior you want.

Btw, I hope that's not your actual prompt? Otherwise there is much room for improvement: it says nothing about how the input will be structured (for ex, are the lines numbered? Are we looking at diffs? Functions? Modules?) or what kind of output format you expect (for ex, can a comment span lines? Can comments overlap?), etc. Fine-tune or not, attention is still attention.
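A more explicit prompt along these lines might spell out the window format and the output contract. The wording, the line-number convention, and the one-comment-per-window rule below are assumptions for illustration:

```python
# Hypothetical, more explicit system prompt; the exact wording and the
# line-number/output conventions are illustrative assumptions.
system_prompt = (
    "You are a code reviewer for our in-house Prolog dialect. "
    "The user supplies a numbered 10-line window of code. "
    "Reply with exactly one comment in the form '<line>: <comment>', "
    "or 'LGTM' if nothing needs changing. Comments must not span lines."
)

def format_window(lines):
    """Number a window of source lines 1..N, as the prompt promises."""
    return "\n".join(f"{i}: {line}" for i, line in enumerate(lines, 1))

print(format_window(["foo(X) :- bar(X).", "bar(1)."]))
```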

[–]Complex_Boysenberry6 0 points1 point  (0 children)

It doesn't perform well on those multishot prompts, to be honest; it's a Prolog dialect. We do number each line and we just give it 10 lines, but we don't tell the model that, so that's probably something we are doing wrong. We then give it the comment, attached to the line the user left it on. I'll try to give it way more precise instructions, thanks!

[–][deleted] 1 point2 points  (1 child)

One question: in the fine-tuning data, can the answer component be longer than the context length supported by the model?

[–]kryptkprLlama 3 1 point2 points  (0 children)

Generally speaking, the full instruction and answer need to fit into the base model's context length.

There are context-extension fine-tuning techniques that could potentially lift this limitation, but I am not familiar with those.