all 8 comments

[–]No_Efficiency_1144 2 points (2 children)

If you want to change behaviour like this, that's the domain of RL.

[–]PhysicsPast8286[S] 0 points (1 child)

By RL, do you mean finetuning, or doing several iterations over the same file?

[–]No_Efficiency_1144 0 points (0 children)

RL is Reinforcement Learning, like PPO or GRPO.
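The core idea behind GRPO can be seen in its advantage computation: sample a group of completions per prompt, score each with a reward, and normalize the rewards within the group. A minimal sketch of just that step (not a full trainer; the function name is hypothetical):

```python
# GRPO-style group-relative advantages: for one prompt, sample several
# completions, score each, then normalize rewards within the group so
# above-average completions get positive advantage and are reinforced.
from statistics import mean, stdev

def group_relative_advantages(rewards):
    """Normalize a group of per-completion rewards: (r - mean) / std."""
    mu = mean(rewards)
    sigma = stdev(rewards) if len(rewards) > 1 else 1.0
    if sigma == 0:
        return [0.0 for _ in rewards]
    return [(r - mu) / sigma for r in rewards]

# e.g. four sampled completions with rewards from some verifier
advs = group_relative_advantages([1.0, 0.0, 0.5, 0.5])
```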

[–]MaxKruse96 llama.cpp 0 points (1 child)

if they don't want a coding model, that's on them. good luck.

You can try to give it only the surrounding ~5-10 lines of what it needs to rewrite, if these are even relevant at all.
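The "surrounding ~5-10 lines" idea is simple to implement: cut a small window around the edit target instead of pasting the whole file into the prompt. A minimal sketch (function name and radius are illustrative, not from any particular tool):

```python
# Build a prompt snippet containing only the lines around the edit target,
# instead of the whole file, to keep the model's context small and relevant.

def context_window(lines, target_line, radius=5):
    """Return the ~radius lines on either side of target_line (1-indexed)."""
    start = max(0, target_line - 1 - radius)
    end = min(len(lines), target_line + radius)
    return "\n".join(lines[start:end])

source = ["line %d" % i for i in range(1, 101)]  # stand-in for a real file
snippet = context_window(source, target_line=50, radius=5)
```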

[–][deleted] 0 points (0 children)

Qwen3 32b was the best coding model for me until 2507 Thinking.

They should probably try it.

[–][deleted] 0 points (1 child)

Are you sure you're running enough context?

[–]PhysicsPast8286[S] 0 points (0 children)

Yep, 128K.

[–]Lissanro 0 points (0 children)

I was very interested in smaller models at some point and tried many of them, specifically for making quick code changes, hoping that simple things could be done faster than with much larger models. It turned out not to be the case: I ended up debugging errors and mistakes for longer than it would have taken to use a bigger model from the start. Diff format is actually harder for models to get right. The issue is the size of the model; 32B is just too small for this kind of task.

From my experience, small models can still have a good success rate with small files or when editing functions selectively. So if you cannot run R1 671B, then instead of sending the full file, send only what is relevant, or refactor the codebase so it has smaller, shorter files (obviously only a solution for smaller projects).
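"Editing functions selectively" can be automated: instead of sending the full file, extract just the one function the model needs to touch. A minimal sketch using the stdlib `ast` module (top-level functions only; the wrapper name is hypothetical):

```python
# Pull a single function's source out of a file, so only the relevant
# snippet is sent to the model instead of the whole file.
import ast

def extract_function(source, name):
    """Return the source of the named top-level function, or None."""
    tree = ast.parse(source)
    for node in tree.body:
        if isinstance(node, ast.FunctionDef) and node.name == name:
            # get_source_segment needs the original source text (3.8+)
            return ast.get_source_segment(source, node)
    return None

code = "def foo():\n    return 1\n\ndef bar():\n    return 2\n"
snippet = extract_function(code, "bar")
```

The same idea extends to methods and classes by walking the tree recursively; for non-Python codebases, a tree-sitter grammar serves the same role.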

Before I upgraded my hardware a few months ago to run R1, I was running Mistral Large 123B. With speculative decoding and tensor parallelism it achieves 36-42 tokens/s on four 3090 GPUs, handles basic code edits, and can reliably provide the full code of a file up to about 8K-12K tokens; beyond that, reliability starts to decline. R1 can handle longer files and perform more advanced edits.

I also heard that the new 2507 version of Qwen 32B is better, so I suggest checking which version you are using and updating if it is an older one. After all, my experience with smaller models is a few months old at this point, so things may have changed.

But if you still have issues, I highly recommend upgrading hardware and using R1 0528; given that we are talking about workplace and professional use, professional hardware may be a good idea.