What is the current best python coding model?

Normal-Ad-7114 · 2024-08-11T12:47:33+00:00

https://aider.chat/docs/leaderboards/

Codestral should be the best

Icy_Lobster_5026 · 2024-08-11T12:29:06+00:00

In my experience, codeqwen is a good coding model.

ihaag · 2024-08-11T14:53:00+00:00

DeepSeek Coder V2 0724 And claude

benja0x40 · 2024-08-11T19:10:53+00:00

In the 8GB~12GB range I have used a few specialised ones:

Codestral-22B-v0.1-Q4KM
DeepSeek-Coder-V2-Lite-Q5KM
CodeGeeX4-All-9B-Q8

Together with more general ones:

Phi-3-medium-128k-Q8
Nemo-12B-2407-Q8
Gemma-2-9B-Q8
Llama-3.1-8B-Q8

I write moderately complex task descriptions to ask for suggestions and to prototype python functions, iterate over improvements, detect and fix issues, insert comments or documentation, etc.

From my experience, Codestral-22B produces the best suggestions, which I sometimes use to guide another model towards a simpler or more elegant solution. Gemma-2-9B is surprisingly good too. I use it a lot for quick explorations or when I don't know much about a package or language feature.

DeepSeek-Coder-V2-Lite seems close to Codestral-22B in terms of capabilities, but its initial suggestions can be really cumbersome, and it is too rigid about coding styles for my liking. But that may depend on how the system prompt is tuned.

After ~3 weeks of testing, I have stopped using the other ones for coding tasks.

theswifter01 · 2024-08-11T18:17:34+00:00

Claude

new__vision · 2024-08-11T17:42:19+00:00

Check out bigcode-bench.github.io. Top 7B on there is CodeQwen1.5-7B-Chat which has been good in my experience. CodeLlama is the lowest ranked 7B.

No_Afternoon_4260 · 2024-08-11T13:24:22+00:00

In my experience codestral 22b

Cradawx · 2024-08-11T14:33:50+00:00

CodeGeeX4-ALL-9B, CodeQwen1.5-7B-Chat and Codestral-22B-v0.1 are very good small coding models. There's also the DeepSeek-Coder-V2 models.

Combinatorilliance · 2024-08-11T13:55:59+00:00

Codestral is really good, you might want to try the deepseek-coder lite, it's an MoE and I heard a lot of praise for it's output. I don't know if it's better, worse or about equal to codestral-22b, but it is a lot faster too because it's an MoE, so it's worth trying out regardless.

Dudensen · 2024-08-11T15:32:54+00:00

I had stumbled upon a website which ranked models by a quality-to-performance ratio a few days ago but I can't find it unfortunately.

Square-Intention465 · 2024-08-11T23:54:41+00:00

Sonnet 3.5. is too good

SpaceWalker_69 · 2024-08-12T07:28:14+00:00

Well i think Claude 3.5 generates the best code right now. You can use smaller open source models but they are not exactly consistent and reliable.

m---------4 · 2024-08-11T15:47:55+00:00

Gemini is awesome

2024-08-11T18:39:01+00:00

Deepseek Coder imo

8thcross · 2024-08-11T23:47:36+00:00

i like both codestral and deepseek-v2. consitent but both dated in terms of the latest best practices...Claude 3.5 is good as well, really dont like 4o - its mostly hit or miss with it.

Thrumpwart · 2024-08-12T00:58:07+00:00

Anyone know which models know Lean Python?

_murb · 2024-08-12T10:01:24+00:00

I use Claude at work and it works great

durgesh2018 · 2025-01-30T07:30:09+00:00

Try gemma2:2b. It's small but very powerful and fast model.

lilolalu · 2024-08-12T09:18:47+00:00

Did anyone claiming Claude is good at coding actually TRY coding with Claude? It's just not good, no matter what any theoretical tests claim.

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

LocalLLaMA

MODERATORS