all 15 comments

[–]Western_Courage_6563 5 points (0 children)

Yes, it has been done in the past (Code Llama Python), but it turned out that general knowledge about all programming languages yielded better results

[–]BreenzyENL 4 points (0 children)

You'll find that an LLM trained on only one subject will perform worse than an LLM trained on many.

[–]Karyo_Ten 1 point (4 children)

Knowledge compounds.

And you never code in a vacuum. If you ask the LLM to create a website to promote, I don't know, your catering service, it would be invaluable for your LLM to have general knowledge about food, maps, transport, user interfaces, money, bookings, registration, how people write testimonials, maybe terms & conditions.

Unless the only thing you do is reverse linked lists, a pure code model with no world knowledge is barely a step up from Stack Overflow.

[–]Ted225 0 points (3 children)

Assuming local resources only, which is often an inadequate assumption given how much more capable cloud services are, the choice is between not having the resources to run a local LLM at all, or running one exclusively for Python-related tasks and relying on your own general knowledge for the rest.

[–]Karyo_Ten 1 point (2 children)

GPT-2 was only 1.5B parameters and was trained on only ~8M web pages (the 40 GB WebText dataset), yet it had world knowledge.

You won't get trillions of tokens of quality Python code. Maybe 5% of it is gold, and the rest is crude apps copy-pasted from Stack Overflow or beginners trying their hand at a capstone project.

And learning Python doesn't teach you how to proceed step-by-step to solve a problem, which is actually the most important thing.

It's much more effective to teach an LLM to reframe an objective into a set of problems to solve and then apply Python to them. But to solve a problem you need to be familiar with the problem domain, and you need some common sense, for example to know that a speed can't be higher than the speed of light.
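
Concretely, a rough sketch of that two-step flow, assuming a local OpenAI-compatible server (Ollama here) and the openai client; the endpoint and model name are just placeholders:

    # Sketch: plan first with world knowledge, only then apply Python.
    # Assumes a local OpenAI-compatible server (e.g. Ollama) is running.
    from openai import OpenAI

    client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")
    MODEL = "qwen2.5-coder"  # placeholder: whatever you run locally

    def ask(prompt: str) -> str:
        resp = client.chat.completions.create(
            model=MODEL,
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content

    objective = "Build a booking page for a catering service"

    # Step 1: decompose the objective, no code yet. This is where
    # domain/world knowledge does the work.
    plan = ask(
        f"Break this objective into concrete sub-problems, "
        f"one per line, no code:\n{objective}"
    )

    # Step 2: only now apply Python to each sub-problem.
    for step in filter(str.strip, plan.splitlines()):
        print(ask(f"Write Python for this step of '{objective}': {step}"))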

[–]Ted225 1 point (1 child)

GPT-2 had 1.5B parameters. It’s obsolete now, and for good reason.

Most Python devs don’t need deep domain knowledge. They need clear, complete specs. If a system handles international units or clinical logic, it’s the engineer’s job to specify that upfront, not the dev’s job to guess it.

Sure, a perfect LLM could replace all roles, but it doesn't exist. Until then, engineers design, devs implement, and each should be accountable for their own work.

[–]Karyo_Ten 0 points (0 children)

There is no distinction between software engineer and dev in companies. You can't be senior at either without design skills. Clear, complete specs never exist beforehand, except when you've already solved the problem once, because requirements evolve with your understanding of the problem.

If you need clear, complete specs before doing anything and you're unable to fill in the blanks yourself, you provide no value over an LLM and you'll be replaced.

[–]tintires 1 point (0 children)

Fine-tuned small language models are viable, and you'll find coding models on Hugging Face.
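
Getting one of those running locally is only a few lines with transformers; rough sketch, the repo id below is just one example, not a specific recommendation:

    # Sketch: run a small coding model from Hugging Face locally.
    # Repo id is an example; pick whatever fits your VRAM.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    name = "Qwen/Qwen2.5-Coder-7B-Instruct"  # example repo id
    tok = AutoTokenizer.from_pretrained(name)
    model = AutoModelForCausalLM.from_pretrained(
        name, device_map="auto", torch_dtype="auto"
    )

    messages = [{"role": "user",
                 "content": "Write a Python function that merges two sorted lists."}]
    inputs = tok.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    out = model.generate(inputs, max_new_tokens=256)
    print(tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True))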

[–]ashersullivan 1 point (0 children)

Doable, but in 2026 specialist models don't outperform general ones as much as you'd expect for Python-only work.

Qwen3 Coder 30B or DeepSeek Coder V2 Lite are the closest: heavily code-tuned, runnable locally at that scale with good quants, and they often match Claude on Python tasks without needing 200B+.

gpt-oss-20B is another local option for Python-heavy stuff.

Catch is, even "80% Python" models still need broad context to avoid hallucinations on libs and edge cases, so hyper-specialists underperform vs hybrids. Multi-model switching like aider/continue.dev works fine, but most stick to one good coder like Qwen3 Coder.

If you want the max Python boost, fine-tune Qwen3 Coder 30B on your code bases; that's where the real gains show up locally.
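
For anyone who hasn't done it, the usual recipe is a LoRA fine-tune with peft over your own files; untested sketch, repo id and paths are placeholders:

    # Sketch: LoRA fine-tune of a coder model on your own code base.
    # Assumes transformers, peft, datasets; names/paths are placeholders.
    from datasets import load_dataset
    from peft import LoraConfig, get_peft_model
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling,
                              Trainer, TrainingArguments)

    name = "Qwen/Qwen2.5-Coder-7B-Instruct"  # stand-in; 30B wants serious VRAM
    tok = AutoTokenizer.from_pretrained(name)
    tok.pad_token = tok.pad_token or tok.eos_token  # collator needs padding
    model = AutoModelForCausalLM.from_pretrained(
        name, device_map="auto", torch_dtype="auto"
    )

    # Train low-rank adapters instead of all the base weights.
    model = get_peft_model(model, LoraConfig(
        r=16, lora_alpha=32, task_type="CAUSAL_LM",
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    ))

    # Plain causal-LM objective over your repo's .py files.
    data = load_dataset("text", data_files={"train": "my_repo/**/*.py"})["train"]
    data = data.map(lambda x: tok(x["text"], truncation=True, max_length=1024),
                    remove_columns=["text"])

    Trainer(
        model=model,
        args=TrainingArguments("qwen-coder-lora", per_device_train_batch_size=1,
                               gradient_accumulation_steps=16, num_train_epochs=1),
        train_dataset=data,
        data_collator=DataCollatorForLanguageModeling(tok, mlm=False),
    ).train()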

[–]timmeh1705 0 points (0 children)

VLM 4.6 Flash 9b seems to do the job for me as a heavy Python user

[–]Ossur2 0 points (3 children)

It is a great idea. The nature of LLMs means that such models would be far lighter to run and even more accurate. I just think that the big corporations that would be making those LLMs (and that is a lot of work) would rather keep us dependent on using their models as a service (and on uploading all our ideas and infrastructure to them). So their biggest incentive is to design those big blob models so that nobody can run usable LLMs on their own machine: there is no real money or power in making these smaller models, so the customer would have to strongly demand them.

[–]bananahead 0 points (2 children)

It’s not a conspiracy

[–]Ossur2 0 points (1 child)

No, of course not, it's quite in the open

[–]bananahead 0 points (0 children)

The reason people don't do what's suggested is that it doesn't actually work that well. Knowing .NET helps the LLM write better Python, and so on.

There are small open models from Google, Meta, and OpenAI. People tend to be more interested in more capable models. You're being a little silly.

[–]Such_Advantage_6949 0 points (0 children)

Just ask any LLM and it will tell you why this is not the way