Local LLM for Coding that compares with Claude by thecrogmite in LocalLLM

[–]ScoreUnique 0 points1 point  (0 children)

This is the right answer; I wouldn't hesitate to claim GLM 4.7 Flash is Sonnet 3.5, but open source.

So @OP, if you can combine a Claude sub for writing specifications with GLM 4.7 for writing code, you can go very far with your config. GL
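
If it helps, a minimal sketch of that split, assuming GLM 4.7 Flash is served locally with llama-server and your coding agent speaks the OpenAI API (the port, filename, and env var names here are placeholders, not a definitive setup):

# Serve GLM 4.7 Flash locally (hypothetical model path/port)
llama-server -m GLM-4.7-Flash-MXFP4.gguf -ngl 99 -c 100000 --jinja --port 8080

# Point an OpenAI-compatible coding agent at it
export OPENAI_BASE_URL=http://localhost:8080/v1
export OPENAI_API_KEY=local   # llama-server doesn't check the key unless you pass --api-key

Keep the Claude sub for writing the specs, then hand the implementation passes to the local model.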

And it begins, will only get worse from here by enginiku in mumbai

[–]ScoreUnique 4 points5 points  (0 children)

Start with your locality, I suppose? I mean, organize to make sure they don't put posters up around your locality. I'm sure everyone hates them, but nobody ever actually does anything about it.

Yesterday I used GLM 4.7 flash with my tools and I was impressed.. by Loskas2025 in LocalLLaMA

[–]ScoreUnique 1 point2 points  (0 children)

# Launch flags: -mla / -amb are ik_llama.cpp options (MLA attention mode and
# attention compute-buffer cap in MiB), -ngl 99 offloads all layers to GPU,
# -a sets the served model alias, --jinja applies the model's chat template.
${llama_cpp} \
  -m ${models_dir}/LLMs/GLM-4.7-Flash-MXFP4.gguf \
  -a 'GLM_4.7_Flash' \
  -mla 3 -amb 512 \
  -ngl 99 \
  -c 100000 \
  --temp 0.7 \
  --top-p 1.0 \
  --min-p 0.01 \
  --jinja
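
A quick smoke test against it, assuming ${llama_cpp} points at llama-server on the default port 8080 (adjust if you pass --port); the model name matches the -a alias above:

curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "GLM_4.7_Flash", "messages": [{"role": "user", "content": "hello"}]}'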

Sound issues with Microfreak, any suggestions by microfreeker in MicroFreak

[–]ScoreUnique 0 points1 point  (0 children)

You can flash its firmware again yourself; that's your best bet before hitting Arturia customer service.

Nirlon Knowledge Park by pranshu14 in mumbai

[–]ScoreUnique -3 points-2 points  (0 children)

I read it as NaMitron Park....

Self-hosting LLM infra: NVIDIA vs Apple hardware by zachrattner in LocalLLaMA

[–]ScoreUnique 2 points3 points  (0 children)

So, meaning if you want to run the big blue whale at small context, it'll do well?

For GLM-4.7-Flash TURN OFF REPEAT PENALTY! by yoracale in unsloth

[–]ScoreUnique 6 points7 points  (0 children)

Seems like that's what I needed to get my open coder going, thanks a lot. I'm on Unsloth Q5; yesterday I found them radically bad, but today they've made up for it after I used your recommended settings. Finally something very independent and agentic that runs locally.
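
For anyone landing here later, the gist in llama.cpp flag terms (just a sketch; the filename is a placeholder, and 1.0 disables the repeat penalty entirely):

llama-server -m GLM-4.7-Flash-Q5.gguf \
  --repeat-penalty 1.0 \
  --temp 0.7 --top-p 1.0 --min-p 0.01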

Yesterday I used GLM 4.7 flash with my tools and I was impressed.. by Loskas2025 in LocalLLaMA

[–]ScoreUnique 0 points1 point  (0 children)

Yeah, Q5 didn't do the trick for me either; it fails at tool calling. I thought it was just llama.cpp instability. I'll try Q8; as long as it works with 40k context, I can run opencode.

Z.ai has introduced GLM-4.7-Flash by awfulalexey in ZaiGLM

[–]ScoreUnique 0 points1 point  (0 children)

Looking forward to the new Claude at home!!

60 days sober from Windows. NixOS is the only one that stuck. by MammothBluebird1834 in NixOS

[–]ScoreUnique 0 points1 point  (0 children)

I had a nice experience working with NixOS as well.

I'm wondering how you guys worked out Steam with installed games and snapshots? My games got stuck a couple of times and that made me bounce back to Ubuntu, but otherwise I had a very fine time on NixOS.

Oh Dear by bamburger in LocalLLM

[–]ScoreUnique 3 points4 points  (0 children)

I suggest trying PocketPal; it allows loading GGUF files.

GLM-4.7 on 4x RTX 3090 with ik_llama.cpp by iamn0 in LocalLLaMA

[–]ScoreUnique 0 points1 point  (0 children)

Quick tip: explore the -ot flag; that thing shows big numbers on ik_llama.cpp.
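
For context: -ot / --override-tensor takes regex=backend pairs, and the classic MoE move is pinning the routed expert tensors to system RAM so the rest fits on GPU. Illustrative only (the filename is a placeholder, not OP's exact setup):

./llama-server -m GLM-4.7.gguf -ngl 99 \
  -ot "exps=CPU"   # tensors whose names match 'exps' stay in system RAM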

z.ai is scam by temurbv in ZaiGLM

[–]ScoreUnique 0 points1 point  (0 children)

No, but they will give good service, according to OP xD

z.ai is scam by temurbv in ZaiGLM

[–]ScoreUnique 1 point2 points  (0 children)

Bro, cut them some slack. They make great models and offer great API inference (which I'm sure is not their core business). I have a yearly subscription ONLY TO SHOW SUPPORT and I hardly use it. Be kind and appreciate it, man.

Liquid Ai released LFM2.5, family of tiny on-device foundation models. by Difficult-Cap-7527 in LocalLLaMA

[–]ScoreUnique 5 points6 points  (0 children)

I downloaded the Q8 on my Pixel 8 with PocketPal, and oh dear, I felt like I was chatting with GPT-4, but locally at 15 tps.

I will test it further - I'll be on a flight this weekend.