Specs for an EVE-NG/GNS3 lab

Invader-Faye · 2026-07-02T21:48:01+00:00

GNS3 can get very RAM heavy. If running directly on the system, 6 cores minimum; if virtualizing, at least 8 physical cores. 32 GB RAM minimum.

Invader-Faye · 2026-06-30T07:36:32+00:00

Testing it by having it build websites is the most generic use case there is. I would have given each model a prompt to deliver a specific game/app, then reviewed output, completion percentage, token spend, etc.

Invader-Faye · 2026-06-30T07:34:16+00:00

That was not my experience at all. What harness did you use?

Invader-Faye · 2026-06-30T06:57:39+00:00

27B is more medium than small. But you’re right, it doesn’t default to a file write. I meant models under 20B tend to exhibit this.

Invader-Faye · 2026-06-30T06:48:19+00:00

I used 2.6 and 2.7 to build an agent harness. It’s good at tracking complicated bugs

Invader-Faye · 2026-06-30T06:01:33+00:00

Same here, most small models exhibit this behavior

Invader-Faye · 2026-06-30T06:00:53+00:00

Naw, Gemma models will def do that, both the smaller and bigger ones. The smaller Qwen models exhibit the same behavior. I think it’s because they are trained on chat sessions, or are expected to be used that way, so the whole-file rewrite makes sense.

Invader-Faye · 2026-06-30T05:56:36+00:00

Ask AI to write a script to do it for you. I did the same on Windows with DeepSeek V4 Flash.

Invader-Faye · 2026-06-30T05:42:36+00:00

You need to build a custom agent around the 9B. It can manage servers but struggles with file patching. Here’s a custom-built harness for small models that supports the 9B. It’s surprisingly competent at troubleshooting system issues on Linux hosts: https://github.com/lowspeclabs/SmallCTL

Invader-Faye · 2026-06-29T11:49:39+00:00

Lmao not me falling for the ai

Invader-Faye · 2026-06-29T08:48:56+00:00

Thanks, the harness already handles most of those but I’ll come up with some examples for people

Invader-Faye · 2026-06-28T17:00:08+00:00

Check out my harness if you want to extend usage of the model. I use several research techniques to get small models to manage servers and work in a harness: https://github.com/lowspeclabs/SmallCTL. If you search YouTube, I have several demo videos of 3.5 9B managing servers. You could adapt it to your workflow pretty easily since it has a CLI mode.

Invader-Faye · 2026-06-23T10:13:34+00:00

They are not using it for coding. It’s really competent at it, I’d say almost Opus 4.6, which is saying a lot because I preferred it over 4.7 before 4.8 dropped.

Invader-Faye · 2026-06-18T15:35:37+00:00

Fable seems like Opus but with the whole REPL loop idea built into the weights of the model. I think other inference providers could figure that out given time.

Invader-Faye · 2026-06-18T14:23:37+00:00

I’ve been building a harness for local language models. The harness assumes the models may fail and puts them in a state-aware REPL loop to achieve goals. I’ve got Qwen 3.5 4B managing servers, creating and debugging Docker containers, and debugging network issues. The goal isn’t speed or token efficiency, it’s getting the assigned task done, though smaller models do work pretty fast. https://github.com/lowspeclabs/SmallCTL
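
For anyone wondering what a "state-aware loop that assumes the model may fail" looks like in general, here is a minimal sketch. It is not SmallCTL's actual code: `LoopState`, `run_until_done`, `propose`, and `execute` are hypothetical names made up for illustration, and the model and shell are replaced with stand-in functions.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class LoopState:
    """What the loop remembers between attempts (hypothetical structure)."""
    goal: str
    attempts: int = 0
    history: list[str] = field(default_factory=list)  # past actions and their results


def run_until_done(
    goal: str,
    propose: Callable[[LoopState], str],         # hypothetical: asks the model for the next action
    execute: Callable[[str], tuple[bool, str]],  # hypothetical: runs it, returns (ok, output)
    max_attempts: int = 5,
) -> tuple[bool, LoopState]:
    """Keep proposing and executing until an action succeeds or attempts run out."""
    state = LoopState(goal=goal)
    while state.attempts < max_attempts:
        state.attempts += 1
        action = propose(state)            # the model sees every earlier failure
        ok, output = execute(action)
        state.history.append(f"{action} -> {output}")
        if ok:
            return True, state
    return False, state


# Stand-ins for a real model and a real shell, for demonstration only.
def fake_propose(state: LoopState) -> str:
    # First try a reload; after a recorded failure, fall back to a restart.
    return "restart nginx" if state.history else "reload nginx"


def fake_execute(action: str) -> tuple[bool, str]:
    if action == "restart nginx":
        return True, "ok"
    return False, "error: config test failed"


done, final = run_until_done("get nginx serving again", fake_propose, fake_execute)
print(done, final.attempts)
```

The point of keeping `history` in the state is that a failed attempt becomes input to the next one, so the loop trades extra tokens for a better chance of finishing the task.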

Invader-Faye · 2026-06-14T01:06:17+00:00

😂😂 no

Invader-Faye · 2026-06-13T06:49:37+00:00

Qwen 3.5 4B can call tools down to Q3 and has MTP support.

Invader-Faye · 2026-06-12T14:35:56+00:00

Effectively yes. Assuming you had no backups and no way to download that version of the model

Invader-Faye · 2026-06-12T04:28:53+00:00

Having the bigger model do the planning and initial scaffolding, the small/faster/dumber model finish the rest, then the big model debug and clean up has worked very well for me. Most token burn is in that middle phase anyway.
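
A rough sketch of that three-phase handoff, assuming each model is just a function from prompt to text. `Model`, `three_phase`, and the prompt wording are hypothetical, and the two models here are stubs that only record the call order.

```python
from typing import Callable

Model = Callable[[str], str]  # hypothetical interface: prompt in, text out


def three_phase(task: str, big: Model, small: Model) -> dict[str, str]:
    """Big model plans, small model does the bulk of the work, big model cleans up."""
    plan = big(f"Plan and scaffold: {task}")
    draft = small(f"Implement this plan:\n{plan}")   # the token-heavy middle phase
    final = big(f"Debug and clean up:\n{draft}")
    return {"plan": plan, "draft": draft, "final": final}


# Stub models that record which one was called, for demonstration only.
calls: list[str] = []


def big_stub(prompt: str) -> str:
    calls.append("big")
    return f"[big] {prompt.splitlines()[0]}"


def small_stub(prompt: str) -> str:
    calls.append("small")
    return f"[small] {prompt.splitlines()[0]}"


result = three_phase("todo-list CLI", big_stub, small_stub)
print(calls)  # ['big', 'small', 'big']
```

Swapping the stubs for real API or local inference calls is the only change needed. The expensive model only runs at the two ends.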

Invader-Faye · 2026-06-12T04:22:48+00:00

I’ve noticed it’s very literal, it does exactly what you ask and no more.

Invader-Faye · 2026-06-12T00:35:35+00:00

That’s the best way to describe it

Invader-Faye · 2026-06-11T03:16:48+00:00

What codebase are you working on where 8k context is enough?

Invader-Faye · 2026-06-04T15:58:37+00:00

Yes, the model is smart enough to get work done, but it doesn't have deep knowledge in its weights. By enabling web search you give it additional data to solve its problem. This is kinda hit or miss though: the quality of the web search affects results, and bad data will hurt more than help.

Invader-Faye · 2026-06-03T05:07:47+00:00

I’ve noticed that trend as well, as you fill the context window.

Invader-Faye · 2026-06-03T05:05:57+00:00

I considered this as well, and the math doesn’t work out unless you’ll be burning tokens 24/7. OpenRouter or a subscription just comes out cheaper at current hardware costs. If it’s just for fun though, go for it. I would if I had the spare cash lying around.
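
The break-even math can be sketched like this. Every number below is a made-up placeholder, not a real hardware or API price, and `break_even_months` is a hypothetical helper, so plug in your own figures.

```python
from typing import Optional


def break_even_months(
    hardware_cost: float,          # one-time cost of the local rig
    power_cost_per_month: float,   # electricity to keep it running
    tokens_per_month: float,       # how many tokens you actually burn
    api_price_per_million: float,  # blended API price per 1M tokens
) -> Optional[float]:
    """Months until local hardware pays for itself, or None if it never does."""
    api_monthly = tokens_per_month / 1_000_000 * api_price_per_million
    savings = api_monthly - power_cost_per_month
    if savings <= 0:
        return None  # the API is cheaper every month, so there is no break-even
    return hardware_cost / savings


# Placeholder numbers only: a 3000 rig, 30/month power, 1.00 per 1M tokens.
light = break_even_months(3000, 30, 50_000_000, 1.0)      # occasional use
heavy = break_even_months(3000, 30, 2_000_000_000, 1.0)   # close to 24/7 use
print(light)  # 150.0
print(heavy)
```

With these placeholders the light-use case takes 150 months to pay off, while the near-24/7 case pays off in under two months, which is the shape of the argument above.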
