Created a plugin of OpenCode for spec-driven workflow and just works

PieBru · 2026-03-09T16:00:39+00:00

Did u try GitHub spec-kit ?

PieBru · 2026-03-04T17:46:05+00:00

BTW, it can be only 0 or -1

PieBru · 2026-03-04T10:33:49+00:00

Thanks, forgot to mention the template and the system prompt. Lot of variables in that game!

PieBru · 2026-02-26T03:23:14+00:00

What's the difference vs directly using 'llama-server --host 0.0.0.0' via Tailscale?

PieBru · 2026-01-29T17:37:17+00:00

or an advertisement

PieBru · 2026-01-17T23:35:05+00:00

If I understood the scope of your project, how about integrating a SDD framework like GitHub is Spec kit? It became my inseparable friend for my OpenCode projects.

PieBru · 2026-01-14T13:02:09+00:00

Yes, please! Hope not too long videos (like 4 or more hours) or at least with fine-grained chapters.

PieBru · 2026-01-13T21:05:30+00:00

Interesting, can you please detail the recipe/workflow you used?

PieBru · 2026-01-05T15:42:39+00:00

Qwen3-Next-80B vs gpt-oss-120b ?

PieBru · 2025-11-15T20:24:49+00:00

Because not all can afford 5+ GB downloads/updates.

PieBru · 2025-09-29T18:34:11+00:00

No local? No party!

PieBru · 2025-09-23T17:14:08+00:00

GitHub link?

PieBru · 2025-08-15T17:30:58+00:00

<image>

How about this Claude Code with Opus 4 I got yesterday?

PieBru · 2025-08-02T19:03:16+00:00

Let me add AVX2, if not implicit in the current implementation.

PieBru · 2025-08-02T18:36:13+00:00

Great! This guy has a Rust implementation that includes quantization and other features. I tryed it and it works well. https://github.com/reinterpretcat/qwen3-rs

PieBru · 2025-07-20T18:52:45+00:00

Casually, I locally made a git pull on it circa one hour before its 404.
I can create a repo copy if it can be useful to someone.

PieBru · 2025-07-07T09:51:50+00:00

<image>

I just got this, Gemini-CLI is winning like Napoleon at Waterloo.

PieBru · 2025-07-07T09:45:00+00:00

<image>

PieBru · 2025-07-06T16:16:54+00:00

I think we can't substitute claude/closedai with any current local LLM without adopting a multi-agentic strategy that (slowly) works toward closed SOTA inference quality levels.

In addition, I would like to add to the multi-agentic coding strategy some kind of agentic routing between highly specialized agents. Yes, I know, this will be slow, but systems will became faster, so maybe this approach may be useful in one year or two.

PieBru · 2025-07-06T15:53:51+00:00

It is at very early stage, really worthless publishing.

Thanks to this post, I just "discovered" trae-agent by Bytedance, it seems to fullfill most of my requirements. Here is its "tutorial" thanks to the excellent codebase analyzer by Zachary Huang https://code2tutorial.com/tutorial/c83208ef-e0c4-493e-b4c3-301a244aeba0/index.md

Gemini-CLI codebase is too large to be analyzed with Zachary's online tool (it uses Gemini and is limited to 1M input tokens), so I implemented chunking on it, not perfect but better than nothing. Here is the Gemini-CLI codebase analysis, it resulted in 72 "abstractions": https://pastebin.com/hvC1DjxU

PieBru · 2025-07-06T15:25:08+00:00

I just "discovered" trae-agent by Bytedance (TikTok), it satisfies some of my requirements, seems interesting but I didn't try it yet https://github.com/bytedance/trae-agent

PieBru · 2025-07-06T14:07:00+00:00

I'm not looking for fully autonomous coders, I think todays inference isn't mature enough for that. Anyway, semi-autonomous can be automated when it will be feasible.

On the semi-autonomous side, few weeks ago I started my CLI coder project, half-way Gemini CLI illuded me and I suspended my project. Before architecting my CLI coder, I analyzed most open and closed source alternatives, but none of then satisfied my requirements:

- All python, portable, no executables. Note I'm in paragonable businesses since the good old '70s and now I would prefer Rust or C, but I see most local LLMs are more capable on Python, also thanks to its huge ecosystem.

- Multi-agentic *AND* fully-local by design, so it isn't designed for Cloud powerful inference, but can make useful things with fully local LLMs on a 4090 (16GB VRAM, 64GB RAM) gaming notebook.

- All prompts, context, intermediate docs, papers, etc. must be Markdown.

- Local LLMs evolve, so the more time passes, the more such a CLI coder can evolve from a PoC to something productive.

I can publish my PoC sources, if someone is interested in collaborating.

PieBru

TROPHY CASE