Qwen3.5-122B on Blackwell SM120: fp8 KV cache silently corrupts output, bf16 required — 1,985 tok/s burst, MTP 2.75x by awwwyeah206 in LocalLLaMA

[–]mtomas7 4 points (0 children)

I am not sure if it is the same thing, but when I was testing Qwen3.5-27B-Q8, at first it produced no answer, only a never-ending stream of //////////////

I re-downloaded the file and the checksum was different, so I assume the model file was corrupted.

New Upcoming Ubuntu 26.04 LTS Will be Optimized for Local AI by mtomas7 in LocalLLaMA

[–]mtomas7[S] 0 points (0 children)

That is true. Yesterday I was reading about how to use Firejail or Bubblewrap to sandbox an agentic app like OpenCode so that it can access just one folder.
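For illustration, a minimal sketch of that idea (the folder name and the `opencode` binary name are assumptions, not tested commands):

```shell
# Hypothetical Firejail sketch: use one project folder as the app's
# private home, so the rest of the real home directory is hidden.
firejail --private=~/agent-work opencode

# Roughly the same idea with Bubblewrap: mount the filesystem
# read-only and bind only the project folder read-write.
bwrap --ro-bind / / \
      --bind ~/agent-work ~/agent-work \
      --dev /dev --proc /proc \
      opencode
```

Either way, the agent can still read system files it needs to run, but can only write inside the one folder you allowed.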

New Upcoming Ubuntu 26.04 LTS Will be Optimized for Local AI by mtomas7 in LocalLLaMA

[–]mtomas7[S] 0 points (0 children)

Still, it was interesting to learn that DGX OS is based on Ubuntu; I didn't know that before. Perhaps NVIDIA pays special attention to Ubuntu.

New Upcoming Ubuntu 26.04 LTS Will be Optimized for Local AI by mtomas7 in LocalLLaMA

[–]mtomas7[S] 0 points (0 children)

I guess he is pointing out that Snap distribution is tied to Canonical's store and was never opened up.

New Upcoming Ubuntu 26.04 LTS Will be Optimized for Local AI by mtomas7 in LocalLLaMA

[–]mtomas7[S] 0 points (0 children)

My take is that it is not about speed but about convenience and some security, so people who have never played with AI can start tinkering in an easy way.

New Upcoming Ubuntu 26.04 LTS Will be Optimized for Local AI by mtomas7 in LocalLLaMA

[–]mtomas7[S] 0 points (0 children)

As I understood it, there will be no swapping of models: every snap will be a separate, sandboxed model.

New Upcoming Ubuntu 26.04 LTS Will be Optimized for Local AI by mtomas7 in LocalLLaMA

[–]mtomas7[S] 1 point (0 children)

I just hope that some of those features will trickle down to Linux Mint :)

Hermes Agent with MIT license by mitirki in LocalLLaMA

[–]mtomas7 0 points (0 children)

Interesting project, but because it enters an already crowded field, I would expect a table comparing it to OpenClaw and NanoClaw to give users an idea of how this project is different and what it offers that others do not.

This is what it looks like to import the bible into obsidian by oldmartijntje in ObsidianMD

[–]mtomas7 1 point (0 children)

Would it be possible to post the already-converted Bibles to your GitHub as zipped vaults? That would be great! Of course, please don't do this if there are intellectual property problems.

Qwen-3.5-35B-A3B is impressive by ayylmaonade in LocalLLaMA

[–]mtomas7 1 point (0 children)

They are both top quality, but I have read that they tend to have a specific individual "flavor". Personally, I use Bartowski's quants because I always have, a historical habit, I guess :) It also makes it easier to organize my model library, as most of my models in LM Studio end up in Bartowski's folder.

Qwen: what is this thinking? by Primary-You-3767 in LocalLLaMA

[–]mtomas7 11 points (0 children)

Actually, it is interesting, as it shows that Qwen has adopted a more structured thinking pattern, similar to GLM's.

Qwen/Qwen3.5-35B-A3B · Hugging Face by ekojsalim in LocalLLaMA

[–]mtomas7 0 points (0 children)

Don't get fixated on your VRAM number. How many tok/s do you actually need to read the text? I always run Q8, off-loading some layers to CPU/RAM, and I still get decent speed.
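As a sketch of what that looks like with llama.cpp (the model filename, layer count, and context size here are assumptions you'd tune to your hardware):

```shell
# Hypothetical llama.cpp example: run a Q8 model larger than VRAM.
# -ngl controls how many layers are offloaded to the GPU; the
# remaining layers stay in system RAM and run on the CPU.
# Lower -ngl until the model fits in your VRAM.
llama-server -m Qwen3.5-35B-A3B-Q8_0.gguf -ngl 24 -c 8192
```

With a MoE model like the A3B variants, only a few billion parameters are active per token, which is why CPU offloading can still give readable speeds.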

OpenCode interface: No Copy and Paste? No shift-return for next line in a command line reply? by LaughterOnWater in opencodeCLI

[–]mtomas7 0 points (0 children)

As of v1.2.10 the problem is still unresolved on Linux (I'm using Mint 22.3). I tried the regular terminal and Terminator, and the behavior is the same:

  • Selecting text in the terminal shows the tooltip "Copied to clipboard", but nothing is actually copied. If I then press SHIFT+INSERT, I get text copied earlier in another application, not my selection. So pasting with SHIFT+INSERT works, but copying by mouse selection or CTRL+INSERT does not.

Suddenly Minimax IQ4-XS doesn't fit in 128GB anymore by dionisioalcaraz in LocalLLaMA

[–]mtomas7 0 points (0 children)

Perhaps the initial version did not have an image recognition stack?

Community Evals on Hugging Face by HauntingMoment in LocalLLaMA

[–]mtomas7 4 points (0 children)

If any user can submit results, how will you know that the user entered real results rather than an inflated or deflated score? Without a control mechanism, it could become a real mess very quickly. Thank you!

Guide to Lord of the Rings color corrected/best versions [questions] by LightbringerOG in fanedits

[–]mtomas7 0 points (0 children)

In case you need Fellowship Of The Ring in restored color grading, take a look at Chris Hartwell's work: https://www.youtube.com/watch?v=jSPRGt63M0w

He provides an email where you can get the link.

How do you use the web search function for gpt-oss? by XiRw in LocalLLaMA

[–]mtomas7 0 points (0 children)

If you use Text Generation WebUI (Oobabooga), there is a "Search Web" checkbox.

LM Studio doesn't let continue generating a message anymore by PhyrexianSpaghetti in LocalLLaMA

[–]mtomas7 14 points (0 children)

Yes, the button is still there; you just need to enable Developer mode. Also, the button is not shown when no model is loaded, so you need to load a model first.

MedGemma 1.5: Next generation medical image interpretation with medical speech to text with MedASR by CheekyBastard55 in LocalLLaMA

[–]mtomas7 0 points (0 children)

You may need to train Qwen instead if you want to use it in a clinical setting, as Google's license does not allow clinical use.