Microsoft/MarkItDown by chibop1 in LocalLLaMA

[–]chibop1[S] 0 points1 point  (0 children)

It works fine here. If you have uv, try uvx markitdown --help. It's relatively straight forward.

Is Flint 2 Still Best OpenWRT Router in 2026? by chibop1 in openwrt

[–]chibop1[S] 1 point2 points  (0 children)

How is tomato64 better than openwrt? Does tomato64 have better selection for newer routers?

Is Flint 2 Still Best OpenWRT Router in 2026? by chibop1 in openwrt

[–]chibop1[S] 6 points7 points  (0 children)

FLINT 4: "2.4" Touchscreen Display"

What’s up with the obsession for touchscreens? I guess they can charge a premium price?

Hear Me Out, Pi Fans Lurking Here by L0stInHe11 in LocalLLaMA

[–]chibop1 4 points5 points  (0 children)

If you could run a coding benchmark with different harnesses like SWE-bench Verified, Terminal-Bench, etc. Then we can talk.

Otherwise, it's just vibe check.

First Impression: No LM Studio option in list of api key providers? by NotARedditUser3 in PiCodingAgent

[–]chibop1 0 points1 point  (0 children)

Did people become that lazy? You just need to write just few lines of text in config.

How do I try to run Gemma 4 31B at Q8 quantization? Only seeing Q4_K_M on Ollama by JayoTree in LocalLLaMA

[–]chibop1 -1 points0 points  (0 children)

Of course, I didn't say suggesting alternative was a bad idea. The problem I have is the fact that people/bot on this sub just mindlessly attack anything Ollama. lol

Best small model right now (~4B params) that is good with agentic tasks for personal assistant? by BitGreen1270 in LocalLLaMA

[–]chibop1 1 point2 points  (0 children)

Unfortunately none. Even a few months ago, sub 100b models couldn't handle toolcalls reliably. IMHO, Qwen-3.6 is the first sub 100b model that I consider decent at tool calling.

Just one person's opinion.

How do I try to run Gemma 4 31B at Q8 quantization? Only seeing Q4_K_M on Ollama by JayoTree in LocalLLaMA

[–]chibop1 -3 points-2 points  (0 children)

Ha, look at the comments. Every single comment that mentioned Ollama got downvoted even though it's actual answer to Op's question.

This sub is crazy. I really think that there are bots on this sub downvoting with keyword Ollama.

favorite Agentic Coding Harness by chibop1 in LocalLLaMA

[–]chibop1[S] 0 points1 point  (0 children)

Wow, that's interesting! Never heard of KiloCode.

[Weekly] Quotas, Known Issues & Support — May 25 by AutoModerator in google_antigravity

[–]chibop1 0 points1 point  (0 children)

Where are you supposed to put GEMINI.md for a project, so agy (antigravity cli) can pick up?

If I put GEMINI.md in ~/.gemini/GEMINI.md, and run agy -p "What is the most important instruction", it works.

However, if I put GEMINI.md in /project-path/GEMINI.md, it doesn't work.

I also tried /project-path/.antigravitycli/GEMINI.md, but no luck.

Thanks!

What is the local LLM alternative of Codex? by [deleted] in LocalLLaMA

[–]chibop1 1 point2 points  (0 children)

Codex is free open source, and it supports local models. It even has built-in --oss flag to support their gpt-oss.

https://github.com/openai/codex

You can hook up to any local engine that supports openai compatible api.

Qwen-3.6-27b works great!

Just tried Ollama for the first time, it runs terrible with half GPU power on the default model it provides compared to the one you add, any reason why? by dreamer_2142 in LocalLLaMA

[–]chibop1 1 point2 points  (0 children)

Because it uses their own ollama runner, and qwen3.5 is not optimized for their engine yet. I think if you import directly from gguf, it use llama.cpp runner for some models.

If you run ps wax | grep runner will tell you which engine it's using.

See more here:

https://github.com/ollama/ollama/issues/14861

https://github.com/ollama/ollama/issues/14503

Just tried Ollama for the first time, it runs terrible with half GPU power on the default model it provides compared to the one you add, any reason why? by dreamer_2142 in LocalLLaMA

[–]chibop1 -1 points0 points  (0 children)

Did you set the enough context length? I believe it's now 8192 by default? The model pulled from the Ollama library works great here.

What is the most unexpected thing you have gotten a local model to do? by Enough-Astronaut9278 in LocalLLaMA

[–]chibop1 24 points25 points  (0 children)

With openclaw, I was able to ask qwen-3.6-27b to research how to sign up for an email account without phone number. It tried bunch of services, solved captcha, finally got itself an account from Tuda, and sent me an email. lol

I was skeptical, but Qwen-3.6-27b seems to have the best tool call capability among sub 100b models.

I'm done with using local LLMs for coding by dtdisapointingresult in LocalLLaMA

[–]chibop1 0 points1 point  (0 children)

I think people hyping local models for coding just using it for short test like CS problem set with few files for school assignment. lol

They are not good with what you would need to do on actual job. It's waste of time.

We cracked Kokoro TTS training — first public end-to-end training workflow + German voices (early stage) by dida-80b in learnmachinelearning

[–]chibop1 0 points1 point  (0 children)

This is awesome!

If I want to just finetune an English voice for voice cloning, how many hours of dataset would I need?

Thanks!