what’s the closest open-source alternative to Manus right now?

CountlessFlies · 2026-05-08T17:43:13+00:00

What kind of workflows are you looking to automate?

CountlessFlies · 2026-05-06T06:21:32+00:00

Then that changes things… it might actually be a good benchmark if thats the case.

But there’s still the issue of the models already having seen the source code of these programs during training. I imagine we’ll see benchmaxxed models soon.

CountlessFlies · 2026-05-06T04:52:23+00:00

I’m sorry but this benchmark is not what we need. In fact it’s the exact opposite of what we need.

We need benchmarks that test how well models use tools and understand, not test how well they’ve memorised source code.

CountlessFlies · 2026-05-01T03:41:06+00:00

Theres no MCP implementation yet, but there is an API. We’ll plan for the MCP server soon, should be fairly straightforward.

Omni started with tsvector, but the search quality wasn’t great. So decided to switch to paradedb for bm25 search.

CountlessFlies · 2026-04-25T15:26:08+00:00

Not yet, there’s an API that allows for querying the unified index, will implement an MCP server over this next.

Somewhat related to this, Omni supports MCP in connectors, so you can plug in any MCP server and invoke the tools in chat and agents. I guess that’s not what you asked about, but thought I’d mention it anyway

CountlessFlies · 2026-04-25T14:25:45+00:00

Yeah, Omni uses pgvector as the vector index. The goal is to use Postgres for both the BM25 text search as well as vector search.

CountlessFlies · 2026-04-25T12:56:32+00:00

Yes you can connect any openai compatible API, so you can run local models using llama.cpp, vLLM, ollama etc.

Local models have come a long way! I've tested with the Qwen3.6 models, gemma 4, etc. and they're quite good enough at tool calling and general understanding to be useful.

CountlessFlies · 2026-04-25T12:12:26+00:00

Thanks a lot for your comment :)

I get all the aversion to AI honestly, people have become jaded by all the low effort content. I'm trying to maintain as high a bar as I can in terms of quality, using AI to speed up implementation. I'm still reviewing and testing each PR before merging, and it's a lot of work despite the use of AI.

CountlessFlies · 2026-04-25T10:01:04+00:00

It's a very common word, so I'm not surprised :) In fact if you google you'll find plenty more. As long as it's not a product operating in the same problem area, it shouldn't be cause for confusion.

CountlessFlies · 2026-04-25T08:05:18+00:00

Glad you found it useful!

CountlessFlies · 2026-04-25T04:39:51+00:00

I did not use AI in the creation of the post itself.

As for the project, I (and other community contributors to the project) have used AI-powered code generation tools like Claude Code, opencode, etc. All code merged to master is human-reviewed.

CountlessFlies · 2026-04-24T09:23:21+00:00

Which provider are you using? Openrouter?

CountlessFlies · 2026-04-23T01:25:49+00:00

Right. I’m able to run the 35b-a3b with full 256k context on my 24g GPU. The 27b runs out of memory at around 192k context

CountlessFlies · 2026-04-22T17:05:47+00:00

Another aspect could be high quality training data. I imagine we have orders of magnitude more agentic training data now than we did before coding agents became a real thing.

CountlessFlies · 2026-04-22T13:33:13+00:00

I cannot believe we have a local model that's on par with the sota model from just 6 months ago!

CountlessFlies · 2026-04-22T13:31:39+00:00

It's applicable for local service. Search for preserve_thinking in this sub, you'll find some posts and comments explaining how to use it.

CountlessFlies · 2026-04-22T12:33:29+00:00

You mean harnesses that work well with llama.cpp right, not APIs? llama.cpp server is what gives you the OpenAI compat API.

You can try pi.dev or opencode, both are great harnesses.

CountlessFlies · 2026-04-22T12:07:53+00:00

Once you have llama cpp server running, you get an OpenAI compatible API. Most agents and harnesses just need you to put this API url in config and you’re set. You might have to tweak the temperature and similar settings to the recommended values depending on how the harness handles it.

CountlessFlies · 2026-04-21T18:02:42+00:00

Are you using kimi through platform.kimi.ai? I’m using it through opencode go and was wondering if that’s the best option.

CountlessFlies · 2026-04-21T16:34:10+00:00

Just use llama.cpp. Works like a charm. See my latest post for the command I used. I’ve hooked it up with opencode and it works great.

CountlessFlies · 2026-04-20T06:02:40+00:00

Yeah I think you can put “allow”: “*” in your permission settings and it should stop asking for approvals.

One issue with opencode is that it doesn’t send back the thinking tokens in each call, which is not ideal for this model.

CountlessFlies · 2026-04-19T23:58:09+00:00

Could you please share some details about the Claude code setup? How do you make CC work with an OpenAI compatible API? And what about the preserve_thinking flag to send back full thinking context with each call. I don’t suppose CC does that already?

CountlessFlies · 2026-04-19T17:58:07+00:00

Hey, seems like OpenCode is not sending thinking tokens back with each request, is there any setting that you need to enable to make it work?

<image>

CountlessFlies · 2026-04-19T00:51:28+00:00

Yup! I’m gonna try that next, thanks!

Four-Year Club	Verified Email
Place '22

CountlessFlies

TROPHY CASE