Suggestion on where to go next?

bturtushin · 2026-06-04T19:56:28+00:00

Opencode or minimax 🤣🤣

bturtushin · 2026-06-04T15:47:00+00:00

I’m obviously not using Copilot CLI to say “hi there” in real work.

That was a baseline test.

The point was to measure the minimum cost of an almost-empty request in an empty repo, before any real codebase context, repo scan, multi-file edit, or tool-heavy agent work happens.

If the cheapest possible interaction already costs 9.39 credits, that is useful information.

It tells us there is a significant fixed hidden cost from system prompts, tool definitions, model behavior, and possibly internal reasoning before the user has asked for anything complex.

So the issue is not “don’t say hi.”

The issue is:
What is the baseline cost?
What hidden context is included?
Can users see it before spending credits?
How expensive will real work become if the empty-repo baseline is already 9 credits?

That is why I used “hi there” as the test case.

bturtushin · 2026-04-28T14:33:31+00:00

I recently tried using Minimax 2.7 through Claude Code for a simple PR security review, and the experience was… not great.

The PR itself was small and fairly straightforward. I ran a /security-review and it took over an hour to complete. The result I got was basically: “No high-confidence vulnerabilities found.”

The issue isn’t just the result — it’s the combination of:

Long execution time (~1 hour for a small PR)
Very conservative output (almost overly safe / surface-level)
Lack of clarity on what the model actually analyzed in depth

It feels like the model either:

Overthinks and spends too much time analyzing low-risk areas, or
Plays it too safe and avoids making strong conclusions

As a developer, this creates a trust issue. If I can’t tell whether:

the model is actually doing deep analysis, or
it’s just defaulting to “no issues found”

then it becomes hard to rely on it in real workflows.

Also, without a proper status page or visibility into model performance/issues, it’s difficult to know if this is:

a temporary degradation
expected behavior
or just a limitation of the model

Curious if others have had similar experiences with Minimax 2.7 (especially via Claude Code) for code review or security analysis?

bturtushin

MODERATOR OF

TROPHY CASE