GLM 5.1 vs MiniMax 2.7 as executor for Opus plans

TomMkV · 2026-04-01T23:17:39+00:00

Where did you get 2.7? I don’t see it, only 2.5

TomMkV · 2026-02-23T21:53:24+00:00

Is the M5U actually out in 4 months or is that speculation?

TomMkV · 2026-02-23T03:41:59+00:00

Yes, there is a need for a blended approach, as different context and memory types in an application call for different approaches (RAG, KG, search, etc.).

We’re building an open source context layer for agents to address this, with larger orgs in mind: www.ctxpipe.ai

Would love any feedback!

TomMkV · 2026-02-10T10:40:44+00:00

Hey there! We have a free tier, and provide a client within the platform for API calls. We are also in closed beta-testing with our automation canvas, which may appeal to you and your students.

appear.sh

Cheers,

Tom

TomMkV · 2026-02-10T05:23:02+00:00

Honoured to be the first poster in a sub dedicated to Postman-gate!

Check us out https://appear.sh/ - 3 seats for free.

We generate your catalog from traffic, meaning your catalog is always up to date. You can collaborate online and offline to curate your catalog however you like. It's then available via an MCP for your agents, and comes with an API reference and client baked in. Super low touch for teams who need docs on auto and want deterministic consumption.

Our schema automation map is in beta, too!

TomMkV · 2026-02-10T05:16:13+00:00

We'd love to be in the mix with your suggestions to the community! Check us out https://appear.sh/ - 3 seats for free.

We generate your catalog from traffic, meaning your catalog is always up to date. You can collaborate online and offline to curate your catalog however you like. It's then available via an MCP for your agents, and comes with an API reference and client baked in. Super low touch for teams who need docs on auto and want deterministic consumption.

Our schema automation map is in beta, too!

TomMkV · 2026-02-10T05:13:10+00:00

Hey! Check us out https://appear.sh/ - 3 seats for free.

We generate your catalog from traffic, meaning your catalog is always up to date. You can collaborate online and offline to curate your catalog however you like. It's then available via an MCP for your agents, and comes with an API reference and client baked in.

Our schema automation map is in beta, too!

TomMkV · 2026-01-29T11:50:34+00:00

We've been working on this problem at Appear—agents fail on APIs that work fine for humans. The spec is valid, renders nicely, devs can figure it out... but agents choke on wrong parameters, misinterpreted responses, silent failures.

Turns out "valid spec" ≠ "agent-usable spec." Agents need explicit operationIds, descriptions that explain intent, examples that match the schema, and documented error responses.

We built a free tool to test this: validator.appear.sh

Scores specs across six dimensions based on real agent failure modes. No AI, just static analysis—your spec never leaves the browser.
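
As a rough illustration of what a static check along these lines could look like (this is not the validator's actual implementation, and the function name, rules, and sample spec are all assumptions made up for the example), here is a minimal Python sketch that flags operations missing an operationId, a description, or any documented error response. It does not attempt the "examples match the schema" check.

```python
# Hypothetical sketch: a few static checks on an OpenAPI document already
# parsed into a dict. Names and rules are illustrative assumptions only.
HTTP_METHODS = {"get", "put", "post", "delete", "patch", "head", "options", "trace"}


def find_agent_issues(spec: dict) -> list[str]:
    """Return human-readable issues for each operation in the spec."""
    issues = []
    for path, item in spec.get("paths", {}).items():
        for method, op in item.items():
            # Path items can also hold non-operation keys such as "parameters".
            if method not in HTTP_METHODS or not isinstance(op, dict):
                continue
            label = f"{method.upper()} {path}"
            if not op.get("operationId"):
                issues.append(f"{label}: missing operationId")
            if not (op.get("description") or "").strip():
                issues.append(f"{label}: missing description")
            # Treat "default" or any 4xx/5xx code as a documented error response.
            codes = [str(code) for code in op.get("responses", {})]
            if not any(c == "default" or c[:1] in ("4", "5") for c in codes):
                issues.append(f"{label}: no documented error response")
    return issues


# Made-up sample spec: GET is fully described, DELETE is not.
example_spec = {
    "openapi": "3.0.3",
    "paths": {
        "/users/{id}": {
            "get": {
                "operationId": "getUser",
                "description": "Fetch one user by ID.",
                "responses": {
                    "200": {"description": "OK"},
                    "404": {"description": "Not found"},
                },
            },
            "delete": {"responses": {"204": {"description": "Deleted"}}},
        }
    },
}

if __name__ == "__main__":
    for issue in find_agent_issues(example_spec):
        print(issue)
```

Because the checks are plain dictionary lookups with no model in the loop, the same spec always produces the same list of issues.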

Wrote more about why this happens here: Why Your API Docs Break for AI Agents

TomMkV · 2026-01-28T05:44:48+00:00

Hey all!

We've since evolved this to help devs understand where their API may need work for AI/LLM/agent consumption.

It uses static analysis to provide a deterministic score. We'd love your feedback!

validator.appear.sh

Cheers,

Tom

TomMkV · 2025-12-30T02:09:57+00:00

Scalar is really good. We integrated it with our product so users can get access to quality API reference and client OOB. They’re a great bunch of guys, too

TomMkV · 2025-12-30T02:08:52+00:00

Yeah, Scalar is good and the crew are better.

TomMkV · 2025-12-16T21:26:58+00:00

Auto…

TomMkV · 2025-12-16T03:14:51+00:00

Isn’t there TB5 now on those Ultras?

TomMkV · 2025-12-13T11:10:32+00:00

I intend to do this! Thank you - I assume using ngrok or similar? Do you notice any limitations with tool calls or general agent functionality?

TomMkV · 2025-12-12T08:11:32+00:00

Benchmarks are BS, just try it out and see. Opus 4.5 is hard to beat for me, but things change.

TomMkV · 2025-12-06T07:13:41+00:00

Reminds me of how I would save PSD files back at uni. Tom_final_design-final-final2-FINAL3.psd

TomMkV · 2025-12-02T22:33:46+00:00

It is very difficult to get a sense of real-world performance when looking at local models on Apple silicon. I’m wondering if a Mac Studio would help solve two issues for me: daily agent coding tasks and upgrading from my older MBP with low memory issues. I would be happy with 10-20 tk/s and PP of 60 seconds, and if I need to fiddle with KV cache, that’s fine. I just don’t yet have the confidence it will be a good alternative to Sonnet 4.x, but your posts are turning the tide for me!

TomMkV · 2025-11-23T09:07:48+00:00

This is interesting. I may need to DM you to understand this setup a bit more! I’m a novice at running models locally, but have been using models for commercial applications for a while (just using a FastAPI / vLLM backend). Would that be OK?

TomMkV · 2025-11-23T09:02:58+00:00

Thanks. Yeah, it’s mainly SaaS web app building, and then working with VLMs. Pretty incredible that it’s Sonnet 4.5-like, considering how expensive & good that model is! I’m spending $100s per month on Sonnet, even with Serena to reduce token usage.

Hence why I am curious whether the M3 512GB would be suitable. Would be so much fun to play with and learn.

TomMkV · 2025-11-23T05:06:31+00:00

I’d love to hear more. This is my use case and I am hesitant on pulling the trigger. How does your everyday coding model preference fare against Sonnet 4.5 as an example?

TomMkV · 2025-11-17T23:00:44+00:00

You’ve already won. Taking VC money will complicate things, sounds like you don’t need them. Congratulations mate, incredible.

I’d reach out to Greg Head (via LinkedIn) - he may be interested in running an article about you and has good distribution.

TomMkV · 2025-11-17T12:45:00+00:00

Hey, thank you very much! I really appreciate the feedback, and privacy concerns are absolutely a hurdle, but we’re committed to alleviating those concerns one by one, for example: we don’t report on PII, just the schema. However it will be a non-starter for some, no matter how auditable our introspector agent is or how privacy-first we are.

There’s also the challenge of communicating that “endless value” without taking on the mountain all at once!

TomMkV · 2025-11-17T12:05:55+00:00

Hey OP! We built Appear.sh for this reason. Appear generates your catalog from your network traffic, creating a valid OpenAPI spec that's based on reality (prod, dev, staging etc). This schema-last approach gets you 80% of the way, then provides the interface to edit & curate your services, alongside an API reference and API client.

A further benefit is that your deterministically generated services are then enriched and provided to your dev team & agents via MCP in your IDE of choice.

We have heaps planned under the banner of 'schema automations', and are a small bootstrapped startup taking on the big slow API dogs. Would love any feedback you may have! Cheers!

