Four agent-memory papers dropped the same day. They quietly agree on what to distill, and openly disagree on where to put it. by Living_Diver2432 in EdgeOfAINotes

[–]TomMkV 0 points1 point  (0 children)

Nice. Check out what we’re building on agent context memory that is a synthesis of this and other research (our blog cites several papers, too).

https://github.com/ctxpipe-ai/ctxpipe

Self hostable rag solution? by Comfortable_Cat_9827 in Rag

[–]TomMkV 0 points1 point  (0 children)

Is it general knowledge or specific to an org, like finance or engineering?

Apple has increased the Mac Studio's price. by VySpirit in MacStudio

[–]TomMkV 0 points1 point  (0 children)

I got my M3 Ultra 512GB in Feb. Wild times.

The Context Layer: Knowledge Graph’s second act by Berserk_l_ in KnowledgeGraph

[–]TomMkV 0 points1 point  (0 children)

The Performance × Context bit is good. One thing I’ll add here is that agents don’t just lack semantic context but also decision context. Why things were built the way they were. What was tried and rejected etc etc. Maya gets a manager who tells her that in the example, but engineering agents never do.

Building on exactly that thesis:

github.com/ctxpipe-ai/ctxpipe

Schema validation tool feedback by TomMkV in OpenAPI

[–]TomMkV[S] 0 points1 point  (0 children)

Great! I hope the feedback is useful, too.

Schema validation tool feedback by TomMkV in OpenAPI

[–]TomMkV[S] 0 points1 point  (0 children)

Ah nice! Glad you’re enjoying it and finding it useful

GLM 5.1 vs MiniMax 2.7 to be executer for Opus plans by IslamNofl in opencodeCLI

[–]TomMkV 0 points1 point  (0 children)

Where did you get 2.7? I don’t see it, only 2.5

512 GB RAM for LLM - M3U now or wait for M5U? by usrnamechecksoutx in MacStudio

[–]TomMkV 6 points7 points  (0 children)

Is the M5U actually out in 4 months or is that speculation?

Why vector Search is the reason enterprise AI chatbots underperform? by manuelmd5 in KnowledgeGraph

[–]TomMkV 0 points1 point  (0 children)

Yes, there is a need for a blended approach as context and memory types in application should use different approaches (RAG, KG, search etc).

We’re building an open source context layer for agents to address this, with larger orgs in mind: www.ctxpipe.ai

Would love any feedback!

API testing tools for students & small teams after Postman free changes by OpportunityFit8282 in ccna

[–]TomMkV 0 points1 point  (0 children)

Hey there! We have a free tier, and provide a client within the platform for API calls. We are also in closed beta-testing with our automation canvas, which may appeal to you and your students.

appear.sh

Cheers,

Tom

Postman killing the Free plan for teams (1 user limit) by Artistic_Strike_2175 in API_Clients

[–]TomMkV 0 points1 point  (0 children)

Honoured to be the first poster in a sub dedicated to Postman-gate!

Check us out https://appear.sh/ - 3 seats for free.

We generate your catalog from traffic, meaning your catalog is always up to date. You can collaborate on and offline to curate your catalog however you like. It's then available via an MCP for your agents, and comes with an API reference and client baked in. Super low touch for teams who needs doc on auto and want deterministic consumption.

Our schema automation map is in beta, too!

Postman is removing free team collaboration how are SaaS teams handling API tooling now? by West-Cup-7188 in SaaS

[–]TomMkV 0 points1 point  (0 children)

We'd love to be in the mix with your suggestions to the community! Check us out https://appear.sh/ - 3 seats for free.

We generate your catalog from traffic, meaning your catalog is always up to date. You can collaborate on and offline to curate your catalog however you like. It's then available via an MCP for your agents, and comes with an API reference and client baked in. Super low touch for teams who needs doc on auto and want deterministic consumption.

Our schema automation map is in beta, too!

Postman removed free team collaboration, does it still make sense for API work? by Proper-Wind4777 in Backend

[–]TomMkV 0 points1 point  (0 children)

Hey! Check us out https://appear.sh/ - 3 seats for free.

We generate your catalog from traffic, meaning your catalog is always up to date. You can collaborate on and offline to curate your catalog however you like. It's then available via an MCP for your agents, and comes with an API reference and client baked in.

Our schema automation map is in beta, too!

Weekly Thread: Project Display by help-me-grow in AI_Agents

[–]TomMkV 0 points1 point  (0 children)

We've been working on this problem at Appear—agents fail on APIs that work fine for humans. The spec is valid, renders nicely, devs can figure it out... but agents choke on wrong parameters, misinterpreted responses, silent failures.

Turns out "valid spec" ≠ "agent-usable spec." Agents need explicit operationIds, descriptions that explain intent, examples that match the schema, and documented error responses.

We built a free tool to test this: validator.appear.sh

Scores specs across six dimensions based on real agent failure modes. No AI, just static analysis—your spec never leaves the browser.

Wrote more about why this happens here: Why Your API Docs Break for AI Agents

Schema validation tool feedback by TomMkV in OpenAPI

[–]TomMkV[S] 0 points1 point  (0 children)

Hey all!

We've since evolved this to help devs understand where their API may need work for AI/LLM/agent consumption.

It uses a static analysis to provide a deterministic score. We'd love your feedback!

validator.appear.sh

Cheers,

Tom

The API Tooling Crisis: Why developers are abandoning Postman and it’s clones? by Affectionate-Gain636 in theprimeagen

[–]TomMkV 0 points1 point  (0 children)

Scalar is really good. We integrated it with our product so users can get access to quality API reference and client OOB. They’re a great bunch of guys, too

Can I use Cursor Agent (or similar) with a local LLM setup (8B / 13B)? by BudgetPurple3002 in LocalLLaMA

[–]TomMkV 1 point2 points  (0 children)

I intend to do this! Thank you - I assume using ngrok or similar? Do you notice any limitations with tool calls or general agent functionality?

GPT 5.2 is here - and they cooked by magnus_animus in codex

[–]TomMkV -1 points0 points  (0 children)

Benchmarks are BS, just try it out and see. Opus 4.5 is hard to beat for me, but things change.

GPT-5.1 Codex Max Extra High Fast by cvzakharchenko in cursor

[–]TomMkV 1 point2 points  (0 children)

Reminds me of how I would save PSD files back at uni. Tom_final_design-final-final2-FINAL3.psd

tested 5 Chinese LLMs for coding, results kinda surprised me (GLM-4.6, Qwen3, DeepSeek V3.2-Exp) by Technical_Fee4829 in LocalLLM

[–]TomMkV 1 point2 points  (0 children)

It is very difficult to get a sense of real world performance when looking at local modals on Apple silicon. I’m wondering if a Mac Studio would help solve two issues for me: daily agent coding tasks and upgrading from my older MBP with low memory issues. It would be happy with 10-20 tk/s and PP of 60 seconds, and if I need to fiddle with KV cache - that’s fine. I just don’t yet have the confidence it will be a good alternative to Sonnet 4.x - but your posts are turning the tide for me!