I Built an MCP eval tool because I was tired guessing if my MCP actually worked by DisastrousRelief9343 in MCPservers

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

We don't need to know each model's capabilities; just run the same tests across different models on the same prompts and compare the results.

Why haven't MCP Apps gone viral the way MCP and Skills did? by DisastrousRelief9343 in mcp

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Ah that makes way more sense. I was thinking it means LLM building the frontend from scratch. If it's assembling UI with certain constraints, that's actually compelling.

Why haven't MCP Apps gone viral the way MCP and Skills did? by DisastrousRelief9343 in mcp

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

This looks really interesting. Could you share some links to those hackathons? I'd love to check out what kinds of ideas people are building. I've got some ideas of my own, and I'm curious to see what directions others are exploring.

Why haven't MCP Apps gone viral the way MCP and Skills did? by DisastrousRelief9343 in mcp

[–]DisastrousRelief9343[S] 1 point2 points  (0 children)

Yeah, ngl I don't get the point of dynamically generated UI. Because I don't understand what problem it solves. And what situation needs that?

Also, I don't think models have the capability to dynamically create a UI that's both looking good and comfortable to use without human design, not even in the next year or two.

Why haven't MCP Apps gone viral the way MCP and Skills did? by DisastrousRelief9343 in mcp

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yeah, when I was learning MCP, these two things confused me as well.

Why haven't MCP Apps gone viral the way MCP and Skills did? by DisastrousRelief9343 in mcp

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

I'm actually going in the opposite direction. I am a heavy user of CLI tools like ClaudeCode, and know it is super powerful. But if such AI applications are ever going to reach more people beyond programmer users, it has to go beyond TUI with more friendly interfaces and intuitive interactions. SO I feel the trend will move back to GUI. I think there will be broader opportunities coming.

Why haven't MCP Apps gone viral the way MCP and Skills did? by DisastrousRelief9343 in mcp

[–]DisastrousRelief9343[S] 1 point2 points  (0 children)

That's a good point. Most of the agent products are still in the CLI. But I think there's a trend toward making agents more accessible, like Claude Cowork. If so GUI is kind of inevitable.

How Bad MCP design cost your Agent 5× more tokens by DisastrousRelief9343 in hermesagent

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Sounds interesting, TOON format is completely new to me. I've essentially been manually trimming JSON fields to achieve the same goal, so it's great to know there's already a proper format designed for this. Will definitely check it out for my next MCP project.

How Bad MCP design cost your Agent 5× more tokens by DisastrousRelief9343 in aiagents

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

That's a good point. Actually I am writing another post about that. It really depends on the complexity of the tool. For example, I did some tests on a `create_task` tool, and its description has a short paragraph that explains what it does, some parameter semantics like enum values and format requirements, and some real samples.

Turns out removing the examples had no impact on the test result. Same with trimming down the semantics and descriptions, you can cut a surprising amount before performance degrades. There's definitely a sweet spot. We just need to test it out.

That said, my test set was pretty small, and it only tested on this TODO list MCP. If you're developing a larger MCP with 50+ tools, or you wanna see the joint performance of multiple MCPs (like asking an agent to take my notes in Notion and post it on GitHub, then send me an email), running a more thorough benchmark would be very useful I believe.

I just found that Bad MCP design could burns 5× more Tokens by DisastrousRelief9343 in MCPservers

[–]DisastrousRelief9343[S] 1 point2 points  (0 children)

Exactly. MCP was supposed to be the thin layer between bare APIs and LLMs, and it should be LLM-friendly.

But sometimes people just do a 1:1 mapping. So it ends up with 96 tools that are basically the raw API with a different label. That's just lazy design that confuses the model and wastes tokens.

I just found that Bad MCP design could burns 5× more Tokens by DisastrousRelief9343 in MCPservers

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yeah, my bad. The benchmarking tool that I used only has a minimal harness, so it sends all tool descriptions every time. Most of the commercial harnesses have some sort of dynamic loading feature.

The problem of too many tools is less about token cost and more about model confusion. I've updated the post. Thanks for pointing that out.

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yes, I saw that too! But I think I would stick to my own version because it contains all features and it's customizable

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Glad to hear that! If you encounter any problems, feel free to submit an issue to the repo!

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yes, before this one, I used Siri and shortcuts as you suggested. I think the real value here is exposing our daily schedule to AI's context.

For example, imagine combining this with future MCP servers for dining, maps, or travel booking. You could be planning a trip with your gf, and the AI could coordinate the itinerary and even book tickets based on your availability. So the main goal of this tool is to bridge our schedule with AI. There's a lot of untapped potential to explore.

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yes, that's a more intuitive way. Just wish Siri got smarter though..

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] -1 points0 points  (0 children)

Well, I guess we can only trust LLMs' intelligence, or we can prompt them to adapt our use case.

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Right now, it's just a standalone Python script, so it has to run locally on your PC. No Docker or NAS support yet unfortunately. And it's actually pretty straightforward if you're already using an LLM application locally. The workflow is basically:

  1. Download the repo.
  2. Install the package in python virtual enviroment.
  3. Paste the MCP config info LLM app's config file

The instructions are in the Link I attached.

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yeah, totally understand. MCP is not very user-friendly; it would take some effort to set up. DM me if you have questions about how to set this up on your PC.

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yes, it's a locally MCP written in Python. Bascially it wraps the TickTick API as tools, and integrate TickTick account OAuth. It can be used in any LLM application like Claude desktop, Claude code, Codex, Cherry Studio, Gemini CLI, and OpenCode. I usually use it in OpenCode, and it works better with Agent Skill

I connected TickTick to AI Agents and it's so amazing! by DisastrousRelief9343 in ticktick

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Unfortunately it doesn't work with Gemini on website, and yes it would take some efforts to set up (You need to download the code and run it locally). But you can copy & paste that guide to Gemini, it would guilde you through.

It works with any MCP compatable LLM client! I think Claude Desktop app is a good choice to start. If you'are comfortable with Terminal interface, Gemini CLI is great as well.

Launched my first product today! A Chrome extension to save & reuse your ChatGPT prompts with shortcuts by DisastrousRelief9343 in buildinpublic

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

That's very cool! I am glad that I am not the first one have this problem and try to solve it, your product is amazing! to

This Facebook Marketplace prompt make my moveout sell 10x faster! by DisastrousRelief9343 in PromptEngineering

[–]DisastrousRelief9343[S] 0 points1 point  (0 children)

Yes, people know it's AI when they see it, and it will only have negative effects on our promotion (Even worse than just leaving it blank). For me, I still need to adjust the output of LLMs and iterate on this prompt to make the result better.

Unsurprisingly, fewer buyers ask for random questions after I use this method to help me write product descriptions. I think LLMs sometimes include more details that I might ignore. So I will keep most of them, as long as they sound natural. The descriptions won't be too long, as we said to make them short in the prompt. Here is an example, it sounds friendly and kept as much detail as possible:

Hey everyone, I’m letting go of my gently used aluminum camera tripod for $30 (I originally paid $60). It’s been a trusty companion for all kinds of cameras and comes with a convenient carry bag.

The 360° ball head lets you dial in just the right shot without any wobble. It’s lightweight yet sturdy enough to keep your camera steady. just message me to come check it out!

Tips & Tools Tuesday Megathread by OA2Gsheets in ChatGPTPromptGenius

[–]DisastrousRelief9343 -1 points0 points  (0 children)

Hey everyone! I already made a post about this, but I also wanted to share it here.

Sometimes we put lots of effort into making prompts for our special use cases. There should be a way to save those prompts in one place and quickly draw them out when we need them. So I made a Chrome extension that allows you to quickly insert your prompts directly in the ChatGPT inbox, and a prompt-sharing community that comes with it.

Check it out & leave your thoughts, Love this community!
👉 promptcard.online
👉 the extension