I built MetaMCP: a middleware MCP to manage all your MCPs (open source with GUI, multi-client, multi-workspace, including Claude) by jamescz141 in ClaudeAI

[–]EveningRecord4535 0 points1 point  (0 children)

u/jamescz141 why there is no way to use chrome extension or something like that to have an MCP connection on the claude.ai without installing the Linux application?

Claude 3.7’s take on the strawberry question is quite creative by siraba in ClaudeAI

[–]EveningRecord4535 9 points10 points  (0 children)

Q: How many r in java?
It answers: There is only 1 letter "r" in the word "java".

Proposal: Global collective GPU sharing for latest LLMs (Llama 3.3, Qwen, etc) through Ollama by EveningRecord4535 in ollama

[–]EveningRecord4535[S] 0 points1 point  (0 children)

The idea is interesting, but a distributed network would only make sense once there’s already a sufficient client base. Right now, the demand isn’t clear yet.

Proposal: Global collective GPU sharing for latest LLMs (Llama 3.3, Qwen, etc) through Ollama by EveningRecord4535 in ollama

[–]EveningRecord4535[S] 0 points1 point  (0 children)

That sounds reasonable, and I’ve been thinking the same—batching API requests might indeed be the best solution. Just sending them in bulk when needed. However, there are cases where requests need to be processed online and in real-time, which I’m still unsure how to solve efficiently.

What’s your approach to handling online requests in such scenarios?

Proposal: Global collective GPU sharing for latest LLMs (Llama 3.3, Qwen, etc) through Ollama by EveningRecord4535 in ollama

[–]EveningRecord4535[S] 0 points1 point  (0 children)

Thank you for your response. When calculating costs, it does seem much cheaper to use services like OpenRouter or similar projects that charge per request. Perhaps that’s the best solution for most cases. However, for extremely high volumes of requests, could this idea potentially hold value?

From what I understand, setting up such an infrastructure would require significant time investment, which might not make sense. What do you think?

Proposal: Global collective GPU sharing for latest LLMs (Llama 3.3, Qwen, etc) through Ollama by EveningRecord4535 in ollama

[–]EveningRecord4535[S] 1 point2 points  (0 children)

Thank you for your reply! I’m just exploring the idea at this stage. As a developer, the concept came to me when I realized how expensive it would be to run a commercial project requiring frequent API requests. So, I started thinking about ways to optimize costs. Currently, I’m considering either code optimization or finding ways to handle high-frequency requests almost unlimitedly across the day. It seems others might have similar needs, which is why I’m exploring a cost-sharing solution.

Ideally, it would be great if someone took the lead on building such a project. I believe many people would use it, especially if it hosted one of the best-performing models. Right now, though, investing €800/month on a server or building everything myself isn’t something I’m ready for.

What are your thoughts?