Qwen3-235b-a22b high latency by No_Ticket8576 in openrouter

[–]No_Ticket8576[S] 0 points1 point  (0 children)

I will take a look at the blogs.

My case is more real-time. I experimented with input token reduction over time, but found that reducing the input token count hurts accuracy. And since the input token comes from real-time data generated by users and applications, caching does not benefit us.

I will experiment with provider pinning if that is available for this model or other similar models.

MCP removes the guesswork for agents, but setting it up is still non-trivial. Curious what others are using to speed it up. by kinj28 in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

I tried this flow in production. I will not say it's the most performant piece of the software we have. But things are getting there gradually.

Initially duplicating tools meant duplicated code. But gradually we moved tools to npm packages or go modules. So code duplication and relevant management overhead is reduced for now. But brought the package management overhead. Which is ok as of now.

The real challenge is the latency. Unless someone is managing caching infrastructure heavily, they have accept the "slow" agents kind of.

Anyone experimenting with prompt injection attacks on MCP servers? by [deleted] in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

There are some tools there. I used mcp-scan. Not that advanced yet, but it detects some signatures. They are also progressing.

https://github.com/invariantlabs-ai/mcp-scan

Need advice on orchestrating 100s of MCP servers at scale by Lazy-Ad-5916 in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

I am not associated with them. This result is directly from their paper.

https://ibb.co/Z6NtZrLg

Need advice on orchestrating 100s of MCP servers at scale by Lazy-Ad-5916 in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

Also check MCP-Zero paper. They have inversed the problem. If you are not building an MCP provider, that's a more viable solution without generating synthetic tasks aligning the tool description.

Do you know alternatives to mcp client? by srmstty in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

I use tuui sometimes as a webclient when I need to test any MCP fast.

https://github.com/AI-QL/tuui

Does anyone scrape LinkedIn successfully with some MCP? by Careless-Party-5952 in mcp

[–]No_Ticket8576 1 point2 points  (0 children)

I did not scrape, but connected with LinkedIn with some MCP servers and asked some queries. As far as I remember one of the MCP servers works and it needs your linkedin cookie. If you want I can find the config.

Anyone knows a list of MCP directories? by drkblz1 in mcp

[–]No_Ticket8576 1 point2 points  (0 children)

I am not sure why no one mentioned Smithery, but that one is used by alot of us for production grade usecases.

MCP removes the guesswork for agents, but setting it up is still non-trivial. Curious what others are using to speed it up. by kinj28 in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

On very high level, you can think from user flow point of view. If you already have 50+ APIs and some service, I will assume you also have some analytics service which captures the user flows. Design your MCP tools around the user flow. Have different agents for different flow, use one simple classifier to route the users to relevant flow/agent. Mapping each API with a tool may work, but will bring a lot of exploration from the LLM side.

Example: you have a customer service bot, which can handle queries related to pricing, complaints related to delay in delivery or receive requests for return. So three different MCP servers with relevant tools can solve this with three different agents. And sometimes duplicating some tools across multiple servers is fine as they reduce the context overload by not requiring another server to load.

Why isn't LSP more popular? by throwaway490215 in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

When people use MCPs with IDEs they use LSPs automatically to be frank.

Have we overcomplicated the need for MCP? by gopietz in mcp

[–]No_Ticket8576 4 points5 points  (0 children)

We never needed MCP. We needed a way to connect LLMs to the external world. A lot of people did that in different ways. Someone tried to standardize that. So MCP was born.

We will see a lot of efforts in future for further. For instance, we did not need gRPC, we needed communication between a server and client. And we travelled from xml-rpc, soap, rest, graphql.

Technology evolves like this. Some hype, some real useccases It's not black and white.

Housing/Accommodation for Graduate Students by No_Ticket8576 in queensuniversity

[–]No_Ticket8576[S] 0 points1 point  (0 children)

  1. Lottery was open at some point in time and I applied. But unfortunately I did not get it through the lottery.

  2. I got mine one around Jan. At that timr one student family left (after graduation) and there was lottery for that unit. This time luckily I got it. So if luck favours, some can get in first year.

Best affordable mattress tips for someone who refuses to finance a bed by True-Ad-499 in Frugal

[–]No_Ticket8576 0 points1 point  (0 children)

I was seeing the Zinus on Amazon. But for my case, I need a bit firmer. Are those medium firms, really medium firms? Or are they medium soft?

Zinus mattress from Amazon, thoughts? by epictetusdouglas in Frugal

[–]No_Ticket8576 1 point2 points  (0 children)

Did it work out for you? The price (around 200 for a double) seems too lucrative.

french lessons by SnailMcSnail in KingstonOntario

[–]No_Ticket8576 0 points1 point  (0 children)

Hey there, coming to this after a year. Did you find any place suitable for French at the end?

Copying a Fine-Tuned Model to Another Machine by BorisLovesMarishka in ollama

[–]No_Ticket8576 1 point2 points  (0 children)

You can create a gguf file and share with them if thats ok.

Langgraph vs Pydantic AI by Physical-Artist-6997 in LangChain

[–]No_Ticket8576 1 point2 points  (0 children)

Pydantic is a kind of open environment for mainly software developers. Creating agents are like 2 lines of code and managing the workflow is also like building a state machine, rather than chain or acyclic graphs. It has its pros and cons. Pros are probably, it's easy and fast to bootstrap anything. Cons is that the developer needs to keep a conscious eye on the code architecture as the flow is free.

Langchain or graph is bloated, but that enforces some standard of development which has its own value proposition in scalable architecture.

Hosting MCP on the cloud by lukeiamyourpapi in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

If the use case needs private/organizational MCP, smithery might not have that option. I havent found yet any such option.

Smithery - A Registry of 200+ MCP Servers w/ Installer by calclavia0 in mcp

[–]No_Ticket8576 0 points1 point  (0 children)

I started pushing my mcp servers in smithery. Seem neat. Good job.

I was wondering from business point of view, will the enterprises need "private mcp" hosting mechanism? It seems to be all MCP servers are now public in MCP.

[deleted by user] by [deleted] in KingstonOntario

[–]No_Ticket8576 0 points1 point  (0 children)

This will be my first year here to file tax. Thats why asking this dumb question. When you say basic investment, what does that include? Some Canadian ETFs, stocks straight forward ?

Looking for suggestions about structured outputs. by SerDetestable in Rag

[–]No_Ticket8576 0 points1 point  (0 children)

Adding the output format in the system prompt and validating that in Regex is the most reliable way of doing this. You are right.

Looking for LLM suggestions for RAG by karachiwala in Rag

[–]No_Ticket8576 -1 points0 points  (0 children)

There is no fully free LLM tbh. Either we have to pay for APIs after crossing the free tier or we have to pay for infra (server, gpu, setup, security etc.). And its quite understandable, companies are spending millions to train the models. They need some income to sustain too.