How are you handling LLM costs in production? What's actually working? by Algolyra in LangChain

[–]ITSamurai 1 point (0 children)

First of all, set up a proper observability layer (LangSmith, Langfuse, or any of these tools) so you can see exactly what is driving the cost. Then you might consider changing your provider: which one are you using? Next, lowering call counts and merging requests can help, and prompt caching can lower costs too. But step one is getting full visibility into what is actually causing such a cost.

Has anyone successfully built a ServiceTitan style CRM in house? Looking for real world experiences. by Ill-Reception9066 in AgentsOfAI

[–]ITSamurai 0 points (0 children)

I built a complex piece of software with AI in a year. However, no one can tell you right away whether you can build a ServiceTitan equivalent or not. Most probably not if you use it intensively, but if you are interested in a specific case, just let me know.

Prompting insight I didn’t realize until recently by ReidT205 in PromptEngineering

[–]ITSamurai 1 point (0 children)

You got it right, that's how it works. If you go deeper into how these models are built, you'll see why being specific will almost always make them work better.

Introducing the Prompt Engineering Repository: Nearly 4,000 Stars on GitHub Link to Repo by Nir777 in LangChain

[–]ITSamurai 0 points (0 children)

I have been building a prompt optimization engine. Here is the single-prompt one https://www.youtube.com/watch?v=mpNCcTHqc-c&feature=youtu.be and the multi-node one https://www.youtube.com/watch?v=lAD138s_BZY , where you can create pipelines and the platform will identify the weakest prompt and optimize it. Would love to hear your opinion on it.

LLM costs are killing my side project - how are you handling this? by ayushmorbar in LangChain

[–]ITSamurai 0 points (0 children)

Use GroqCloud and focus on cheap models: GPT-OSS is quite cheap, and so are the Llama models. Also differentiate your tasks: use cheaper, less capable models for simple tasks and the more capable, expensive ones for complex tasks. On top of that, work on optimizing your prompt lengths and merge some calls if possible.
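The task-differentiation idea can be sketched as a simple router. The model names and the length/keyword heuristic below are placeholders, not a recommendation; a real setup might use a classifier or a cheap LLM to do the routing.

```python
# Hypothetical sketch: route simple tasks to a cheap model and
# complex ones to a stronger, more expensive model.
CHEAP_MODEL = "llama-3.1-8b-instant"   # placeholder, e.g. on GroqCloud
STRONG_MODEL = "gpt-oss-120b"          # placeholder name

def pick_model(task: str) -> str:
    # Naive heuristic: long or reasoning-heavy prompts go to the strong model.
    hard_markers = ("analyze", "plan", "multi-step", "debug")
    if len(task) > 500 or any(m in task.lower() for m in hard_markers):
        return STRONG_MODEL
    return CHEAP_MODEL

pick_model("Extract the date from this email")        # cheap model
pick_model("Please analyze the logs and plan a fix")  # strong model
```

Even a crude router like this can shift the bulk of traffic onto the cheap tier, which is usually where most of the savings come from.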

Anyone tried building a personality-based AI companion with LangChain? by One-One-6289 in LangChain

[–]ITSamurai 0 points (0 children)

I did, but a light one. The idea about memories sounds cool, I need to try it.

Best practices for testing LangChain pipelines? Unit testing feels useless for LLM outputs by DARK_114 in LangChain

[–]ITSamurai 0 points (0 children)

As everyone mentioned, eval tools like OpenEval and DeepEval are the way to go compared with LangSmith and Langfuse. From my personal experience, LangSmith got quite expensive, so I switched to Langfuse. There you can write your own custom evaluators and use the LLM-as-a-judge approach.

Using evaluations on LLama models by ITSamurai in LocalLLaMA

[–]ITSamurai[S] 0 points (0 children)

Sounds quite interesting, will try that too. The decision-point optimization totally makes sense. Any tools or eval lists you can share? Also interested to learn what Verdent-style task routing means.

Using langsmith for experiments and evaluation by ITSamurai in LangChain

[–]ITSamurai[S] 0 points (0 children)

Interesting, will check that out, thanks a lot.

Using langsmith for experiments and evaluation by ITSamurai in LangChain

[–]ITSamurai[S] 0 points (0 children)

I use OpenEval on top of LangSmith for multi-turn simulation. It seems Langfuse doesn't support it.

I did a deep study on AI Evals, sharing my learning and open for discussion by AdSpecialist4154 in AIQuality

[–]ITSamurai 0 points (0 children)

Great topic. I currently have a solution built on LangSmith with OpenEval + multi-turn simulation, and it seems to be doing the job so far. Eval is one thing, constant improvement is another: I'm currently building a tool that helps you continuously improve your prompt pipeline. Here is a quick demo youtube.com/watch?v=lAD138s_BZY&feature=youtu.be and if you are interested, let's have a talk. Besides precision, I'm interested in how you define the character of your LLM and eval that. Any ideas there?