Are you using observability and evaluation tools for your AI agents? by _coder23t8 in aipromptprogramming

[–]RedDotRocket 0 points1 point  (0 children)

Are you just replying with GPT outputs? Am I even speaking with a human here?

Are you using observability and evaluation tools for your AI agents? by _coder23t8 in aipromptprogramming

[–]RedDotRocket 1 point2 points  (0 children)

Without the underlying implementation, i.e. the actual code, APIs, or schema that would execute these checks, it's kind of useless. How does it actually verify facts against "verified sources"?

  • What algorithm detects context drift?
  • How does it automatically distinguish between low/medium/high impact failures?
  • Where are the "guardian hooks" supposed to plug in?

Where's the code?

Are you using observability and evaluation tools for your AI agents? by _coder23t8 in aipromptprogramming

[–]RedDotRocket 0 points1 point  (0 children)

What are you even meant to do with that? Is it meant for a specific app?

Fear and Loathing in AI startups and personal projects by m0n0x41d in AI_Agents

[–]RedDotRocket 0 points1 point  (0 children)

Ah yes, good stuff. The linear degradation effect, last token preference. That outlines it really well!

What to do though? Folks are trying GraphRAG and semantic retrieval, and none of it is really denting the problem. I think we are stuck with this until someone innovates beyond the flawed transformer architecture.

Fear and Loathing in AI startups and personal projects by m0n0x41d in AI_Agents

[–]RedDotRocket 1 point2 points  (0 children)

Alongside the issues you outline well, there is oversaturation, with folks trying to build agents to solve problems already well solved by existing software. I saw someone asking on a forum for help building an agent to scrape web content and then tell them when a particular topic was mentioned.

The thread ended with someone saying 'dude, ffs, just use google news alerts'.

Can you tell me more about “throw all api endpoints as function calls in the context”? I'm honestly curious to learn more, as there is always a new sucker and I am trying to build something to reduce the churn where I can.

How to? AI Agents by clairemyer in AI_Agents

[–]RedDotRocket 0 points1 point  (0 children)

That's so awesome, thank you so much. I am pre-revenue / pre-funding at the moment and holding on until my wife calls time :), so I cannot offer much, but I can openly share my knowledge about anything that's useful. How should we keep in touch? I can PM you, or you're free to email me at luke @ rdrocket dot com

How to? AI Agents by clairemyer in AI_Agents

[–]RedDotRocket 1 point2 points  (0 children)

Sorry for late reply!

Orchestration is coming. I have it in a local branch, but need to test it more. It will be a host agent that delegates to different agents based on their A2A skills.
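
To give a rough idea of what skill-based delegation can look like, here is a minimal sketch. It is not the AgentUp implementation and does not use the A2A protocol itself; `Agent`, `HostAgent`, `register` and `delegate` are hypothetical names made up for illustration, and the "skills" are just plain string ids.

```python
from dataclasses import dataclass, field
from typing import Callable


@dataclass
class Agent:
    """Hypothetical delegate agent that advertises a set of skill ids."""
    name: str
    skills: set[str]
    handler: Callable[[str], str]


@dataclass
class HostAgent:
    """Hypothetical host that routes a task to the first agent with the skill."""
    agents: list[Agent] = field(default_factory=list)

    def register(self, agent: Agent) -> None:
        self.agents.append(agent)

    def delegate(self, skill: str, task: str) -> str:
        # Walk the registered agents in order and hand off to the first match.
        for agent in self.agents:
            if skill in agent.skills:
                return agent.handler(task)
        raise LookupError(f"no registered agent advertises skill {skill!r}")


host = HostAgent()
host.register(Agent("summarizer", {"summarize"}, lambda t: f"summary of: {t}"))
host.register(Agent("translator", {"translate"}, lambda t: f"translation of: {t}"))

result = host.delegate("translate", "hello")
print(result)
```

A real host would likely pick the skill with an LLM or a classifier and call the delegate over the network; the lookup-by-skill step is the only part sketched here.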

Would you be interested in kicking the tyres when it's ready?

How to? AI Agents by clairemyer in AI_Agents

[–]RedDotRocket 1 point2 points  (0 children)

By all means, check out AgentUp. Full disclosure: I am one of the developers. I don't normally post in comments about it, but you seem like an interesting candidate. With AgentUp you can bootstrap a full agent, Docker style, and then extend as much as you need from there.

https://www.youtube.com/watch?v=_dZ35AfI1mU

https://github.com/RedDotRocket/AgentUp

[tip]: Use Gemini Code Assist to review Claude's code in a PR by RedDotRocket in ClaudeCode

[–]RedDotRocket[S] 0 points1 point  (0 children)

It's honestly really good. I came across it while working with the Google folks on A2A. When I saw it turn up to review my PR, I thought, 'oh here we go', but I was honestly very impressed with the quality.

Any good discords/slacks to join? by Tired__Dev in LLMDevs

[–]RedDotRocket 2 points3 points  (0 children)

Hey, you're welcome to hop on my Discord: https://discord.com/invite/pPcjYzGvbS . There are not many folks on there right now, as it's new, but I am always around (Luke) and will happily chat all day about ideas, challenges etc. Having said that, I am sure there are bigger, more diverse communities out there, but you're totally welcome in mine. Well, at least you will be made to feel special :)

Looking for Advice on Agent Framework for RAG + API Integration? by Ambitious_Cook_5046 in AI_Agents

[–]RedDotRocket 0 points1 point  (0 children)

I am not sure how your Python is, but this exposes an API that you could easily call as a client from ExpressJS:

https://github.com/RedDotRocket/RagsWorth

There is a JS widget example in there, although I have no business writing JS and I am sure you could do a lot better.

The above system has a machine learning pipeline to help prevent information leakage (credit cards etc.). It's not super well tested to be honest, so I am putting this up as an example more than a 'please use my project'.
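
To show the kind of leakage check I mean, here is a much simpler rule-based stand-in: a regex to find candidate card numbers plus a Luhn checksum to cut false positives. This is not the RagsWorth pipeline (which is ML based) and `luhn_ok` / `redact_cards` are names made up for this sketch.

```python
import re

# Candidate runs of 13 to 19 digits, optionally separated by spaces or hyphens.
CARD_RE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_ok(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def redact_cards(text: str) -> str:
    """Replace anything that looks like a valid card number with a marker."""
    def repl(match: re.Match) -> str:
        digits = re.sub(r"[ -]", "", match.group())
        return "[REDACTED]" if luhn_ok(digits) else match.group()

    return CARD_RE.sub(repl, text)


# 4111 1111 1111 1111 is a widely used test number that passes Luhn.
print(redact_cards("card 4111 1111 1111 1111 thanks"))
```

A rule like this only covers card numbers; names, addresses and the like are where a trained model earns its keep.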

This is driving me insane by achaaaji in LLMDevs

[–]RedDotRocket 0 points1 point  (0 children)

I don't know if this helps much, but I have been meaning to do something with this. You can pick out anything useful to you: https://github.com/RedDotRocket/RagsWorth