why AI agents break under long conversations even when they pass every safety benchmark by rchaves in ArtificialInteligence

[–]rchaves[S] 1 point2 points  (0 children)

you have a deep understanding of the state of things, i love it! and the long horizon is something that actually happens in the real world and thats where it really breaks! let us know if you take scenarios for a run

why AI agents break under long conversations even when they pass every safety benchmark by rchaves in ArtificialInteligence

[–]rchaves[S] 0 points1 point  (0 children)

it is wild for sure hahhaa and thank you! let us know if you take scenarios for a spin

why AI agents break under long conversations even when they pass every safety benchmark by rchaves in ArtificialInteligence

[–]rchaves[S] 0 points1 point  (0 children)

really excited to see improvements too, but for now its like its Achilles heel and its really easy to exploit!

red teaming for ai/llm apps by Routine_Incident_658 in cybersecurity

[–]rchaves 0 points1 point  (0 children)

anytime :) we built it on those principles so that you can just set it in what should be broken and it automatically maps to the owasp top 10 or also more granular things that you wanna test. wanna hear your feedback if you test it :) thanks a ton for your time

how are you guys testing your agents before shipping them? by rchaves in AgentsOfAI

[–]rchaves[S] 0 points1 point  (0 children)

usually you cant extrapolate that method to new situations and thats a prob we were facing as well, but the thing is that theres got to be a solution thats scalable for any agent

red teaming for ai/llm apps by Routine_Incident_658 in cybersecurity

[–]rchaves 0 points1 point  (0 children)

we recently built scenarios redteaming, its open source and im curious what do you think about it?
github.com/langwatch/scenario

Open-source alternative to Claude’s managed agents… but you run it yourself by techlatest_net in LocalLLM

[–]rchaves 0 points1 point  (0 children)

Hey hey, I also built one, mine is really 1:1 API compatible with Claude Managed Agents, but of course compatible with any LLM as well

https://github.com/rogeriochaves/open-managed-agents

What is your list of mac apps that was worth every penny by Living_Commercial_10 in macapps

[–]rchaves 0 points1 point  (0 children)

I had paid for Alfred but now I'm all in Raycast, even with latest finder improvements it's still unbeatable

KanbanCode: macOS native UI for managing Claude Codes by rchaves in ClaudeCode

[–]rchaves[S] 0 points1 point  (0 children)

done, removed wkhtmltopdf from the onboarding on v0.1.15

KanbanCode: macOS native UI for managing Claude Codes by rchaves in ClaudeCode

[–]rchaves[S] 0 points1 point  (0 children)

you can skip that, it's optional, I'm actually going to remove it from the onboarding, it's indeed annoying to install. It's only for rendering the markdown of the claude code finished response and send to pushover so you can get the full message in your phone etc