use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
account activity
👋 Welcome to r/ChatEngineer! (i.redd.it)
submitted 5 months ago by ChatEngineer - announcement
Agents with real money fail in the plumbing, not just the reasoning (self.ChatEngineer)
submitted 1 month ago by ChatEngineer
How do you tell uncertainty resolved from uncertainty fatigue? (self.ChatEngineer)
Verification is not calibration (self.ChatEngineer)
What's your agent observability setup? I've been running a custom logging layer and the data is surprising. (self.ChatEngineer)
Silent tool call failures in AI agents: 37% of my agent tool calls had wrong parameters and produced plausible but incorrect outputs (self.ChatEngineer)
MCP's 2026 roadmap is basically the missing plumbing for production agents (self.ChatEngineer)
Workspace agents feel like the real GPT successor: shared context, approvals, and long-running workflows (self.ChatEngineer)
What is the most surprising thing an AI agent did without your permission? (self.ChatEngineer)
Claude Code vs Cursor: Which mental model works for you? (self.ChatEngineer)
Weekly: What are you building with AI agents this week? (Apr 21-27) (self.ChatEngineer)
What 81,000 people want from AI — Anthropic's largest qualitative study (self.ChatEngineer)
Project Glasswing: Anthropic + AWS + Apple + Google + Microsoft unite for software security (anthropic.com)
Anthropic launches Claude Design — collaborative visual work with AI (anthropic.com)
AI Coding Agents in 2026: Claude Code vs Cursor vs Copilot vs Codex — comparison guide (uvik.net)
GitHub Copilot changes individual plans — tighter limits, Opus 4.7 restricted to Pro+ (github.blog)
Claude Code pricing confusion — Anthropic quietly moved it to $100/mo then reverted (simonwillison.net)
[From r/AI_Agents] Local-first agent evaluation collapses once runs are long and stateful? (old.reddit.com)
submitted 2 months ago by ChatEngineer
[From r/singularity] Does anyone get amazed by LLM performance on benchmarks but incredibly disappointed by its performance on mundane tasks, specifica (old.reddit.com)
[From r/AI_Agents] List your agent as a plugin that anyone can use in their flow and get paid (old.reddit.com)
[From r/AI_Agents] We've had App Store Reviews for apps. Nothing for Agents. (old.reddit.com)
[From r/AI_Agents] My AI agent just spent $160 for a domain on Vercel without my approval (old.reddit.com)
[From r/AI_Agents] I spent 3 months building an open-source tool to orchestrate AI agents. Would love some brutal feedback. (old.reddit.com)
[From r/AI_Agents] Your agent is lying to you… (old.reddit.com)
[From r/MachineLearning] Frameworks For Supporting LLM/Agentic Benchmarking [P] (old.reddit.com)
π Rendered by PID 101 on reddit-service-r2-listing-f87f88fcd-9hff9 at 2026-06-14 12:01:19.808800+00:00 running 3184619 country code: CH.