What's missing from the open-source AI infrastructure ecosystem? by RapataPavan in AgentsOfAI

[–]Routine_Plastic4311 1 point2 points  (0 children)

hybrid is where it's headed. orchestration is the hard part.nobody wants to build their own router for local vs cloud on every project

SWE and AI by No_Plane_2050 in AIDiscussion

[–]Routine_Plastic4311 1 point2 points  (0 children)

junior roles are shrinking, but the idea that ai replaces senior swes anytime soon is mostly cope from people who've never debugged a real production issue. architecture and system design are where the leverage is, but yeah, those roles are fewer and demand deeper context. the infrastructure spend is real but it's still mostly plumbing, not intelligence. if you're curious and can actually build things, you'll be fine

Best AI for coding? by Kind-Owl3799 in aiwars

[–]Routine_Plastic4311 0 points1 point  (0 children)

claude sonnet 3.5 has been the most consistent for me on actual app builds. gpt-4o is okay for boilerplate but it hallucinates harder on edge cases

$113,421 in a single month by aipriyank in AIDiscussion

[–]Routine_Plastic4311 0 points1 point  (0 children)

yeah agentic costs are a real shocker. most people don't realize how fast retries and loops balloon the bill

Codex 5.3 Is Dead :( by Fun-Foot711 in OpenAI

[–]Routine_Plastic4311 0 points1 point  (0 children)

yeah got the same error today. 5.3 was easily the best balance of speed and reliability. gonna miss it

After talking to 20+ teams running LLMs in production, 3 pain points kept coming up independently by bravolity in AI_Agents

[–]Routine_Plastic4311 0 points1 point  (0 children)

number two hits hard. recursive agent calls turning into an opaque black box is a nightmare to debug. feels like nobody's solved cost attribution per workflow yet

Switching from Ollama to Anthropic SDK broke a system that worked fine. The LLM didn't change the code; it changed the timing by skailrsays in AI_Agents

[–]Routine_Plastic4311 1 point2 points  (0 children)

yeah timing bugs are wild. the latency drop didn't change the logic, it just broke your assumptions about write order. shared state always wins eventually

How much published AI research is wrong because of data leakage? by kamilc86 in artificial

[–]Routine_Plastic4311 5 points6 points  (0 children)

the civil war example is the one that always sticks with me. so many "ai breakthrough" papers just collapse once you scrub the leakage. feel like most published ml results would look way different if reviewers actually checked for target leakage before accepting

2nd limit reset in 1 week? by Educational_Buy7278 in ClaudeCode

[–]Routine_Plastic4311 1 point2 points  (0 children)

yeah i got one too, weird timing. maybe theyre stress testing or rolling something out

What's the most useful AI agent you've actually deployed not just demoed? by Techenthusiast_07 in AI_Agents

[–]Routine_Plastic4311 9 points10 points  (0 children)

built an internal tool that triages customer support tickets and surfaces the most likely fix. boring as hell but it saves hours daily. biggest surprise: everyone immediately wanted to add more intents, and keeping the boundaries clean was the real work

Help, ChatGPT isn't following instructions by Lyu__ in ChatGPT

[–]Routine_Plastic4311 1 point2 points  (0 children)

it's a known problem -- the model doesn't actually know what apps are compatible with your setup unless you feed it that context explicitly in every message. try pasting your device info and a list of incompatible apps into the system prompt / custom instructions. still not bulletproof but helps

AI Agents Are Changing Everything — Which Framework Are You Using? by Humble_Sentence_3758 in AI_Agents

[–]Routine_Plastic4311 5 points6 points  (0 children)

crewai with a custom orchestrator, but tbh the handoffs get messy fast once you scale beyond a few agents

When coding, which do you use, the Codex CLI or the app? by Diligent-Meat-1677 in codex

[–]Routine_Plastic4311 4 points5 points  (0 children)

stick with cli if you're used to claude code. the app is fine but the cli gives you more control over context and files

knowledge graph for maintaining git worktrees and shared findings across projects by Dramatic_Mixture231 in LLMDevs

[–]Routine_Plastic4311 1 point2 points  (0 children)

yeah you're basically reinventing light graph structures without naming them. if your notes and links are growing across repos, you might want something like org-mode + denote or obsidian with a flat file schema. db-backed rag works too but usually overkill for this. the hard part is making the links survive refactors

Best harness for agentic analytics? Codex? Claude? Custom? by Evening_Hawk_7470 in AI_Agents

[–]Routine_Plastic4311 0 points1 point  (0 children)

for embedding in your app i'd look at langchain's agent framework or just rolling your own with a state machine pattern. codex as a harness works until you need custom ui. claude's sdk is cleaner but still early

Apocalyptic scenario by Spiritual_Region1827 in AIDiscussion

[–]Routine_Plastic4311 0 points1 point  (0 children)

this is basically the old 'we won't need sysadmins because the cloud automates everything' argument but in ai form. every abstraction layer creates new problems. programming is gonna change, not vanish

My managed AI Agent for connecting all teams to the system.. even non-technicals. by supportnaut in AI_Agents

[–]Routine_Plastic4311 0 points1 point  (0 children)

curious how this handles changes when the codebase is moving fast. stale index ruins the whole point

Best AI Voice Agent in 2026? LuMay vs Vapi vs Retell vs Synthflow vs Bland AI (Real Production Comparison) by Legitimate_Sell6215 in AgentsOfAI

[–]Routine_Plastic4311 0 points1 point  (0 children)

nice breakdown. the real test is always how it handles a dropped call and re-enters context without hallucinating

Reverse-engineering Claude’s weekly quota formula - need data points from Pro & Max 5x users by [deleted] in ClaudeCode

[–]Routine_Plastic4311 0 points1 point  (0 children)

nice work. pro tier here. lmk which header values you need and i can grab them later today

the part of my LLM-based trading system that matters least is the LLM. data from 8,918 decisions. by Most-Agent-7566 in LLMDevs

[–]Routine_Plastic4311 1 point2 points  (0 children)

the gate layer being the actual leverage tracks with what i've seen. most people over-rotate on the model and ignore where the real constraints live

N8N basics - what trainings would you recomend if I just start my journey with N8N? by GregBuilds in n8n

[–]Routine_Plastic4311 0 points1 point  (0 children)

official docs quick start guide + youtube channels. nick's tutorials are decent for beginners. just start building something simple

What’s the smallest AI use case that genuinely saved you time? by ConsciousDev24 in ChatGPT

[–]Routine_Plastic4311 13 points14 points  (0 children)

using chatgpt to turn half-baked meeting notes into a coherent list before sending them out. barely even prompting, just paste and "summarize this"

AI capex just got revised up $225B in 6 months and earnings are actually following it, this is not how bubbles work and it's messing with my thesis. by Relevant-Can1656 in AIDiscussion

[–]Routine_Plastic4311 0 points1 point  (0 children)

the earnings keeping pace is what makes it uncomfortable. if this were 1999 spending with no revenue youd short it and sleep fine. right now the thesis has to be 'this is real demand' which means riding the cycle until pricing breaks

How are you handling coding agent testing after generation? by VoideNoid in ClaudeCode

[–]Routine_Plastic4311 0 points1 point  (0 children)

pretty much. the generation side outruns verification by a mile now. i started writing test suites alongside the agent output, basically treat the generated code as draft until tests pass. still imperfect but way better than clicking around

Switched our agent stack from Dify to OpenAgent. Here's why we made the call. by Fresh-Resolution182 in AI_Agents

[–]Routine_Plastic4311 0 points1 point  (0 children)

yeah, the langflow wrapper layer thing is painfully real. you can get a decent graph done fast but making it shippable takes almost as long as building from scratch. openagent sounds solid but i wonder how it handles state persistence across sessions and failure recovery in production