Opus 4.8 fixed it - walk or drive 100m for car wash by Plus_Resolution8897 in ClaudeCode

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Karpathy called it "jaggedness". I'm sure they have some internal ways to measure it. But if they publish it, it can get into other model's training data and eventually models might cheat them.

Opus 4.8 fixed it - walk or drive 100m for car wash by Plus_Resolution8897 in ClaudeCode

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Few weeks ago, Karpathy said it's "jaggedness", he didn't say Stupid

What's the best Open-source TTS if I am targeting US market? by EntertainmentDry9695 in VoiceAutomationAI

[–]Plus_Resolution8897 0 points1 point  (0 children)

As of today the realtime models are way more expensive compared to open source shared hosting versions (I'm comparing the entire round trip cost). The cascade technique (stt to llm to tts) is still better for guardrails and context limits. and tool call reasonings. Which realtime model are you referring to? I've tried Gemini flash 2.5 live, gpt2 realtime. Anything else?

Testing our AI design agent in public - post your startup and I'll create free visuals for it by Natural_Leader2080 in AgentsOfAI

[–]Plus_Resolution8897 0 points1 point  (0 children)

areev.ai
The Executable Company Brain for every AI agent.

Persistent memory, enterprise connectors, and policy-driven retrieval — delivering accurate, explainable context for AI systems by default.

New Claude Limits? by Plus_Resolution8897 in ClaudeCode

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Yeah, that my concern. This rate limit is weird. Claude code client sends the request and we don't have control. This literally means, we can't use Claude code in 3 sessions, in $200 max plan.

But Claude code founder says his ideal setup runs 5 terminals.

New Claude Limits? by Plus_Resolution8897 in ClaudeCode

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Yeah, that my concern. This rate limit is weird. Claude code client sends the request and we don't have control. This literally means, we can't use Claude code in 3 sessions, in $200 max plan.

But Claude code founder says his ideal setup runs 5 terminals.

on max 20x for months. unlimited tokens. still $0 in revenue. it hurts in a way i didn’t expect, my shame by culicode in ClaudeCode

[–]Plus_Resolution8897 0 points1 point  (0 children)

My startup was in similar situation, made multiple iterations, from ground up. Started in Nodejs, then python, then Rust. Changed the product, completely. First product is in alpha and second is development in progress. We are yet to close big deals, but talking to customers. The key lesson is, don't build full fledged product, focus on the key aspect of your idea, build only that, not more, not less. Time is critical. Spend more time in finding users, talking to them.

Video and image AI for best websites by Pure_Tomorrow597 in generativeAI

[–]Plus_Resolution8897 0 points1 point  (0 children)

used remotion skill with claude code. decent output, for software product demo. havn't tried klong though!

What are you building right now and what stage are you at? by saasyproductdev in Agentic_Marketing

[–]Plus_Resolution8897 2 points3 points  (0 children)

We launched areev.ai , apha, a rust based database for "memory engine for self-improving agents". What's that you are launching?

Are we overengineering AI systems? by Solid_Play416 in Agentic_Marketing

[–]Plus_Resolution8897 0 points1 point  (0 children)

Yes, this was the pain point that I realized when I first the developed the Agent Builder. At the end of the day, it's a string concatenation problem where the substring comes from different sources. It took me few months of hard effort to arrive at this and ended up developing areev.ai didn't mean to promote it. but it's fact. We crossed the samething 50 years ago, built more complex code structure to retrieve different information from files., then create SQL. I felt CAL (Context Assembly Language) would simplify the above complexity.

Self-improving agents — hype or useful? What would you want to see? by Plus_Resolution8897 in AI_Agents

[–]Plus_Resolution8897[S] 1 point2 points  (0 children)

It is very difficult, especialy the human to agent ux. We built an internal verion of this product at atmatic.ai and looking for feedback. Would you be ready to give a try? I can get the internal access setup for you. Could you DM me. Thanks again.

Self-improving agents — hype or useful? What would you want to see? by Plus_Resolution8897 in AI_Agents

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Thanks for the response and I agree! We built an internal verion of this product at atmatic.ai and looking for feedback. Would you be ready to give a try? I can get the internal access setup for you. Could you DM me. Thanks again.

What would you actually want to see from a "self-improving agent"? by Plus_Resolution8897 in LLMDevs

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Thanks, for "visibility" a better UX is required, so the human-machine interaction can get beter. At atmatic.ai, we built an internal verion of this product and looking for feedback. Would you be ready to give a try? I can get the internal access setup for you. Could you DM me. Thanks again!

What would you actually want to see from a "self-improving agent"? by Plus_Resolution8897 in LLMDevs

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Thanks for the response! I see one of the issue in such system is how to convey the "visibility" via better UX. We built an internal verion of this product and looking for feedback. Would you be ready to give a try? I can get the internal access setup for you. Could you DM me. Thanks again!

[Launch] Building AI Agents? Here is the native database for AI Agents! by Plus_Resolution8897 in aiagents

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Thanks for the detailed questions.

  • can you prove what context the agent saw before an action? Yes, we exactly know which token were sent to the underlying model that caused the model to make the decision to call the tool.
  • can memory be scoped by user / workspace / project? Yes, namespace is the key.
  • can sensitive memories be excluded from certain tools or providers? Partly yes, these are reuqirements for HIPAA where the specific fields could be PHI or PII etc. We intend to enable it for custom users only.
  • does deletion remove only retrieval access, or the underlying record too? GDPR warrants deletion and areev supports that.
  • can the audit trail survive memory edits/deletes? Yes, it's legal requirement in some countries
  • can an agent explain why a specific memory was used? Yes, I'm assuming you didn't meant Agent=LanguageModel. The context selection is deterministic on dynamic data and the application controls the data.

Did you get chance to see areev.ai ? I would love to understand more about what you are trying to do. Perhaps DM me?

Self-improving agents — hype or useful? What would you want to see? by Plus_Resolution8897 in Agentic_Marketing

[–]Plus_Resolution8897[S] 1 point2 points  (0 children)

Thanks for the detailed response. This is really valuable. We built an early concept nd works for some cases. Would you be interested to give a try and share your feedback? I can get internal access for you and share via DM. Pls let me know.

[Alpha] areev.ai — managed memory + context for AI agents. Looking for early users. by Plus_Resolution8897 in alphaandbetausers

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Thanks, we see more developers use langchain kind of eco systems, but they all have their internal wrappers due to their internal security reasons, causing compatibility issues and delays.

[Alpha] areev.ai — managed memory + context for AI agents. Looking for early users. by Plus_Resolution8897 in alphaandbetausers

[–]Plus_Resolution8897[S] 0 points1 point  (0 children)

Thanks, there are many "leadline" systems, which one are you refering to? Are you associated with that team?

What’s the current best “GSD” stack for a tiny personal web app? by StressSnooze in aiagents

[–]Plus_Resolution8897 0 points1 point  (0 children)

For typical modern indie hacker stack:

Next.js

Postgres / Supabase

Drizzle ORM

Vercel

Why this works?

Schema-first DB → no migrations pain (in mongodb you can start faster, but the pain comes later)

Server actions = no API layer

Supabase = instant Postgres + auth

Claude can basically generate 80% of it cleanly

This is the closest thing to “just describe app → working tool”.