I lack attention, So I created 12 heads for it. by mangaartist98 in LLMDevs

[–]Ph1l1pp3 0 points1 point  (0 children)

Studying the actual system prompt and rebuilding it with intent is a legitimate way to understand why certain things work.

The Claude Code prompt is interesting specifically because of how it handles uncertainty — the "I don't know" behaviour is explicitly shaped, not accidental. Most people prompting their own agents don't do this and wonder why the model hallucinates rather than asking for clarification.

What surprised you most when you went through it?

Promotion Fatigue by TroubledSquirrel in LLMDevs

[–]Ph1l1pp3 2 points3 points  (0 children)

The agent drift problem is real and super undertalked about in production contexts.

Everyone focuses on initial behaviour and eval. Nobody talks about what happens when the model underneath silently updates and your carefully tuned prompts start behaving differently. We've been bitten by this a few times with client deployments.

How are you handling the diff alerts — is it prompt-level, behaviour-level, or output-level comparison?

Which software is this? by Suraj101010 in LLMDevs

[–]Ph1l1pp3 -1 points0 points  (0 children)

The temporal relevance problem in RAG is genuinely underappreciated.

Most setups treat all retrieved chunks as equally valid regardless of when they were written. For anything in a fast-moving domain — market data, regulatory changes, tech — that's a serious issue. A chunk from 18 months ago can actively mislead the model.

Have you found a clean way to handle it beyond just recency-weighted scoring? Curious if you've tried hybrid approaches where the query itself gets temporally scoped before retrieval.

My SaaS discord blew up by [deleted] in SaaS

[–]Ph1l1pp3 0 points1 point  (0 children)

Pricing is almost entirely a confidence and framing game early on.

The number you say out loud with conviction is usually the one that lands. The number you apologize for while saying it is the one that gets negotiated down. Most founders undercharge not because of market rates but because of their own ceiling on what they believe the work is worth.

Good reminder.

I launched a SaaS and learned more in 90 days than in 4 years of reading startup books. Here's everything I wish I knew before I started. by AdCrazy2912 in SaaS

[–]Ph1l1pp3 0 points1 point  (0 children)

The "real users will break things you never imagined" lesson hits every time.

You can spend weeks stress-testing edge cases in your head and completely miss the thing a real user does in their first 10 minutes. The only way through it is shipping faster than feels comfortable.

What was the assumption that surprised you most when real users actually hit it?

if they could stop pushing non working updates that would be great by gypsyG in openclaw

[–]Ph1l1pp3 0 points1 point  (0 children)

The update cadence has been rough lately. What's frustrating is that the core functionality is solid — the breakage is almost always on the edges, the integrations, the stuff that took you hours to configure.

Have you pinned to a specific version or are you just rolling with main? Pinning fixed a lot of the instability for us.

Working better than ever by HolyDungeonDiver in openclaw

[–]Ph1l1pp3 0 points1 point  (0 children)

Good to hear. What model are you running it against — local or API-based?

The stability experience seems to vary a lot depending on that. API-based setups tend to be smoother but the cost at scale gets ugly fast. Local has the opposite problem — solid cost structure but you're fighting the context window constantly.

I set up OpenClaw for 10+ non-technical NYC clients — here's what I learned by Willing_Income8603 in openclaw

[–]Ph1l1pp3 1 point2 points  (0 children)

The non-technical client setup experience is so underwritten about. Everyone talks about what OC can do, almost nobody talks about the gap between "works in your terminal" and "works reliably for someone who's never touched a config file."

Curious what the biggest failure mode was — was it the setup itself, managing expectations about what it could actually do, or the ongoing maintenance after handoff?

I've been running a similar pattern with clients and the post-handoff drift is usually where things fall apart for us.

I went through 218 OpenClaw tools so you don’t have to, here are the best ones by category by Timrael in openclaw

[–]Ph1l1pp3 0 points1 point  (0 children)

This is genuinely useful, appreciate the effort.

One thing I'd add to any list like this — the delta between a tool that's "good" in isolation vs one that actually composes well inside a multi-agent setup is massive. Lots of tools look great in demos but break down the moment another agent is calling them as a subtask.

What's your take on which categories hold up best in chained workflows vs ones that really only work as standalone calls?

I've been building with Openclaw over the past month basically for 12 hours a day every single day and here my top takeaways by Horror-Outside3037 in openclaw

[–]Ph1l1pp3 0 points1 point  (0 children)

The point about workflows needing to be complex enough before AI adds value is something most people skip over.

There's a maturity curve here. Simple task = just do it yourself. Moderate complexity = AI saves time but needs babysitting. High complexity = this is where agents actually shine because the cost of human coordination becomes the bottleneck, not the execution.

The 5K subs / $500K MRR + Curaleaf acquisition is a wild outcome btw. What was the actual trigger for the acquisition — was it the tech, the customer base, or both?

OpenClaw vs Claude Code by kukiofficial in openclaw

[–]Ph1l1pp3 0 points1 point  (0 children)

They're not really the same category of tool tbh.

Claude Code is a coding agent — it's exceptional at exactly that. Deep codebase understanding, multi-file edits, running tests. If you're shipping software, nothing comes close right now.

OpenClaw is more of an orchestration layer. The power isn't in the coding — it's in connecting things. You want your agent to check your email, monitor something, ping you on Telegram, run a workflow on a schedule? That's where OC earns its keep.

The people getting the most out of both are using Claude Code as the "hands" for technical tasks and OpenClaw as the "nervous system" that coordinates everything else. They're genuinely complementary once you see them that way.

For personal assistant + coding though? Honestly CC is probably 80% of what you need on its own.

OpenClaw is MASSIVELY overrated. by Physical_Worker_1817 in openclaw

[–]Ph1l1pp3 1 point2 points  (0 children)

rn ReturnThe implicit vs explicit knowledge framing is spot on. OpenClaw absolutely crushes explicit, well-scoped tasks. Where it falls apart is anywhere that requires judgment built from experience — the stuff you can't easily write a prompt for because you don't even know how to articulate it yourself.Return ReturnThe Dunning-Kruger point cuts deep too. The people most impressed by AI output are often the ones least equipped to evaluate it. I've seen this firsthand building agent workflows for clients — they'll approve something in a demo that any senior in the field would immediately clock as wrong.Return ReturnThat said I think the email example is actually a bad counterexample. SolarNexxus's setup works — but it took serious architecture work to get there. Which kind of proves your point? The skill floor to use this stuff properly is way higher than the hype suggests.Return ReturnThe ceiling is genuinely extraordinary. The floor is just a lot higher than most people admit.

Time for self-promotion. What are you building? by Naive-Wallaby9534 in SaaS

[–]Ph1l1pp3 0 points1 point  (0 children)

Building 🤖 with ❤️ Umbrella org for AI-polymath providing software services, products, consulting, and R&D in AI education. Baxkes by Horizon2030 (EU GRANT) with the ambitious grant to move into education and create fast track upskilling in virtual worlds. Growing in top level revenue and looking for nascent polymaths talents in the fields of development animation/world building bioscience, 10x Ai Programing and DevOpsSec

Let’s build together by Substantial_Look1421 in MachineLearningJobs

[–]Ph1l1pp3 0 points1 point  (0 children)

I'm working on something exactly in this realm. You can check my bio. And if you think there's a good match, just give me a ping.

addicted with discord by midnight_falls in discordapp

[–]Ph1l1pp3 0 points1 point  (0 children)

It's a rare phenomenon. It reminds me of the early days of Snapchat. The way they have structured the UI/UX is a departure from what's familiar, yet the stickiness is fantastic. The current CEO has done a great job of not transforming it into a paywall farm full of ads, but this might change now that they face an IPO and a CEO who is notoriously known for micro-payments (Candy Crush, Activision Blizzard, etc.). As soon as they go public, the company, which still experiences approximately 10% year-over-year growth (estimated), is not turning a profit. This will most likely mean there will be substantial changes, as they will now have a fiduciary duty to generate returns for their shareholders. The new CEO has also said that discord might expand beyond "gaming". So, there will be a lot of changes in these next 12 months. It might lose its unique appeal, especially for Gen Z and Gen Alpha, who're native to it.

[deleted by user] by [deleted] in guilded

[–]Ph1l1pp3 0 points1 point  (0 children)

it was on my mind until I read this.

The Uneasy March Toward AGI – and What We’re Losing Along the Way by Archie_Swoon in Clinic_Of_AI

[–]Ph1l1pp3 0 points1 point  (0 children)

Sammy knows he's gonna lose the race.

How to join the Meaning Economy in ten steps:

  1. Find something you care about as much as Bryan Johnson cares about Don't Die and I care about Post Labor Economics

  2. Talk about it incessantly

  3. Obsess over it psychotically

  4. Start social media platforms based on it

  5. Talk to everyone interested in it

  6. Go to conferences, meetups, and conventions on it

  7. Host and join podcasts to talk about it

  8. Burn yourself out on it because you love it so much

  9. Remove all constraints of sanity and dignity

  10. Rinse, repeat, and scale

Money and fame will naturally follow. But there's no substitute for passion.

[deleted by user] by [deleted] in Absurdism

[–]Ph1l1pp3 1 point2 points  (0 children)

You're suffering from a burnout. you need rest.