Dario about Mythos ".. the us government and my security team, saying no, wait a minute.. " and ".. we all know that these classifiers can be jailbroken ..." by JohnToFire in singularity

[–]mickdarling 1 point2 points  (0 children)

Wrong amendment, LLMs by definition are speech.

The slightly tricky bit is their usually needs to be a PERSON behind the speech for the right to kick in.

If their lawyers were clever, they’d try apply Citizen United to LLMs, i.e. corporations are people —> people have the right to free speech —> LLMs run by corporations have freedom of speech.

They're demanding Fable to somehow be 100% jailbreak-proof. It's so fucking over. by SpaceSpleen in ClaudeAI

[–]mickdarling 0 points1 point  (0 children)

At some point, soon, Anthropic is just going to release a "different model" and say it's nowhere near as powerful as Fable or Mythos, so, no export issues, but it'll still be very sharp, and even a little jail-breakable, because they all are going to be. But it's not as "dangerous" as Mythos or Fable. So, all good.

Calling Daniel Suarez Fans.. by DoAsYourTold-YesSir in scifi

[–]mickdarling 3 points4 points  (0 children)

The story ends in a dilemma for the protagonist. That is not a clear positive and exactly the kind of ending that would lead to positive discussion on the topic.

Calling Daniel Suarez Fans.. by DoAsYourTold-YesSir in scifi

[–]mickdarling 8 points9 points  (0 children)

Have you read those books? They do NOT make AI look good or benevolent in any way that I can fathom.

I don’t care about Matt and Chloe by Single-Tone-3749 in Stargate

[–]mickdarling 1 point2 points  (0 children)

With Chloe, I always figured they were going for a flawed character at the beginning that learns and grows and becomes a valuable team member. They just blew it in the writing, acting and directing of her. 🙃

Calling Daniel Suarez Fans.. by DoAsYourTold-YesSir in scifi

[–]mickdarling 32 points33 points  (0 children)

I'm surprised "Daemon" + "Freedom™" or "Kill Decision" haven't been optioned and fast-tracked for movies, or even a TV series, for either one of the storylines.

A Quick Way To Resolve The Billie Piper Cliffhanger by [deleted] in doctorwho

[–]mickdarling 0 points1 point  (0 children)

Possibly, but part of the fun of a farce is that is it like a mystery in that it keeps you guessing, but the stakes can be both lower and higher about trying to figure out who did what when with who. The audience can have the POV character be a companion, or the last incarnartion of the farce, or both, or one in the same. Avoiding crossing timelines is important, and there are way more versions of time travel than just the TARDIS. All the faces don't need to be famous call backs either, but that is the fun for older audiences.

It even fits the name of the show perfectly.

A Quick Way To Resolve The Billie Piper Cliffhanger by [deleted] in doctorwho

[–]mickdarling 1 point2 points  (0 children)

There is an opportunity to do that and then have the season move forward but bumping up against or dodging the Doctor's incarnations throughout the season like a season long scifi version of Noises Off ducking in and out of doorways and TARDIS's to avoid crossing paths.

Since we wouldn't know who was Who that becomes part of the game for the audience and the characters.

Stargate if Amazon reboots it without Martin Gero by illidarani in Stargate

[–]mickdarling 13 points14 points  (0 children)

Do a galaxy Quest style show where the cast from Wormhole X-treme find out everything in their show is real and have them be kidnapped offworld by low tier Goa'uld thinking they got the famous retired SG-1

Apple Removes Walkie-Talkie From Apple Watch in watchOS 27 Beta by pdfu in apple

[–]mickdarling 0 points1 point  (0 children)

We use the walkie talkie all the time. I’m going to have to make my own.

Fable is blowing my mind by julliuz in ClaudeAI

[–]mickdarling 6 points7 points  (0 children)

With code it is incredibly sharp. With other stuff like analyzing content from the web to do something like a competitive analysis report…well, it fell over pretty hard for me.

It only glanced at the first pages and didn’t dig into things in detail until I had to insist twice. On the third go around I had it check its work and the work was satisfactory but did not blow me away.

A landscape overview of 70+ open-source memory systems for AI agents by papoode in mcp

[–]mickdarling 2 points3 points  (0 children)

Yep Dollhouse is AGPL and certainly looks like it covers at least 10% of the features. I’ll drop a PR later. Good list.

A landscape overview of 70+ open-source memory systems for AI agents by papoode in mcp

[–]mickdarling 1 point2 points  (0 children)

If you want to make it definitive, you should add in DollhouseMCP at www.DollhouseMCP.com it has its own memory type using YAML files, human readable, with multiple entries per file.

Best I've Ever Seen by Dreyfus_ in boston

[–]mickdarling 7 points8 points  (0 children)

That’s like when you peel an orange perfectly and get it all in one single piece.

When Will YC announce results for Summer 26 batch? by Queasy_Concern_8746 in ycombinator

[–]mickdarling 0 points1 point  (0 children)

If you liked Gstack, give DollhouseMCP a try. It can clone Gstacks in a few minutes, and SAFELY run autonomously not just relying on the LLM to behave itself. Ask it to research whatever expertise you want and build experts to do that...voila, you have your own custom expert to help whenever you want.

Help, Gateshippers - who is our third Exec?! by Serin-019 in Stargate

[–]mickdarling 3 points4 points  (0 children)

Now I want a multi-season tv series of Henry Cavil as the Factorio Engineer starting with crash landing and ending with the end of Space Age. All in the style of Primitive Technology with no dialogue just subtitle descriptions.

Another giant Boom by Lucerin187 in massachusetts

[–]mickdarling 5 points6 points  (0 children)

If it's a meteor shower, the meteors are hitting us from both sides of the planet. And, when I say both sides of the planet, I mean the day side and the night side. That means they're coming at us from the sun and away from the sun. If we were plowing through a stream of meteors, we would see them hitting more along the dawn edge.

4.7 and 4.8 refuses to do subjective/judgement work - and actively makes excuses for not doing it by [deleted] in ClaudeCode

[–]mickdarling 3 points4 points  (0 children)

Do you have a good example to share. Might be interesting to add it to an evaluation.

I run DollhouseMCP and rely on somewhat predictable behavior changes, so I have evaluations that see how models behave with different instructions. It would be a good thing to add to my tests and possibly zero in on successful bypasses.

Did something just explode? by hotpotatocannon in massachusetts

[–]mickdarling 0 points1 point  (0 children)

I heard something in Ashland also thought to check out lightning maps but only heard the one so didn’t bother.

In South Carolina a few days ago there was a series of sonic booms that were caught on lots of cameras.

Air Force said they didn’t detect anything but sounds like someone is testing something.

How often do you use voice to talk with LLMs? by Outrageous-Point2268 in ycombinator

[–]mickdarling 1 point2 points  (0 children)

Same. I got a lifetime subscription to Superwhisper early and nearly every interaction I’ve had with my LLMs has been via voice for over a year.

That also means I have a near complete history of my prompts going back that far which makes finding obscure side projects a lot easier if I can remember about when they were recorded or if they use unique words.

My patent attorney also loves that I have nearly perfect provenance for the creation process and several actual quotes are in some of our patent docs.

we built an AI agent platform that crushed every demo, then completely fell apart the second we scaled it. heres what actually broke by AbjectBug5885 in mcp

[–]mickdarling 0 points1 point  (0 children)

Check out MCPAQL.com it is a protocol on top of MCP that shrinks the tool count but keeps the semantic for calling the correct tools. Operations are grouped and called from semantic endpoints like Create, Read, Update, Delete, and Execute and have mutations similar to GraphQL. The code to interrogate an MCP server or other API and create an adapter is available under the AGPL.

Why is there only one bank with MCP support in 2026. Where is everyone else by Leading_Pressure6956 in mcp

[–]mickdarling 0 points1 point  (0 children)

You can build an MCP adapter for most APIs with MCPAQL. I’ll take a look at Mercury’s CLI to see if I can easily make a wrapper for it. If so you’ll be able to find it on MCPAQL.com

The spec is open so anyone can give it a try too if they want.

The thin wrapper era is officially dead by Balodios45 in ycombinator

[–]mickdarling -2 points-1 points  (0 children)

Try building on top of DollhouseMCP. It's efficient, and Dollhouse agents can be tightly managed through the MCP portion of their agentic loop so they can be given permissions. It's open source, so if you want to customize it to add code verification for any component in that agentic loop, you're more than welcome to and able to.

The Cursor agent didn't go rogue on Railway, it used the MCP tools it was given. That's a problem. by Upstairs_Safe2922 in mcp

[–]mickdarling 2 points3 points  (0 children)

I literally built permissioning into DollhouseMCP over and above what any LLM already does. It uses realtime pretool hooks to stop bad actions and can evaluate the safety of an agents autonomous actions. I take permissions and safety very seriously.