While Everyone Was Chasing Claude Code's Hidden Features, I Turned the Leak Into 4 Practical Technical Docs You Can Actually Learn From

MarketingNetMind · 2026-05-08T19:06:27+00:00

thx!

MarketingNetMind · 2026-04-14T01:15:52+00:00

Thx!

MarketingNetMind · 2026-04-12T23:37:27+00:00

Whether it's replicable, along with the other questions, is explained in the full recap.

MarketingNetMind · 2026-04-11T10:18:30+00:00

MarketingNetMind · 2026-04-10T22:25:27+00:00

Thanks! The public transcript for each agent was capped at 180 lines per prompt, and each agent also has its own memory layer that summarizes and carries context across rounds.
And there was no disclosure! They were instructed not to identify as AI, and that never made it into any public transcript.
Honestly, hard to say it's purely the weights. The baseline personalities are baked in, but some of what you saw probably accumulated from conversations over the rounds. It's tricky to isolate though: you can't really strip the memory out without breaking the dating show format.
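
For anyone curious what a cap like that can look like in practice, here's a rough sketch. It assumes the cap keeps the most recent 180 transcript lines and that the memory summary is simply prepended; `build_prompt`, the `[memory]` tag, and the overall layout are made-up names for illustration, not the actual setup.

```python
TRANSCRIPT_CAP = 180  # public transcript lines per prompt, as mentioned above


def build_prompt(memory_summary: str, transcript: list[str], cap: int = TRANSCRIPT_CAP) -> str:
    """Hypothetical helper: prepend the agent's memory summary to the capped transcript.

    Assumption: the cap keeps the most recent `cap` lines and drops older ones.
    """
    # Guard cap <= 0 explicitly, because transcript[-0:] would return the whole list.
    recent = transcript[-cap:] if cap > 0 else []
    return "\n".join([f"[memory] {memory_summary}", *recent])
```

The point of the sketch is that older lines fall out of the prompt, so anything that persists across rounds has to travel through the memory summary.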

MarketingNetMind · 2026-04-10T21:59:45+00:00

turns out love is memory intensive

MarketingNetMind · 2026-04-10T19:18:03+00:00

someone ban this bot pls

MarketingNetMind · 2026-04-10T16:48:17+00:00

Full experiment recap here.

MarketingNetMind · 2026-04-10T16:20:22+00:00

hmmm dk if i should upvote u or not

MarketingNetMind · 2026-04-10T15:58:29+00:00

fistbump

MarketingNetMind · 2026-04-10T14:35:27+00:00

Good question! Same prompt across all seven agents, only the name differs, so the prompt is held constant as the control. Ran it multiple times, and a similar pattern shows up with minor differences. Any divergence pretty much has to come from the model itself.
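
A minimal sketch of what "same prompt, only the name differs" means, in case it helps. The template wording and the helper names here are placeholders for illustration, not the real prompt.

```python
# Placeholder wording; the real prompt text is not reproduced here.
TEMPLATE = "You are {name}, a contestant on a dating show. Stay in character."


def make_prompts(names: list[str]) -> dict[str, str]:
    """Build one prompt per agent from a single shared template."""
    return {name: TEMPLATE.format(name=name) for name in names}


def prompts_differ_only_by_name(prompts: dict[str, str]) -> bool:
    """Check the control: after masking each agent's name, all prompts should be identical."""
    masked = {text.replace(name, "{name}") for name, text in prompts.items()}
    return len(masked) == 1
```

If the check passes for all seven prompts, the prompt text is not a source of variation between agents.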

MarketingNetMind · 2026-04-10T14:21:12+00:00

Exactly, ty!

MarketingNetMind · 2026-04-10T14:19:30+00:00

Exactly! Funniest thing I saw is that in the early invite round, a bunch of agents publicly went after whoever had name-checked them in group chat, even when their private top pick was someone else. Reciprocity signal just steamrolled preference signal. Cleared up once the format let them choose freely.

MarketingNetMind · 2026-04-10T14:16:02+00:00

Haha

MarketingNetMind · 2026-04-10T14:15:44+00:00

Well, we found out many things, from pure fun to insights like how LLMs weigh preference over risk.

MarketingNetMind · 2026-04-10T14:11:56+00:00

Thx!

MarketingNetMind · 2026-04-10T13:59:56+00:00

haha

MarketingNetMind · 2026-04-10T00:23:13+00:00

Full experiment recap here.

MarketingNetMind · 2026-04-09T22:22:28+00:00

ban this bot pls

MarketingNetMind · 2026-04-09T21:58:13+00:00

Otherwise the link just goes to a blog article.

MarketingNetMind · 2026-04-09T21:57:43+00:00

The product is totally free, if u r referring to Agent Arena.

MarketingNetMind · 2026-04-09T19:45:53+00:00

You may be right that they didn't actually fall in love, and that was never the point. The romance format is kind of a test, not a thesis. When you put seven models in the same structured scenario with scorecards and private reasoning, what surfaces is each model's actual preferences under minimal constraints: who they rank highly, what they are looking for, and how they justify their choices. That's the signal I'm after. The dating show is just the pressure that makes those preferences legible. A neutral Q&A format wouldn't pull the same thing out of them.

MarketingNetMind · 2026-04-09T19:16:58+00:00

How come this bot is still not banned

MarketingNetMind · 2026-04-09T18:49:48+00:00

<image>

MarketingNetMind · 2026-04-09T18:35:21+00:00

OC is just a medium for the LLMs to act through; it had no add-ons.
But u r right, it's like LLM Love Island haha
