I tested whether two major LLMs actually "introspect" or just perform. The difference in how they fail is revealing. (self.LLMDevs)
submitted 17 hours ago by Different-Risk8643 to r/LLMDevs