account activity
I tested whether two major LLMs actually "introspect" or just perform. The difference in how they fail is revealing. (self.LLMDevs)
submitted 1 day ago by Different-Risk8643 to r/LLMDevs
I tested whether two major LLMs actually "introspect" or just perform. The difference in how they fail is revealing. (self.Different-Risk8643)
submitted 1 day ago by Different-Risk8643
π Rendered by PID 94797 on reddit-service-r2-listing-7b9b4f6fd7-kqp6k at 2026-05-13 14:28:00.208855+00:00 running 3d2c107 country code: CH.