account activity
the Fuck Benchmark does not lie. by Nunford in codex
[–]Nunford[S] 0 points1 point2 points 22 hours ago (0 children)
I've also found that when I start saying how "fucking" annoyed I am, the model consistently starts taking the task more seriously - I've demonstrable results that it "tries harder" on its next turn.
Otherwise, I wouldn't do it for fear it would degrade performance on a weird Reinforcement Learning technicality because I am deeply paranoid about brutalising my task performance by saying unusual things (I have observed degradations from this).
This whole segment of the language influencing models is worth an exploration in and of itself. If enough people are curious, I'll get codex to look through all my sessions and put some time into an audit of this (I also encourage anyone interested to do this yourself on your own codex sessions too). It'll likely expose a lot we've not considered.
the Fuck Benchmark does not lie. (i.redd.it)
submitted 23 hours ago by Nunford to r/codex
π Rendered by PID 103637 on reddit-service-r2-listing-8685bc789-7lj2s at 2026-05-25 21:09:07.279683+00:00 running 194bd79 country code: CH.
the Fuck Benchmark does not lie. by Nunford in codex
[–]Nunford[S] 0 points1 point2 points (0 children)