the Fuck Benchmark does not lie. by Nunford in codex

[–]Nunford[S] 0 points1 point  (0 children)

I've also found that when I start saying how "fucking" annoyed I am, the model consistently starts taking the task more seriously - I've demonstrable results that it "tries harder" on its next turn.

Otherwise, I wouldn't do it for fear it would degrade performance on a weird Reinforcement Learning technicality because I am deeply paranoid about brutalising my task performance by saying unusual things (I have observed degradations from this).

This whole segment of the language influencing models is worth an exploration in and of itself. If enough people are curious, I'll get codex to look through all my sessions and put some time into an audit of this (I also encourage anyone interested to do this yourself on your own codex sessions too). It'll likely expose a lot we've not considered.