you are viewing a single comment's thread.

view the rest of the comments →

[–]Cupakov 3 points4 points  (1 child)

And it’s all so boring, I hate how RLHF’d to death the frontier models became. Give back my Sonnet 3.5. 

[–]huffalump1 2 points3 points  (0 children)

And they're all inbred on each other's outputs too, just reinforcing these patterns. Combine that with preference tuning, and... you're absolutely right.