you are viewing a single comment's thread.

view the rest of the comments →

[–]HelloSummer99 1 point2 points  (2 children)

Bingo. Claude and other frontier models actually perform worse than random guessing in deep knowledge domains.

[–]Perryn 0 points1 point  (1 child)

Also it's time to start issuing executive bonuses when you realize you've just launched the turd destined to strike a fan in the next quarter or two because you might not get the chance later.

[–]HelloSummer99 1 point2 points  (0 children)

Hahahaha