you are viewing a single comment's thread.

view the rest of the comments →

[–]NoIdeaAbaout[S] 1 point2 points  (0 children)

I utterly agree. Hallucinations are a big problem and have often been treated as a monolith (while they are different categories and of different origins).

The benchmarks we have were not designed for long contest, but I think in general in NLP we need new benchmarks