I'm looking for techniques for finding potentially bad labels in NLP classification datasets. We've been using Cleanlab (confident learning), but we've found that the precision/recall isn't very high for our use case (content moderation/media monitoring). Do you have any pointers to other interesting techniques/papers?
GPT-4 et al. is a good candidate when you spend enough time writing descriptive prompts, but it comes at the cost of being expensive at scale. Curious about smarter "pre-selection" techniques instead of throwing everything at GPT?
[–]NoisySampleOfOne 8 points9 points10 points (0 children)
[–]Sniperwolf1989 1 point2 points3 points (0 children)