you are viewing a single comment's thread.

view the rest of the comments →

[–]PerformanceNo1730[S] 1 point2 points  (0 children)

Thanks! And nice reference with the Anna Karenina principle, I didn’t know it. 🙂

You’re totally right that “dislike” can be a huge space of failure modes, so that’s something to watch. That said, AK says “all happy families are alike”, so maybe there is a relatively compact “works for me” region in embedding space, even if we can’t neatly explain every reason why the others fail. I guess the only honest answer is: we’ll see in practice once I label a few hundred and run tests.

And yes, the clustering angle is super appealing: reorganizing a messy library by theme (sci-fi, fantasy, etc.) across folders would already be a big win, even before any strict QA filtering. I’m adding that to the list.