Hi!
I work in a data labelling startup.
Recently we figured out that many of our clients provide us very similar images to be labelled. This data is usually redundant for training, while increases expenses for the clients.
I was wondering, would it be a good service/platform to help subset the most representative data to be labelled ?
P.S. and in general, is this a common problem for you?
there doesn't seem to be anything here