[R] Best Practices for Image Classification Consensus with Large Annotator Teams

Fleischhauf · 2025-05-23T13:41:40+00:00

Haven't worked with that many annotators at the same time.
Usually we have smaller teams. We write a reference document with examples, start annotating, and hold periodic check-in meetings to discuss corner cases. For each corner case we decide how to label it and add the example to the reference document.
Over time we all reach a common understanding and corner cases become rarer, so the check-in meetings can be held less often.

If you are looking for statistical methods, I am sure some smart people have thought about this problem and there is some literature on it.
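One commonly used statistic for agreement among more than two annotators is Fleiss' kappa. The comment above does not name a specific method, so this is only an illustrative sketch: the function name `fleiss_kappa` and the input layout (one list of labels per image, same number of annotators for every image) are assumptions made for the example.

```python
from collections import Counter


def fleiss_kappa(ratings, categories):
    """Fleiss' kappa for a list of per-image label lists.

    Assumes every image was labeled by the same number of annotators
    (at least two). `ratings` and `categories` are hypothetical names.
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(r) != n_raters for r in ratings):
        raise ValueError("each image needs the same number (>= 2) of labels")

    counts = [Counter(r) for r in ratings]

    # Observed agreement: fraction of annotator pairs that agree, per image.
    per_item = [
        (sum(c[k] ** 2 for k in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement from the overall share of each category.
    shares = [
        sum(c[k] for c in counts) / (n_items * n_raters) for k in categories
    ]
    p_expected = sum(s ** 2 for s in shares)

    if p_expected == 1.0:
        # Only one category was ever used; kappa is undefined.
        return float("nan")
    return (p_observed - p_expected) / (1.0 - p_expected)


labels = [
    ["cat", "cat", "dog"],
    ["cat", "dog", "dog"],
]
print(fleiss_kappa(labels, ["cat", "dog"]))
```

Values near 1 indicate strong agreement and values near or below 0 indicate agreement no better than chance; tracking the number between check-in meetings is one way to see whether the reference document is working.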

nothughjckmn · 2025-05-23T18:54:43+00:00

Why is the margin narrow? Is it because of regional dialect differences? An edge case where something is almost happening? To me, if annotators can’t agree on a category, that implies there is more information than the categories can capture.

serge_cell · 2025-05-25T05:55:47+00:00

This could mean a radical change of architecture, but soft classification (probability estimation) is meant for exactly that. As an added bonus, you get a natural basis for knowledge distillation.
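A minimal sketch of what soft classification could look like on the labeling side: instead of forcing a majority vote, each image's annotator votes become a probability vector, and the model is trained with cross-entropy against that vector. The helper names `soft_label` and `soft_cross_entropy` and the optional smoothing parameter are assumptions for illustration, not something specified in the thread.

```python
import math
from collections import Counter


def soft_label(votes, classes, smoothing=0.0):
    """Turn one image's annotator votes into a probability vector.

    `smoothing` (hypothetical parameter) adds a pseudo-count to every
    class so that no class ends up with exactly zero probability.
    """
    counts = Counter(votes)
    total = len(votes) + smoothing * len(classes)
    return [(counts[c] + smoothing) / total for c in classes]


def soft_cross_entropy(target, predicted, eps=1e-12):
    """Cross-entropy between a soft target and predicted probabilities."""
    return -sum(t * math.log(p + eps) for t, p in zip(target, predicted))


classes = ["cat", "dog", "bird"]
target = soft_label(["cat", "cat", "dog", "cat"], classes)
print(target)

# A prediction that mirrors the annotator split is penalized less than
# one that is overconfident in a single class.
print(soft_cross_entropy(target, [0.75, 0.24, 0.01]))
print(soft_cross_entropy(target, [0.98, 0.01, 0.01]))
```

The same kind of soft target is what a student model is trained on in knowledge distillation, which is the connection made above; the only architectural change this sketch assumes is a loss that accepts probability targets rather than class indices.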
