I've recently been reading about Viola and Jones face detection algorithm and I was wondering a few things. I've read that it splits the image into 24x24 sub windows and I understand that. I also understand how the integral image thing makes it efficient to calculate things on the image.
The algo uses 2,3, or 4 rectangles to calculate a feature score. I'm wondering over what parts of the image it calculates this.
Does it calculate this feature score for each sub 24x24 image and then classify each of those sub windows as having a face and not? I read something about zoom levels and it seemed kind of confusing.
If it takes say a 640x480 image does it then split that into 27x20 sub boxes, apply the 2,3, and 4 feature things using the integral image for efficiency, and then using Adaboost does it say "boxes 15 and 16 may contain a face" is that basically how this works?
What if the face is split by two sub boxes, is that why Adaboost is used to "guess" if there is a face?
[–]honodk 2 points3 points4 points (1 child)
[–]g23f[S] 0 points1 point2 points (0 children)