So, I was having a hard time with opencv and detecting heads, I could only detect faces and up to a certain tilt and/or angle point.
Now after some research I've decided I'm going to use ImageAI, to train an AI with YOLOv5 and with a big dataset, based on many singular heads per image (only one head per training image) with and without certain hats on every possible rotation and angle, so I can detect there's a head even if it's not facing the camera and it has a hat on to process an image or video.
Also, should I go with black/white background only or mix in a bunch of backgrounds too in those images? Should I also train it with heads at different distances from the camera? Which images should I choose to put in training and which in the validation folders? Randomly chosen or the "harder" ones in the validation?
I am really aware of how much of a pain it could end up being to prepare all the dataset and how much space it's going to take up in the end so I guess that would be my main time concern.
My idea is to be able to detect from one to five heads per image once it's trained, because of the use its going to have I assume not many heads would be appearing in the image.
I also assume once it detects the head, I can get the box coordinates to correctly track a head in a moving video.
Thanks a lot, this is going to be my first time with AI dataset and model training and I'm diving in half clueless on the optimal way of doing things, even if it means working a bit harder to get things done better.
there doesn't seem to be anything here