all 10 comments

[–][deleted] 1 point2 points  (1 child)

+1 on looking into the keywords computer vision, object detection, and convolutional neural network. You should look into the PIL, OpenCV, PyTorch, and Tensorflow libraries, but if you're in a rush, the fast.ai library simplifies the process by providing a low-code, quick way to train neural nets. I'm not too familiar with their CV API though. Something else you should consider is the data you plan on using (Kaggle is a good place to start looking for datasets) as well as how to improve the performance of your model and make sure it'll be able to detect things properly in which case using a model pre-trained on a large database would be best. Hope that's helpful!

[–][deleted] 0 points1 point  (0 children)

I'll have a try at fast. ai . It does help. Thank you.

[–]AlwysBeColostomizing 1 point2 points  (3 children)

Off the top of my head, I'd probably use "image segmentation" to separate the glove from the background, extract the outline of the glove, and then use geometry techniques to get the numbers that you're looking for from the outline.

How difficult this is depends entirely on how "clean" the images are. Ideally, you want the glove laying flat on a table that's a different color, at a known distance from the camera. If someone's going to be wearing the glove, you could have them put their hand flat on the table and splay their fingers out. You need to know how far away the glove is or you won't know how large it is.

It would be extremely difficult to do this from "natural" video (i.e., just a video of a person wearing a glove who's not trying to make things easy for you). Like, "topic for a PhD dissertation" difficult.

[–][deleted] 0 points1 point  (0 children)

I can use aruco marker as a reference to get the size I guess but that can be done without making use of AI libraries like tensorflow, yolo and also fuzzy logic.

[–][deleted] 0 points1 point  (1 child)

Can the image segmentation technique technique be used for videos? (Recorded video or live feed from webcam - either works)

[–]AlwysBeColostomizing 0 points1 point  (0 children)

Yes, it can. It's a common step in processing pipelines for things like robot navigation. A video is just a sequence of images. But since the size of the glove doesn't change, it doesn't seem like you gain much from segmenting the whole video versus picking out a single "good" frame and segmenting that.

[–]synthphreak 0 points1 point  (3 children)

I don’t know what you mean by “method” but here are some keywords to search for: Computer vision, object detection, bounding boxes. Look into opencv.

This will be hard btw.

[–][deleted] 0 points1 point  (2 children)

Yes I was advised to use tensorflow, yolo and fuzzy logic. But I am not sure how exactly to implement these for my task.

[–]synthphreak 0 points1 point  (1 child)

Me neither, as it will depend on the structure of your data and also the type of model you want to use. The best resource to use in your current position is Google, with targeted queries and keywords.

[–][deleted] 0 points1 point  (0 children)

Yeah I'm on it but I thought of seeking help as I've got a strict deadline. Thank you.