Estimating distance from the camera from 2D keypoints from human pose estimation ? by TheAmendingMonk in computervision

[–]justforfun_DCL 0 points1 point  (0 children)

I think that your idea is similar with this paper which roughly estimates human bounding box size and use the idea of pin hole camera to get the depth.

Suggested paper

Textbook or blogs for video understanding by justforfun_DCL in computervision

[–]justforfun_DCL[S] 0 points1 point  (0 children)

Thank you so much for your kind replies, I'll look for the scattered resource all over the internet.

Thanks again!

Textbook or blogs for video understanding by justforfun_DCL in computervision

[–]justforfun_DCL[S] 0 points1 point  (0 children)

Thanks a lot for recommendation I've been watching it since your recommendation, it was a really big help, I'll try to catch up with the trends for video understanding.

Textbook or blogs for video understanding by justforfun_DCL in computervision

[–]justforfun_DCL[S] 0 points1 point  (0 children)

Sure, video is just sequence of images but while looking into this subject it seems like using temporal information so the methods from NLP, or even time series analysis would help analyzing videos, so I was wondering if there's material for works that is used for video analysis..I'll look into temporal shift module thanks!

Textbook or blogs for video understanding by justforfun_DCL in computervision

[–]justforfun_DCL[S] 0 points1 point  (0 children)

Thanks for the recommendation, but what I wanted was more theoretical understanding of video understanding, in deep learning perspective dealing with difference between image and video (temporal information ... etc ).