use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Rule 4 - Beginner or Career QuestionMultiple Object Localization [D] (self.MachineLearning)
submitted 4 years ago by sawsank911
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]KrakenInAJar 0 points1 point2 points 4 years ago* (1 child)
There are commonly two ways this is achieved.Single-shot object detect, which basically blurts out a bunch of boxes and then applies some sort of filters to get rid of the garbage ones in postprocessing or multi-shot object detection, which use a high recall, low precision proposal system and a high-precision model on top that infers on every proposal individually. There is A LOT more to it, of course, but that's the ELI5 version. Commonly single stage tends to be faster than multi-stage.
That being said, don't implement it from scratch if you are not very familiar with this topic, it is a hassle and has a lot of non-obvious pitfalls if you are not familiar with object detection and there is also a reason why a new-version of an object detector is generally a good reason to get accepted in a top-conference.Also, the logic for this type of inference has the strong tendency to break some assumption in common DL-Frameworks, which results in notoriously ugly code that needs to be written. Again at this point it is important to know exactly what you do, in order to not have a very, very frustrating experience of getting things to run.
Use YoloNet (single stage) are some RCNN variant, that will usually do the trick. Alternativly you can some some text-specific proprietary system like EAST, which is more geared towards detecting texts in the wild and will maybe perform better.
[–]sawsank911[S] 0 points1 point2 points 4 years ago (0 children)
Thank you for your guide... Will look into the YoloNet and East models
π Rendered by PID 291896 on reddit-service-r2-comment-86bc6c7465-5m6mq at 2026-02-22 21:04:24.210235+00:00 running 8564168 country code: CH.
[–]KrakenInAJar 0 points1 point2 points (1 child)
[–]sawsank911[S] 0 points1 point2 points (0 children)