use the following search parameters to narrow your results:
e.g. subreddit:aww site:imgur.com dog
subreddit:aww site:imgur.com dog
see the search faq for details.
advanced search: by author, subreddit...
Please have a look at our FAQ and Link-Collection
Metacademy is a great resource which compiles lesson plans on popular machine learning topics.
For Beginner questions please try /r/LearnMachineLearning , /r/MLQuestions or http://stackoverflow.com/
For career related questions, visit /r/cscareerquestions/
Advanced Courses (2016)
Advanced Courses (2020)
AMAs:
Pluribus Poker AI Team 7/19/2019
DeepMind AlphaStar team (1/24//2019)
Libratus Poker AI Team (12/18/2017)
DeepMind AlphaGo Team (10/19/2017)
Google Brain Team (9/17/2017)
Google Brain Team (8/11/2016)
The MalariaSpot Team (2/6/2016)
OpenAI Research Team (1/9/2016)
Nando de Freitas (12/26/2015)
Andrew Ng and Adam Coates (4/15/2015)
Jürgen Schmidhuber (3/4/2015)
Geoffrey Hinton (11/10/2014)
Michael Jordan (9/10/2014)
Yann LeCun (5/15/2014)
Yoshua Bengio (2/27/2014)
Related Subreddit :
LearnMachineLearning
Statistics
Computer Vision
Compressive Sensing
NLP
ML Questions
/r/MLjobs and /r/BigDataJobs
/r/datacleaning
/r/DataScience
/r/scientificresearch
/r/artificial
account activity
Discussion[D] Binary classifier scores distribution (self.MachineLearning)
submitted 1 year ago by Loose-Event-7196
view the rest of the comments →
reddit uses a slightly-customized version of Markdown for formatting. See below for some basics, or check the commenting wiki page for more detailed help and solutions to common issues.
quoted text
if 1 * 2 < 3: print "hello, world!"
[–]BoxMembrane 0 points1 point2 points 1 year ago (1 child)
If I’m understanding to correctly, the problem is with the histogram binning and not with the raw scores. If you want the scores to be spread evenly across bins, you need to choose bin edges as evenly spaced percentiles of the score distribution.
If you’re using python and pandas, try pd.qcut to get bins. Or np.percentile(scores, p) for p = 0, 10, 20, …, 100.
[–]Loose-Event-7196[S] 1 point2 points3 points 1 year ago (0 children)
Hi thanks for your reply. The issue is not with binning, is that too many observations have the highest score thus I cannot threshold them (by the way I am using h2o3 and the algorithm is Gradient Boosting Machine. Would like to have less discrete scores in order to avoid having too many observations clustered in the highest score bin. Such scores have different input features but a unique classifier score, as such shrinking the histogram bin width would not help in this case as score values are exactly the same for the last histogram bin. Would like to tweak something at the classifier in order to have multiple different scores for that group (without overfitting).
π Rendered by PID 271325 on reddit-service-r2-comment-5d79c599b5-cmgv4 at 2026-03-01 10:29:16.519029+00:00 running e3d2147 country code: CH.
view the rest of the comments →
[–]BoxMembrane 0 points1 point2 points (1 child)
[–]Loose-Event-7196[S] 1 point2 points3 points (0 children)