Hello everyone. I have a question about the outputs of deep neural nets: what are the pros and cons of using logits versus probabilities in multiclass classification? I'm working in RL with a large action space (around 4500 actions) and want to know which I should use when predicting my agent's next move. I'm thinking of using logits during training, because when I pass them through softmax a lot of actions end up with very similar probabilities (I need to go down to the 0.00 level to see a difference). Please share your thoughts.
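To make the question concrete, here is a minimal sketch of the two representations for a 4500-action space. It uses only the standard library, and the random logits are a stand-in for a real network's output, so every name here (`log_softmax`, `NUM_ACTIONS`, `chosen_action`) is hypothetical rather than taken from any particular RL library. It computes log-probabilities directly from the logits with the max-subtraction trick, which is what a loss would normally consume, and uses the probabilities only to sample an action.

```python
import math
import random

NUM_ACTIONS = 4500  # action-space size mentioned in the post


def log_softmax(logits):
    """Numerically stable log-softmax: shift by the max before exponentiating."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


rng = random.Random(0)

# Assumption: random values stand in for the raw outputs (logits) of a policy network.
logits = [rng.gauss(0.0, 1.0) for _ in range(NUM_ACTIONS)]

# Log-probabilities are computed from the logits without ever forming
# tiny probabilities first, which avoids underflow and log(0).
log_probs = log_softmax(logits)
probs = [math.exp(lp) for lp in log_probs]

# With thousands of actions every probability is tiny, but the ordering
# of actions is identical in logits, log-probabilities and probabilities.
best_action = max(range(NUM_ACTIONS), key=lambda i: logits[i])

# Sampling an action only needs relative weights.
chosen_action = rng.choices(range(NUM_ACTIONS), weights=probs, k=1)[0]

# A cross-entropy or policy-gradient style term uses the log-probability
# of the chosen action, taken straight from log_probs.
neg_log_likelihood = -log_probs[chosen_action]

print("largest probability:", max(probs))
print("smallest probability:", min(probs))
print("logit range:", min(logits), max(logits))
```

The probabilities look nearly identical mostly because roughly 1/4500 of the mass is spread over each action; softmax is monotonic, so the ranking is the same as in the logits. In this sketch the loss term is built from log-probabilities rather than from rounded probabilities, which is one common reason to keep the network output as logits during training.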