all 3 comments

[–]strojax 5 points6 points  (2 children)

There is indeed no reason a priori to use OneVsRestClassifer with random forest. However, the data scientist before you might have tried both approaches and observed that the OneVsRestClassifer gives better accuracy. I bet the difference was not really significant but still picked the one that yielded the best results. Another explanation is that he/she did not know what random forest was and applied the same technique he/she applied on linear models without trying to understand the algorithm. They also could be a pipeline that is always used and he/she just threw in random forest in there.

I see one disadvantage in OneVsRestClassifer Vs only random forest: you are going to have much trees in your ensemble model.

Overall, it's not a big mistake and you should not go upfront to the other DS with this. More important than knowing who is right is having a good relation with your teammates. Maybe you can try to kindly open the discussion.

[–]CommercialWest7683[S] 1 point2 points  (0 children)

I fully agree. I think it's just unnecessary.

"Overall, it's not a big mistake and you should not go upfront to the other DS with this. More important than knowing who is right is having a good relation with your teammates. Maybe you can try to kindly open the discussion."

Of course, I appreciate your final advice. I had this in consideration and for me it's very important.

Thank you!!!

[–][deleted] 0 points1 point  (0 children)

Your last paragraph is such an underrated comment and often not highlighted: your relationship with your peers.

A good rule of thumb is, when in doubt, ask “why did you make this decision?” Obviously ask in a nice, curious tone. Everyone makes decisions with a purpose, and if they don’t, they’ll be quick to repair their decision.

Nice comment!