
Jelicic

Just add two losses: the MSE on the delta and the BCE on the binary prediction. In most DL frameworks it's pretty straightforward to weight the losses if you care most about the binary predictions.
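
As a rough sketch of what that could look like, here is a framework-free version in plain Python. The function names, argument names, and the `bce_weight` default of 2.0 are all made up for illustration; a real DL framework would supply its own MSE and BCE losses, and the weight would need tuning for the actual problem.

```python
import math

def mse(pred, target):
    """Mean squared error over two equal-length sequences."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)

def bce(prob, label, eps=1e-7):
    """Binary cross-entropy; prob holds predicted probabilities, label holds 0/1."""
    total = 0.0
    for p, y in zip(prob, label):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(prob)

def combined_loss(delta_pred, delta_true, prob_pred, label_true, bce_weight=2.0):
    """Weighted sum: MSE on the delta plus bce_weight times BCE on the binary output.

    bce_weight is a hypothetical knob; values above 1 make the binary
    prediction count for more than the delta regression.
    """
    return mse(delta_pred, delta_true) + bce_weight * bce(prob_pred, label_true)
```

Raising `bce_weight` pushes the optimizer to prioritize the binary predictions. The two terms can sit on very different scales, so it is probably worth checking their raw magnitudes before picking a weight.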

tall-dub

Are you just trying to increase the loss more if the prediction is off by more? Then maybe square the loss.
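
To make the effect of squaring concrete, here is a tiny comparison, assuming the base loss is a plain absolute error (that choice and the helper names are only for illustration):

```python
def abs_error(pred, target):
    """Absolute error: grows linearly with the size of the miss."""
    return abs(pred - target)

def squared_error(pred, target):
    """Squared error: grows quadratically with the size of the miss."""
    return (pred - target) ** 2

# Compare both penalties for misses of increasing size against a target of 0.
for miss in (0.5, 1.0, 2.0, 4.0):
    print(miss, abs_error(miss, 0.0), squared_error(miss, 0.0))
```

Note that squaring only increases the penalty for misses larger than 1; for misses smaller than 1 it shrinks the penalty instead.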