Submitted by alkaway t3_zkwrix in MachineLearning
I'm training a per-pixel image classification network, which, for each pixel in the image, predicts whether it is a sign for disease A or disease B. Note that a given pixel could be a sign for both disease A and disease B (this is a multi-label problem).
My question is: are the relative probabilities going to be calibrated? In other words, does it make sense to sort the NxNx2 probabilities, or are the probabilities for the two diseases (i.e. channels) not calibrated / comparable, since it is similar to solving two independent problems?
If it matters, I am using a ResNet, some fully-connected layers, and then a convolutional decoder.
Any thoughts will be much appreciated, thanks in advance!
bimtuckboo t1_j02jss1 wrote
Easiest way to find out is to make some calibration plots with your validation set. From there, depending on what the plots look like, there are some things you can do to improve the calibration post training. Look into temperature scaling and Platt scaling.