Submitted by alkaway t3_zkwrix in MachineLearning
I'm training a per-pixel image classification network, which, for each pixel in the image, predicts whether it is a sign for disease A or disease B. Note that a given pixel could be a sign for both disease A and disease B (this is a multi-label problem).
My question is: are the relative probabilities going to be calibrated? In other words, does it make sense to sort the NxNx2 probabilities, or are the probabilities for the two diseases (i.e. channels) not calibrated / comparable, since it is similar to solving two independent problems?
If it matters, I am using a ResNet, some fully-connected layers, and then a convolutional decoder.
Any thoughts will be much appreciated, thanks in advance!
pm_me_your_ensembles t1_j01xzcw wrote
The two are not comparable. In a multi-class single-label problem, you do K distinct projections, one for each class, but then they are combined via softmax to give you something that resembles probabilities. Since no such function is applied, it's not possible to compare the two as they don't influence each other in any way.
However, you shouldn't treat whatever a NN outputs as a probability even if it's within [0,1] as NNs are known to be overconfident.