Submitted by mrwafflezzz t3_116nm8c in MachineLearning
Repulsive_Tart3669 t1_j97p211 wrote
I believe a common approach is to use a linear activation function for regression problems unless target variable has certain semantics that suggest the use of other non-linearities (sigmoid, tanh etc.). Also consider rescaling your targets instead of trying to match the desired output with activation functions.
From you description (I might be wrong though), it seems like the 0 output is a special case. In this case you might want to use a binary classifier to classify input samples into two classes first. For class 0 the output is 0. For class 1 you use another model (regressor) that outputs a prediction.
mrwafflezzz OP t1_j99f6kd wrote
The two model approach is the original setup :). I'm just looking for potential alternatives.
Viewing a single comment thread. View all comments