Submitted by 51616 t3_yt6slt in MachineLearning
vwings t1_iw857q2 wrote
Reply to comment by machinelearner77 in Relative representations enable zero-shot latent space communication by 51616
Yes, sure, you can backprop. What I meant is that you are apparently able to train a network reasonably well with this, even though in the backward pass the gradient gets diluted across all anchor samples. I thought you would need at least softmax attention in the forward pass to route the gradients back reasonably.
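To make the contrast concrete, here is a rough NumPy sketch of what I have in mind. It is not the paper's code. The anchor count, dimensions, temperature, and all function names are made up for illustration, and I am assuming the relative representation is just the vector of cosine similarities to the anchors. It compares the gradient arriving at each similarity score when the scores are used directly versus when they first pass through a softmax.

```python
import numpy as np

def cosine_similarities(x, anchors):
    """Assumed relative representation: cosine similarity of x to each anchor."""
    x_unit = x / np.linalg.norm(x)
    a_unit = anchors / np.linalg.norm(anchors, axis=1, keepdims=True)
    return a_unit @ x_unit

def softmax(s, temperature=1.0):
    """Numerically stable softmax over scores s / temperature."""
    z = (s - s.max()) / temperature
    e = np.exp(z)
    return e / e.sum()

def grad_plain(upstream):
    """Scores are the output, so every anchor gets its upstream gradient unchanged."""
    return upstream.copy()

def grad_softmax(s, upstream, temperature=1.0):
    """Gradient w.r.t. the scores through softmax: gated by the weights p_i."""
    p = softmax(s, temperature)
    return p * (upstream - p @ upstream) / temperature

# Hypothetical setup: 8 anchors in 16 dimensions, sample close to anchor 3.
rng = np.random.default_rng(0)
anchors = rng.normal(size=(8, 16))
x = anchors[3] + 0.1 * rng.normal(size=16)
upstream = rng.normal(size=8)  # stand-in for dL/d(output) from later layers

s = cosine_similarities(x, anchors)
g_plain = grad_plain(upstream)
g_soft = grad_softmax(s, upstream, temperature=0.05)

print("similarities      :", np.round(s, 3))
print("plain grad/anchor :", np.round(g_plain, 3))
print("softmax grad/anchor:", np.round(g_soft, 3))
```

In the plain case every anchor receives whatever the upstream gradient is, regardless of how similar it is to the sample, which is the dilution I mean. In the softmax case the gradient at anchor i is scaled by its attention weight p_i, so dissimilar anchors are mostly switched off.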