Submitted by SaltyStackSmasher t3_11euzja in MachineLearning
CMUOresama t1_jaj8n4d wrote
Here's a paper that proposes a differentiable relaxation of beam search and optimizes it directly against MT metrics, as you suggest: https://arxiv.org/abs/1708.00111
SaltyStackSmasher OP t1_jal0r2l wrote
Thanks a lot for this. Will definitely take a look.