Viewing a single comment thread. View all comments

CMUOresama t1_jaj8n4d wrote on March 1, 2023 at 8:47 PM

Here's a paper that comes up with a differentiable relaxation of beam search and optimizes it directly to MT metrics as you suggest: https://arxiv.org/abs/1708.00111

SaltyStackSmasher OP t1_jal0r2l wrote on March 2, 2023 at 4:25 AM

thanks a lot for this. will definitely take a look