CMUOresama wrote on March 1, 2023 at 8:47 PM
Reply to [D] backprop through beam sampling ? by SaltyStackSmasher
Here's a paper that comes up with a differentiable relaxation of beam search and optimizes it directly to MT metrics as you suggest: https://arxiv.org/abs/1708.00111