[D] backprop through beam sampling?
Submitted by SaltyStackSmasher t3_11euzja on March 1, 2023 at 5:10 AM in MachineLearning (13 comments)
jamesvoltage t1_jajjsh3 wrote on March 1, 2023 at 9:55 PM
Reply to comment by Kaleidophon in [D] backprop through beam sampling? by SaltyStackSmasher

The nanoChatGPT repository, extended with Gumbel-softmax: https://github.com/sanjeevanahilan/nanoChatGPT
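For anyone unfamiliar with the trick, here is a rough sketch of the Gumbel-softmax relaxation in plain Python. It is not code from the linked repository; the function names (`sample_gumbel`, `gumbel_softmax`), the example logits, and the temperatures are all made up for illustration. The idea is to add Gumbel noise to the logits and replace the hard argmax with a temperature-scaled softmax, so the "sample" becomes a smooth function of the logits.

```python
import math
import random


def sample_gumbel(rng):
    # Gumbel(0, 1) noise via inverse transform: -log(-log(U)), U ~ Uniform(0, 1).
    u = rng.random()
    # Clamp away from 0 and 1 so neither logarithm receives 0.
    u = min(max(u, 1e-12), 1.0 - 1e-12)
    return -math.log(-math.log(u))


def gumbel_softmax(logits, tau, rng):
    # Perturb each logit with independent Gumbel noise.
    perturbed = [l + sample_gumbel(rng) for l in logits]
    # Temperature-scaled softmax (numerically stabilised by subtracting the max).
    scaled = [p / tau for p in perturbed]
    m = max(scaled)
    exps = [math.exp(s - m) for s in scaled]
    z = sum(exps)
    # Return the relaxed sample and the perturbed logits (the latter for inspection).
    return [e / z for e in exps], perturbed


rng = random.Random(0)          # fixed seed, illustrative only
logits = [2.0, 1.0, 0.1]        # hypothetical token logits

soft, soft_perturbed = gumbel_softmax(logits, tau=1.0, rng=rng)
sharp, sharp_perturbed = gumbel_softmax(logits, tau=0.05, rng=rng)

print("tau=1.0 :", soft)
print("tau=0.05:", sharp)
```

With a low temperature the output gets close to a one-hot vector at the argmax of the perturbed logits (the Gumbel-max sample), while a higher temperature gives a smoother vector. This sketch only shows the forward pass; in an autodiff framework the gradient flows through the softmax, and PyTorch provides the same relaxation as `torch.nn.functional.gumbel_softmax`.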