Submitted by nullspace1729 t3_y0dk5c in MachineLearning
TheInfelicitousDandy t1_irsfw1a wrote
Reply to comment by Small-Reason-8096 in [D] Recent ML papers to implement from scratch by nullspace1729
I've tried to reimplement AWD-LSTM in PyTorch > 1.0 and have never been able to get close to the original results. I've also seen other people try and fail to get close. I'm pretty sure it has to do with the weight dropout they used.
If anyone knows of a PyTorch > 1.0 version that achieves the same perplexity (PPL) on PTB/WikiText-2, I'd very much like to know.
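For context, the weight dropout in question is DropConnect applied to the recurrent (hidden-to-hidden) weight matrices, with the mask sampled once per forward pass and reused at every timestep. Below is a minimal, library-free sketch of that idea on a toy linear recurrence. It is not the original PyTorch code: the names `drop_weights` and `run_rnn` are hypothetical, and the inverted-dropout rescaling by 1/(1-p) is assumed here.

```python
import random


def drop_weights(weight, p, rng):
    """Return a copy of `weight` with each entry zeroed with probability p.

    Surviving entries are scaled by 1 / (1 - p) (inverted dropout, an
    assumption of this sketch) so the expected value is unchanged.
    """
    if not 0.0 <= p < 1.0:
        raise ValueError("p must be in [0, 1)")
    scale = 1.0 / (1.0 - p)
    return [[0.0 if rng.random() < p else w * scale for w in row]
            for row in weight]


def run_rnn(inputs, w_hh, p, rng, training=True):
    """Toy linear recurrence h_t = W_hh @ h_{t-1} + x_t.

    The dropped weight matrix is sampled ONCE per sequence and reused at
    every timestep; at evaluation time the full matrix is used.
    """
    w = drop_weights(w_hh, p, rng) if training else w_hh
    size = len(w_hh)
    h = [0.0] * size
    for x in inputs:
        h = [sum(w[i][j] * h[j] for j in range(size)) + x[i]
             for i in range(size)]
    return h, w
```

A detail to check in any reimplementation is that the mask really is resampled on each training forward pass and that the unmasked weights are the ones receiving gradient updates. If the masked copy is silently cached or never refreshed, the regularization effect changes.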
Small-Reason-8096 t1_irzvwc8 wrote
That surprises me, as there was a good fastai version:
https://docs.fast.ai/text.models.awdlstm.html
which is built on PyTorch. When I played with it ages ago the results seemed comparable to the paper, but I haven't revisited it for a while :)
TheInfelicitousDandy t1_is0ajet wrote
As far as I know, that version doesn't give comparable PPL.
Someone else reports the same: https://github.com/salesforce/awd-lstm-lm/issues/86#issuecomment-453266265
A major issue here (and for other reproductions) is people claiming they have a reproduction because the code runs without errors, without ever actually getting the same results.
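To make "the same results" concrete: perplexity is the exponential of the average negative log-likelihood per token, so a reproduction can be checked by computing it over the full test set and comparing against the paper's number. A minimal sketch, assuming the evaluation loop yields a summed loss in nats and a token count for each batch (the function name and input format are hypothetical):

```python
import math


def perplexity(batches):
    """Compute corpus-level perplexity.

    `batches` is an iterable of (summed_nll_in_nats, num_tokens) pairs.
    Losses are pooled over all tokens before exponentiating, so batches
    of different sizes are weighted correctly.
    """
    total_nll = 0.0
    total_tokens = 0
    for nll_sum, num_tokens in batches:
        total_nll += nll_sum
        total_tokens += num_tokens
    if total_tokens == 0:
        raise ValueError("no tokens to evaluate")
    return math.exp(total_nll / total_tokens)
```

Averaging per-batch mean losses instead of pooling over tokens gives a different number when batches have unequal lengths, which is one easy way for two runs to report perplexities that aren't actually comparable.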