Submitted by nullspace1729 t3_y0dk5c in MachineLearning
I’m looking for a recent (last 5 years) paper that introduces a new e.g. objective function, optimiser or model etc that I can try to implement myself in python/torch/keras. I mainly want to do this to learn new ideas and improve my coding skills.
Do you have any recommendations or alternatively any advice for how to find new interesting papers for someone who isn’t a researcher? I’ve looked on arxiv but I haven’t found what I’m looking for.
Small-Reason-8096 t1_irr6g0q wrote
Hands down the best paper I have ever read (and reimplemented) is the ResNets paper:
https://arxiv.org/abs/1512.03385
The descriptions are clear and concise - but with enough detail to reimplement in whatever framework you like. Also, OOTB the results I got on CIFAR10 matched the paper pretty much perfectly (not always a given!).
Another good paper to try is AWD-LSTM: https://arxiv.org/pdf/1708.02182.pdf
Basically, if you are implementing and training from scratch, focus on something you can train with a smallish dataset in a reasonable period of time. I would generally steer away from LLMs and object detection / segmentation models as they require more resources to train that are commonly available!