[P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch Submitted by korec1234 t3_11t1857 on March 16, 2023 at 5:53 PM in MachineLearning 25 comments 258
[deleted] t1_jcj7qap wrote on March 17, 2023 at 5:44 AM Reply to comment by impossiblefork in [P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch by korec1234 20 dollar models are popping up: https://www.mosaicml.com/blog/mosaicbert Holy cow! Permalink Parent 7
Viewing a single comment thread. View all comments