impossiblefork t1_jch7ker wrote
Reply to comment by currentscurrents in [P] nanoT5 - Inspired by Jonas Geiping's Cramming and Andrej Karpathy's nanoGPT, we fill the gap of a repository for pre-training T5-style "LLMs" under a limited budget in PyTorch by korec1234
Still, it's probably useful for research: validating alternatives to transformers, etc.