[D] What kinds of interesting models can I train with just an RTX 4080? Submitted by faker10101891 t3_10cxuo2 on January 15, 2023 at 11:02 PM in MachineLearning 10 comments 1
DaLameLama t1_j4mamhy wrote on January 16, 2023 at 6:50 PM Relevant: https://arxiv.org/abs/2212.14034 >Cramming: Training a Language Model on a Single GPU in One Day Permalink 3
Viewing a single comment thread. View all comments