Viewing a single comment thread. View all comments

LetterRip t1_j164stx wrote

If slow training is acceptable you can use DeepSpeed with the weights mapped to the NVME drive (DeepSpeed ZeRo Infinity). It will take significantly longer to fine tune, but dramatically lowers the hardware investment.

1