Viewing a single comment thread. View all comments

suricatasuricata t1_iumxisa wrote

I tried fine tuning gpt2 on 1000 examples on my M1, I think it was supposed to be 10 hours versus 30 minutes on a V100 on Gcloud. Inference was comparable. I think there is a way out by installing the right drivers, but honestly, what would be the point in that for someone who works in industry? I am not planning to run production code trained on a MBP, so might as well develop cloud first.

1