Viewing a single comment thread. View all comments

emad_eldeen t1_ivw9rd5 wrote

There's no rule of thumb, but usually, you use less learning rate in fine-tuning than the one used in pretraining.

1