TiredOldCrow t1_its89ot wrote on October 25, 2022 at 10:34 PM

Since you're using different pre-trained VGG16 models as a starting point, you may just be demonstrating that the PyTorch torchvision model is more amenable to your combination of hyperparameters than the TensorFlow one.

Ideally for this kind of comparison you'd use the exact same pretrained model architecture+weights as a starting point. Maybe look for a set of weights that has been ported to both PyTorch and TensorFlow?
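If no ported weights are available, converting them by hand is mostly a matter of tensor layout. Here is a minimal NumPy sketch, assuming the usual conventions: PyTorch stores conv kernels as (out_channels, in_channels, kH, kW) and linear weights as (out, in), while Keras stores conv kernels as (kH, kW, in_channels, out_channels) and dense kernels as (in, out). The helper names are hypothetical, not part of either library.

```python
import numpy as np

# Hypothetical helpers; names are illustrative, not from PyTorch or Keras.

def torch_conv_to_keras(w):
    """(out_ch, in_ch, kH, kW) -> (kH, kW, in_ch, out_ch)."""
    return np.transpose(w, (2, 3, 1, 0))

def keras_conv_to_torch(w):
    """(kH, kW, in_ch, out_ch) -> (out_ch, in_ch, kH, kW)."""
    return np.transpose(w, (3, 2, 0, 1))

def torch_linear_to_keras(w):
    """(out_features, in_features) -> (in_features, out_features)."""
    return w.T

# Stand-in for the first VGG16 conv kernel: 64 filters, 3 input channels, 3x3.
w_torch = np.arange(64 * 3 * 3 * 3, dtype=np.float32).reshape(64, 3, 3, 3)
w_keras = torch_conv_to_keras(w_torch)
w_back = keras_conv_to_torch(w_keras)
```

One caveat: the first fully connected layer after the flatten step also needs its input rows permuted, because flattening a channels-first tensor and a channels-last tensor orders the features differently.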

seba07 t1_ittpya9 wrote on October 26, 2022 at 6:23 AM

Or otherwise, don't use a pre-trained network for this test. PyTorch's random initialization shouldn't be any better than TensorFlow's.

aleguida OP t1_itvgiu8 wrote on October 26, 2022 at 4:33 PM

Thanks for the feedback. I tried retraining everything from scratch without downloading any pretrained weights. Here are the updated Colab links.

While PyTorch is learning something, TF is not learning anything. This is quite confusing, as I used tf.keras to minimize any possible error on my part. Next, I will try to build the same network from scratch in both frameworks.
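One thing worth checking when rebuilding the network layer by layer is the default weight initialization, which differs between the frameworks. The sketch below assumes the layer defaults as I understand them: `nn.Conv2d` draws weights uniformly within ±1/sqrt(fan_in), and Keras `Conv2D` uses `glorot_uniform` with limit sqrt(6/(fan_in + fan_out)). The helper names are hypothetical, and torchvision's own VGG applies a separate initialization scheme, so this only covers hand-built layers.

```python
import math

def conv_fans(in_ch, out_ch, k):
    """Fan-in and fan-out of a k x k convolution."""
    receptive_field = k * k
    return in_ch * receptive_field, out_ch * receptive_field

def torch_default_conv_bound(fan_in):
    # Assumed nn.Conv2d default: uniform in [-1/sqrt(fan_in), 1/sqrt(fan_in)].
    return 1.0 / math.sqrt(fan_in)

def keras_glorot_uniform_limit(fan_in, fan_out):
    # Assumed Keras Conv2D default (glorot_uniform).
    return math.sqrt(6.0 / (fan_in + fan_out))

# First VGG16 conv layer: 3 input channels, 64 filters, 3x3 kernel.
fan_in, fan_out = conv_fans(3, 64, 3)
torch_bound = torch_default_conv_bound(fan_in)
keras_limit = keras_glorot_uniform_limit(fan_in, fan_out)
print(f"PyTorch bound: {torch_bound:.4f}, Keras limit: {keras_limit:.4f}")
```

If the two ranges differ noticeably for a given layer, setting the same initializer explicitly in both frameworks removes one variable from the comparison.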
