Submitted by killver t3_y2vvne in MachineLearning
Some initial benchmarks can be found here: https://www.pugetsystems.com/labs/hpc/NVIDIA-RTX4090-ML-AI-and-Scientific-Computing-Performance-Preliminary-2382/
To me it looks very disappointing, but unfortunately expected given the memory limitations.
da_yu t1_is55aez wrote
We probably need to wait for driver and library updates for Ada specific optimization to get a fair picture (CUDA 12). Tensorflow benchmarks without XLA (in my opinion) should be taken with a grain of salt too.
But if the results stays the same, the improvement (especially fp16) is a disappointment.