Submitted by killver t3_y2vvne in MachineLearning
ReginaldIII t1_is8j7x8 wrote
Reply to comment by kajladk in [N] First RTX 4090 ML benchmarks by killver
I would say there's precedent for lucky run benchmark scores. Consider 3dmark as an example.
https://benchmarks.ul.com/hall-of-fame-2/timespy+3dmark+score+performance+preset/version+1.0
All of those runs with different system configurations are peoples luckiest runs.
kajladk t1_is8lqr0 wrote
But isn't this different? We are comparing raw metric (fps, images/sec) with an aggregate score which might already have ways to eliminate or regularize some outlier metrics in-built
ReginaldIII t1_isal2mt wrote
Nvidia and Puget want to report lucky run. Lots of people do this. They're being fully transparent that they are reporting lucky runs. And it makes sense from their perspective to report their best theoretical performance.
It honestly just doesn't bother me to see them doing it because it's very normal and lots of people report this way. Even if we think an average with an error bar would be fairer.
Viewing a single comment thread. View all comments