Viewing a single comment thread. View all comments

EverythingIsTaken61 t1_ivg0kwj wrote

With some tabular data I think most models will outperform humans as long as the context (meaning of each variable) is unknown. If the human knew what the task was about, then they might get an advantage, but that wouldn't really be the same amount of data?

For sensory data, I don't think it's easily compared because we already have experiences in life + in our DNA.

10

gwern t1_ivho3vx wrote

I'd predict the opposite: 'tabular data' of the usual sort will yield bad human performance. See the clinical prediction literature going back to Paul Meehl: given some tabular data and asked to predict stuff like disease progression or recidivism risk, the expert human will often underperform a simple linear model, never mind 'real' tabular ML. We're really good at stuff like images, yes, but give us a CSV and ask us to predict housing prices in Boston in 1970...

4