[deleted] t1_j7y325j wrote on February 10, 2023 at 5:32 AM

Reply to comment by currentscurrents in [D] Critique of statistics research from machine learning perspectives (and vice versa)? by fromnighttilldawn

[deleted]

currentscurrents t1_j7y4073 wrote on February 10, 2023 at 5:42 AM

>Right now basically all progress is with large models,

You mean all progress... in machine learning. A lot of scientific fields have to make do with far fewer data points.

You can't test a new drug on a million people, especially in early-phase trials. Even outside of medicine, you may have very few samples if you're studying a rare phenomenon.

Statistics gives you tools to draw limited conclusions from small samples, and to measure how meaningful those conclusions actually are.
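To make the small-sample point concrete, here is a minimal sketch of Fisher's exact test on a 2x2 table, written with only the Python standard library. The counts are made up for illustration (a hypothetical 16-person trial), and the function name `fisher_exact_two_sided` is mine, not from any library. It is one common way to get an exact p-value when the sample is too small for large-sample approximations, not the only one.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x in the top-left cell, margins fixed.
        return comb(row1, x) * comb(row2, col1 - x) / total

    p_obs = prob(a)
    lo = max(0, col1 - row2)
    hi = min(row1, col1)
    # Sum the probabilities of every table at least as unlikely as the observed one.
    # The small factor guards against floating-point ties.
    return sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))


# Hypothetical data: 7 of 8 treated patients respond, 2 of 8 controls respond.
p_value = fisher_exact_two_sided(7, 1, 2, 6)
print(f"p = {p_value:.4f}")
```

With only 16 patients the test still returns an exact p-value, and that p-value is the "how meaningful is this" measurement: it says how often a split at least this lopsided would show up by chance if the drug did nothing.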

[deleted] t1_j7y67bi wrote on February 10, 2023 at 6:06 AM

[deleted]

[deleted] t1_j7y9mjs wrote on February 10, 2023 at 6:46 AM

[deleted]

WikiSummarizerBot t1_j7y9nn5 wrote on February 10, 2023 at 6:47 AM

All models are wrong

>All models are wrong is a common aphorism in statistics; it is often expanded as "All models are wrong, but some are useful". The aphorism acknowledges that statistical models always fall short of the complexities of reality but can still be useful nonetheless. The aphorism originally referred just to statistical models, but it is now sometimes used for scientific models in general. The aphorism is generally attributed to the statistician George Box.

psyyduck t1_j7ybb3i wrote on February 10, 2023 at 7:07 AM

Eh. I don’t care enough about this to argue

[deleted] t1_j7ybqh1 wrote on February 10, 2023 at 7:12 AM

[deleted]

Jemimas_witness t1_j7y68en wrote on February 10, 2023 at 6:06 AM

This is only correct for certain problems; like everything, it has its best use cases. When you only have a hammer, everything looks like a nail.

In medicine, the clinical trials that change the field often rely on 2000-3000 patients (data points), and groundbreaking advances in medical practice are often made with simple statistics and simple methods. Go to the New England Journal of Medicine and pick any trial: the weight of its conclusions rests on survival functions, hazard ratios, and chi-squared statistics. Then go look at the funding section: these projects are funded by millions. The only disciplines in medicine with ML-scale numbers of data points are epidemiology and claims-level data, which strays well into econometrics.
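For anyone who hasn't seen one, a survival function of the kind those trials report can be estimated with the Kaplan-Meier product-limit method. Below is a rough sketch in plain Python; the function name `kaplan_meier` and the seven follow-up times are invented for illustration, and a real analysis would use an established statistics package and also report confidence intervals.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where an event was observed.

    times:  follow-up duration for each patient
    events: 1 if the event (e.g. death) was observed, 0 if the patient was censored
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group everyone who shares the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            # Product-limit step: multiply by the fraction surviving time t.
            surv *= 1 - deaths / at_risk
            curve.append((t, surv))
        # Both events and censored patients leave the risk set after time t.
        at_risk -= removed
    return curve


# Hypothetical follow-up times (months) for seven patients.
times = [2, 3, 3, 5, 8, 8, 12]
events = [1, 1, 0, 1, 0, 1, 0]
for t, s in kaplan_meier(times, events):
    print(f"t = {t:>2}: S(t) = {s:.3f}")
```

Censored patients count as at risk up to the time they drop out, which is how the estimator gets information out of incomplete follow-up.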

I myself study rare diseases as well as AI/ML applications in medicine and for some projects I’d be stoked to get 80 patients because there just simply aren’t that many around.

[deleted] t1_j7y84nz wrote on February 10, 2023 at 6:28 AM

[deleted]