Submitted by hcarlens t3_11kzkla in MachineLearning

I run mlcontests.com, a website that aggregates ML competitions across Kaggle and other platforms.

I've just finished a detailed analysis of 200+ competitions in 2022, and what winners did (we found winning solutions for 67 competitions).

Some highlights:

  • Kaggle still dominant with the most prize money, most competitions, and most entries per competition...
  • ... but there are 10+ other platforms with interesting competitions and decent prize money, and dozens of single-competition sites
  • Almost all competition winners used Python; one used C++, one used R, and one used Java
  • 96% (!) of Deep Learning solutions used PyTorch (up from 77% last year)
  • All winning NLP solutions we found used Transformers
  • Most computer vision solutions used CNNs, though some used Transformer-based models
  • Tabular data competitions were mostly won by GBDTs (gradient-boosted decision trees; mostly LightGBM), though ensembles with PyTorch are common
  • Some winners spent hundreds of dollars on cloud compute for a single training run, others managed to win just using Colab's free tier
  • Winners have largely converged on a common toolkit - PyData stack for the basics, PyTorch for deep learning, LightGBM/XGBoost/CatBoost for GBDTs, Optuna for hyperparameter optimisation (see the sketch after this list)
  • Half of competition winners are first-time winners; a third have won multiple comps before; half are solo winners. Some serial winners won 2-3 competitions just in 2022!
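
To make that toolkit concrete, here's a minimal sketch of the LightGBM + Optuna combination - the dataset, search space, and parameter ranges are illustrative placeholders, not taken from any particular winning solution:

```python
# Minimal sketch of the common toolkit: LightGBM tuned with Optuna on a
# synthetic dataset. Search space and data are illustrative placeholders.
import lightgbm as lgb
import optuna
from sklearn.datasets import make_classification
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=5000, n_features=30, random_state=0)
X_tr, X_val, y_tr, y_val = train_test_split(X, y, test_size=0.2, random_state=0)

def objective(trial):
    params = {
        "objective": "binary",
        "verbosity": -1,
        "num_leaves": trial.suggest_int("num_leaves", 16, 256),
        "learning_rate": trial.suggest_float("learning_rate", 1e-3, 0.3, log=True),
        "min_child_samples": trial.suggest_int("min_child_samples", 5, 100),
    }
    model = lgb.train(params, lgb.Dataset(X_tr, label=y_tr), num_boost_round=200)
    return roc_auc_score(y_val, model.predict(X_val))

study = optuna.create_study(direction="maximize")
study.optimize(objective, n_trials=25)
print(study.best_value, study.best_params)
```

The same pattern works with XGBoost or CatBoost by swapping in their training calls and parameter names.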

Way more details as well as methodology here in the full report: https://mlcontests.com/state-of-competitive-machine-learning-2022?ref=mlc_reddit

[Chart: most common Python packages used by winners]

When I published something similar here last year, I got a lot of questions about tabular data, so I did a deep dive into that this year. People also asked about leaderboard shakeups and compute cost trends, so those are included too. I'd love to hear your suggestions for next year.

I managed to spend way more time on this analysis than last year thanks to the report sponsors (G-Research, a top quant firm, and Genesis Cloud, a renewable-energy cloud compute firm) - if you want to support this research, please check them out. I won't spam you with links here; there's more detail on them at the bottom of the report.

491

Comments


backhanderer t1_jb9ph2v wrote

Thanks for this. I knew PyTorch was dominant but didn’t realise it was this dominant for deep learning!

75

hcarlens OP t1_jb9q3cm wrote

Yeah, not just competitive ML but the research community as a whole seems to have almost entirely switched to PyTorch now (based on the Papers With Code data). I was expecting to see some people using JAX, though!

37

jamesmundy t1_jb9rdku wrote

Really detailed analysis! For the Real Robot Challenge you mentioned (very cool!) were people able to test on the robot before the competition/during training?

12

WirrryWoo t1_jb9v31t wrote

Thank you for the analysis! Any insights on forecasting and time series related problems? Do most solutions use GRUs? Thanks!

16

hcarlens OP t1_jb9vu1f wrote

Yeah that one is really cool! They had an initial competition stage, open to everyone, where evaluation was done in a simulation environment (in software) as opposed to real robots. Competitors were given data from dozens of hours of actual robot interaction which they could use to train their policies.

The teams that qualified there made it through to the real robot stage. At that point they could submit their policies for weekly evaluation on actual robots - so they could have a few practice runs on the actual robots before the final leaderboard run.

8

hcarlens OP t1_jb9woj6 wrote

I found that for a lot of time-series problems, people often treated them as if they were standard tabular/supervised learning problems. There's a separate page of the report which goes into these in detail: https://mlcontests.com/tabular-data?ref=mlc_reddit

For example, for the Kaggle Amex default prediction competition, the data is time-series in the sense that you're given a sequence of customer statements, and then have to predict the probability of them defaulting within a set time period after that. The winner's solution mostly seemed to flatten the features and use LightGBM, but they did use a GRU for part of their final ensemble: https://www.kaggle.com/competitions/amex-default-prediction/discussion/348111
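
As a rough illustration of that flattening approach (the file paths and column names below are hypothetical, not the winner's actual pipeline):

```python
# Sketch: collapse each customer's statement sequence into a single tabular
# row via time aggregations, then train a GBDT. Paths/columns are hypothetical.
import lightgbm as lgb
import pandas as pd

statements = pd.read_parquet("train_statements.parquet")  # customer_ID, statement_date, numeric features
labels = pd.read_csv("train_labels.csv").set_index("customer_ID")  # has a "target" column

num_cols = [c for c in statements.columns if c not in ("customer_ID", "statement_date")]
flat = statements.groupby("customer_ID")[num_cols].agg(["mean", "std", "min", "max", "last"])
flat.columns = ["_".join(col) for col in flat.columns]  # flatten the aggregation MultiIndex

train = flat.join(labels)
model = lgb.train(
    {"objective": "binary", "verbosity": -1},
    lgb.Dataset(train.drop(columns="target"), label=train["target"]),
    num_boost_round=500,
)
```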

The M6 forecasting competition finished recently; I'm looking forward to seeing what their winners did: https://m6competition.com/

13

Svaruz t1_jbadan9 wrote

That’s a lot of work 🫡. Thanks

6

TubasAreFun t1_jbapwmb wrote

JAX does offer some general matrix math that can be more useful/faster than torch alone. I often do deep learning with torch and then use JAX on top to train statistical models (e.g. fusing features from multiple models, raw features, etc. into a single regression/inference)
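
For anyone curious, a minimal sketch of that pattern (all shapes and data made up):

```python
# Minimal sketch: features from an upstream deep model plus raw features,
# fused by a linear head trained with plain JAX gradient descent.
import jax
import jax.numpy as jnp

key = jax.random.PRNGKey(0)
deep_feats = jax.random.normal(key, (256, 128))  # e.g. pooled torch embeddings
raw_feats = jax.random.normal(key, (256, 16))    # e.g. handcrafted features
X = jnp.concatenate([deep_feats, raw_feats], axis=1)
y = jax.random.normal(key, (256,))               # regression target

def loss(w):
    return jnp.mean((X @ w - y) ** 2)

grad_fn = jax.jit(jax.grad(loss))
w = jnp.zeros(X.shape[1])
for _ in range(500):
    w = w - 0.01 * grad_fn(w)
print(loss(w))
```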

16

Lucas_Matheus t1_jbatxn8 wrote

Amazing. This seems like a great way to learn how things are currently being done in ML

10

Alchera_QQ t1_jbbfukn wrote

Can somebody elaborate on the discrepancy between PyTorch and TF?

I keep hearing that Torch is preferred for research and academic purposes, but TF seems to be very close in terms of accuracy and performance.

What makes Torch over an order of magnitude more popular with winners here?

4

senacchrib t1_jbbsgju wrote

This is amazing, thank you! Where do L2R problems fall in your classification? Tabular?

5

NitroXSC t1_jbdp6ge wrote

Interesting meta-study with many remarkable trends.

This is seen from the competitor's side, but what is currently the best website for setting up a simple prediction competition? I'm asking since I'm planning on creating a small competition for students in one of the courses I give (no large files needed).

3

hcarlens OP t1_jbdv11j wrote

Thanks! I didn't create a separate category for learning-to-rank problems because they often overlap with other domains.

For example, some of the conservation competitions (https://mlcontests.com/state-of-competitive-machine-learning-2022/#conservation-competitions) are L2R problems on image data.

Or the Amazon/AIcrowd competitions (https://mlcontests.com/state-of-competitive-machine-learning-2022/#nlp--search) which were L2R with NLP.

In reality the mapping from competition to competition type is almost always one-to-many, and I'm planning on updating the ML Contests website to reflect that!

If I'd had more time and better data I would have sliced the data in multiple different ways to also look into e.g. L2R problems specifically in more depth.

2

hcarlens OP t1_jbdv2vl wrote

Thanks! Yes, but I didn't manage to get as much data as I wanted for the competitions on there. I emailed some of the competition organisers but didn't get a response.

2

hcarlens OP t1_jbdvb7c wrote

Thanks! It depends on exactly what you're planning, but CodaLab (https://codalab.lisn.upsaclay.fr/competitions/) or their new platform CodaBench (https://www.codabench.org/) will probably work well.

They both allow you to run competitions for free, and people do use them for teaching purposes (you'll see some examples if you browse through the list of competitions).

I'm planning on writing a shorter blog post on running competitions soon.

4

ilovekungfuu t1_jbi2xao wrote

Thank you very much u/hcarlens!
This distills so much information.

Do you think that we might be at a plateau in terms of new methods being tried out (say, in the last 24 months)?

3

hcarlens OP t1_jbnreef wrote

Hi! I'm not sure I fully understand your question, but if you're asking whether the rate of progress in competitive ML is slowing down, I think probably not. A lot of the key areas of debate (GBDTs vs NNs for tabular data, CNNs vs Transformers for vision) are still seeing a lot of research, and I expect the competitive ML community to adopt new advances when they happen. Also, in NLP there's a move towards more efficient models, which would also be very useful.

2