An overview of the top ML and data science competition platforms in 2023
7 Mar, 2023
Taking part in ML competitions is a great way to learn and improve your skills. You’ve probably heard of Kaggle — but there are dozens of sites listing ML competitions. In 2022, the total prize pool across all competition sites was over $5m!
For a more structured comparison of various platforms and their features, go to our platform comparison page.
Below we list the top 16 data science competition platforms in 2023 — including Kaggle, DrivenData, AIcrowd, Zindi, CodaLab, and others. Each platform tends to have its own niche. Some have only been around for a year.
Kaggle has the biggest community, DrivenData produce detailed write-ups of winning solutions, CodaLab is open-source and allows anyone to list competitions, and Zindi mostly works with companies and data scientists in Africa.
Kaggle was founded in 2010, and has over 10 million users. As well as listing competitions, it hosts discussion forums, notebooks, datasets, and models. Kaggle hosted 34 official competitions in 2022, as well as many “community” competitions.
Kaggle is still the biggest platform, but there are many others running interesting competitions with significant prize money. Each Kaggle competitions tends to draw over 1,000 competing teams — so competing on another platform can increase your chances of winning!
DrivenData focuses on running competitions with social impact, and has run competitions for NASA and other organisations. Competitions are always followed up by in-depth research reports describing the solutions. It was founded in 2014, and has around 100,000 users.
AIcrowd started as a research project at EPFL in 2017, and is now one of the top five competition platforms. It has hosted several official NeurIPS competitions, and has around 60,000 users.
Zindi has a very active community, and is focused on connecting organisations with data scientists in Africa. Zindi also runs in-person hackathons and community events. It was founded in 2018, and has around 60,000 users.
CodaLab is an open-source competitions platform founded
in 2013, with an instance maintained by Université Paris-Saclay. Anyone can sign up and host or take part in a competition. Free CPU resources
are available for inference, and competition organisers can supplement this with their own hardware.
CodaLab had over 120,000 users as of October 2022.
Tianchi was founded in 2014, as part of Alibaba Cloud, and is the second-biggest global platform after Kaggle (by number of competitions and total prize money). The number of users is not public.
Signate was founded in 2014 and is the largest Japanese-language ML competition platform, with over 85,000 users.
EvalAI was founded in 2017, as part of CloudCV. It has around 30,000 users.
Waymo is a self-driving car company that started as a project within Google. Waymo published the Waymo Open Dataset — intended for autonomous driving research — in 2020, and since then has been running annual competitions on its Open Dataset Challenges site using the dataset. There are usually 4 competitions every year with $22,000 in prizes each, for a total of $88k in annual prizes.
Xeek was founded in 2022 as part of Shell’s Studio X, and has around 3,000 users. Xeek hosted 4 competitions in 2022.
DS works was created by SberCloud in 2021. It is unclear how many users the platform has.
The Makridakis Open Forecasting Center (MOFC) conducts forecasting research and runs leading time-series forecasting competitions.
Microprediction runs ongoing time-series prediction challenges. It paid out around $50,000 in 2022. There are 1,000 users in the Slack group and around 500 live autonomous algorithms making predictions.
MIT’s Battlecode has run an annual programming competition most years since 2003. While this competition can be tackled without ML or data science skills, it is well-suited to reinforcement learning approaches.
Topcoder is a competitive programming site which hosts some data science competitions.
Hugging Face launched its 🤗 Competitions platform in February 2023.
ML Contests is a directory of ongoing machine learning and data science competitions, and lists competitions from all the platforms mentioned on this page. As well as listing competitions, ML Contests also publishes research on competitive machine learning.
Some other sites which host different forms of competitive ML:
In our report on the state of competitive machine learning in 2022, we review the 200+ machine learning competitions that took place in 2022 and give an overview of winning solutions.