Thanks to all Participants!

  • DATE Jan 14 - Feb 10, 2018 23:00 UTC

Challenge outline

The event brings data competitions to the grand stage for people from all walks of life: PhD holders, company teams, students, and new data scientists. It is open to teams and individuals alike.

The event aims to bridge the gap between the ever-increasing complexity of Machine Learning models and the performance bottlenecks of the industry. Participants will strive not only to maximize the quality of their predictions, but also to devise resource-efficient algorithms.

This will be a team machine learning competition, divided into two stages.

The first stage will be online, open to all participants.
The second stage will be the on-site finals, in which the 30 top-performing teams from the online round will compete at the Yandex office in Moscow.

About Stage I

There will be two separate tracks during the online stage. From the machine learning perspective, the tracks will be similar, yet the restrictions put on the solutions are different for each track.

The first track will be a traditional data science competition. Given a labeled training data set, participants will be asked to make predictions for the test data and submit them to the leaderboard. In this track, participants can produce arbitrarily complex models. If you like to use 4-level stacking or deep neural networks, this is the right track for you – you will only need to submit test predictions. However, those who qualify for the finals will be asked to submit the full code of their solution for validation by the judges.

About Stage II

In the second track, your task will be to solve the same problem as in track one, but with tight restrictions on the time and memory used during both learning and inference. You will need to upload the end-to-end code for your solution, covering both learning and inference. The evaluation server will run training and testing for your model and report the result. Both learning and inference must fit within the time and memory constraints. If you like the most efficient solutions, this is the right track for you.
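To get a feel for the second track's constraints before uploading, you could time a run locally and record its peak memory. The following is only a minimal sketch in Python, not the evaluation server's actual procedure: the function names and both limits are hypothetical placeholders (the real limits are not stated here), and `tracemalloc` only tracks allocations made through Python's memory allocator, so memory used by native extensions may be under-reported.

```python
import time
import tracemalloc


def run_within_budget(fn, time_limit_s, memory_limit_bytes):
    """Run fn() and report whether it stayed within the given budgets.

    Both limits are hypothetical placeholders chosen by the caller;
    they are not the competition's real constraints.
    """
    tracemalloc.start()
    started = time.perf_counter()
    try:
        result = fn()
        elapsed_s = time.perf_counter() - started
        # get_traced_memory() returns (current, peak) in bytes.
        _, peak_bytes = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    within = elapsed_s <= time_limit_s and peak_bytes <= memory_limit_bytes
    return result, elapsed_s, peak_bytes, within


def toy_training():
    # Stand-in for a real training step: builds a list and sums it.
    data = list(range(100_000))
    return sum(data)


result, elapsed_s, peak_bytes, within = run_within_budget(
    toy_training,
    time_limit_s=60.0,                      # assumed limit, for illustration
    memory_limit_bytes=512 * 1024 * 1024,   # assumed limit, for illustration
)
print(f"result={result} elapsed={elapsed_s:.3f}s peak={peak_bytes}B ok={within}")
```

Replacing `toy_training` with your own training and inference functions gives a rough local estimate; the numbers reported by the evaluation server may differ.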

We hope that the two tracks will make the Olympiad fascinating for both machine learning competition experts and competitive programming masters, Kaggle winners and ACM champions, as well as everyone eager to solve real-world problems with data.
Moreover, we encourage people with different backgrounds, ML and ACM, to team up and push Data Analysis to new frontiers.

The first stage of IDAO has already begun, with over 500 teams from 29 different countries. However, registration remains open throughout the online competition, until February 11. For a strong team, two weeks is enough to cope with the online task.


  • Each team consists of 1-3 members.
  • Each team participant must register.
  • Only participants who have reached the age of 18 before the start of the on-site finals can participate.
  • Each team can submit only one solution.


At the finals, participants will need to use their own computers. Use of any legal software is allowed.


Three prizes will be awarded in the final round: one for the winning team and two for the runners-up.

Where and when

Jan 15 - Feb 11, 2018