Skip to main content
Open Access Publications from the University of California

UC San Diego

UC San Diego Previously Published Works bannerUC San Diego

The LHC Olympics 2020 a community challenge for anomaly detection in high energy physics.

  • Author(s): Kasieczka, Gregor;
  • Nachman, Benjamin;
  • Shih, David;
  • Amram, Oz;
  • Andreassen, Anders;
  • Benkendorfer, Kees;
  • Bortolato, Blaz;
  • Brooijmans, Gustaaf;
  • Canelli, Florencia;
  • Collins, Jack H;
  • Dai, Biwei;
  • De Freitas, Felipe F;
  • Dillon, Barry M;
  • Dinu, Ioan-Mihail;
  • Dong, Zhongtian;
  • Donini, Julien;
  • Duarte, Javier;
  • Faroughy, DA;
  • Gonski, Julia;
  • Harris, Philip;
  • Kahn, Alan;
  • Kamenik, Jernej F;
  • Khosa, Charanjit K;
  • Komiske, Patrick;
  • Le Pottier, Luc;
  • Martín-Ramiro, Pablo;
  • Matevc, Andrej;
  • Metodiev, Eric;
  • Mikuni, Vinicius;
  • Murphy, Christopher W;
  • Ochoa, Inês;
  • Park, Sang Eon;
  • Pierini, Maurizio;
  • Rankin, Dylan;
  • Sanz, Veronica;
  • Sarda, Nilai;
  • Seljak, Urŏ;
  • Smolkovic, Aleks;
  • Stein, George;
  • Suarez, Cristina Mantilla;
  • Szewc, Manuel;
  • Thaler, Jesse;
  • Tsan, Steven;
  • Udrescu, Silviu-Marian;
  • Vaslin, Louis;
  • Vlimant, Jean-Roch;
  • Williams, Daniel;
  • Yunus, Mikaeel
  • et al.

A new paradigm for data-driven, model-agnostic new physics searches at colliders is emerging, and aims to leverage recent breakthroughs in anomaly detection and machine learning. In order to develop and benchmark new anomaly detection methods within this framework, it is essential to have standard datasets. To this end, we have created the LHC Olympics 2020, a community challenge accompanied by a set of simulated collider events. Participants in these Olympics have developed their methods using an R&D dataset and then tested them on black boxes: datasets with an unknown anomaly (or not). Methods made use of modern machine learning tools and were based on unsupervised learning (autoencoders, generative adversarial networks, normalizing flows), weakly supervised learning, and semi-supervised learning. This paper will review the LHC Olympics 2020 challenge, including an overview of the competition, a description of methods deployed in the competition, lessons learned from the experience, and implications for data analyses with future datasets as well as future colliders.

Many UC-authored scholarly publications are freely available on this site because of the UC's open access policies. Let us know how this access is important for you.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View