Abstract

We describe a technique to learn the underlying structure of collider events directly from the data, without having a particular theoretical model in mind. It allows to infer aspects of the theoretical model that may have given rise to this structure, and can be used to cluster or classify the events for analysis purposes. The unsupervised machine-learning technique is based on the probabilistic (Bayesian) generative model of Latent Dirichlet Allocation. We pair the model with an approximate inference algorithm called Variational Inference, which we then use to extract the latent probability distributions describing the learned underlying structure of collider events. We provide a detailed systematic study of the technique using two example scenarios to learn the latent structure of di-jet event samples made up of QCD background events and either tt¯ or hypothetical W′ → (ϕWW)W signal events.

Details

Title
Learning the latent structure of collider events
Author
Dillon, B M 1 ; Faroughy, D A 2 ; Kamenik, J F 3 ; Szewc, M 4 

 Jožef Stefan Institute, Ljubljana, Slovenia (GRID:grid.11375.31) (ISNI:0000 0001 0706 0012) 
 Universität Zürich, Physik-Institut, Zürich, Switzerland (GRID:grid.7400.3) (ISNI:0000 0004 1937 0650) 
 Jožef Stefan Institute, Ljubljana, Slovenia (GRID:grid.11375.31) (ISNI:0000 0001 0706 0012); University of Ljubljana, Faculty of Mathematics and Physics, Ljubljana, Slovenia (GRID:grid.8954.0) (ISNI:0000 0001 0721 6013) 
 International Center for Advanced Studies (ICAS) and CONICET, UNSAM, San Martín, Argentina (GRID:grid.108365.9) (ISNI:0000 0001 2105 0048) 
Publication year
2020
Publication date
Oct 2020
Publisher
Springer Nature B.V.
e-ISSN
10298479
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2473427114
Copyright
© The Author(s) 2020. This work is published under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.