van Beekveld, M., Caron, S., Hendriks, L., Jackson, P., Leinweber, A., Otten, S., et al. (2021). Combining outlier analysis algorithms to identify new physics at the LHC. J. High Energy Phys., 09(9), 024–33pp.
Abstract: The lack of evidence for new physics at the Large Hadron Collider so far has prompted the development of model-independent search techniques. In this study, we compare the anomaly scores of a variety of anomaly detection techniques: an isolation forest, a Gaussian mixture model, a static autoencoder, and a beta-variational autoencoder (VAE), where we define the reconstruction loss of the latter as a weighted combination of regression and classification terms. We apply these algorithms to the 4-vectors of simulated LHC data, but also investigate the performance when the non-VAE algorithms are applied to the latent space variables created by the VAE. In addition, we assess the performance when the anomaly scores of these algorithms are combined in various ways. Using supersymmetric benchmark points, we find that the logical AND combination of the anomaly scores yielded from algorithms trained in the latent space of the VAE is the most effective discriminator of all methods tested.
|
Caron, S., Ruiz de Austri, R., & Zhang, Z. Y. (2023). Mixture-of-Theories training: can we find new physics and anomalies better by mixing physical theories? J. High Energy Phys., 03(3), 004–37pp.
Abstract: Model-independent search strategies have been increasingly proposed in recent years because on the one hand there has been no clear signal for new physics and on the other hand there is a lack of a highly probable and parameter-free extension of the standard model. For these reasons, there is no simple search target so far. In this work, we try to take a new direction and ask the question: bearing in mind that we have a large number of new physics theories that go beyond the Standard Model and may contain a grain of truth, can we improve our search strategy for unknown signals by using them “in combination”? In particular, we show that a signal hypothesis based on a large, intermingled set of many different theoretical signal models can be a superior approach to find an unknown BSM signal. Applied to a recent data challenge, we show that “mixture-of-theories training” outperforms strategies that optimize signal regions with a single BSM model as well as most unsupervised strategies. Applications of this work include anomaly detection and the definition of signal regions in the search for signals of new physics.
|