Otten, S., Rolbiecki, K., Caron, S., Kim, J. S., Ruiz de Austri, R., & Tattersall, J. (2020). DeepXS: fast approximation of MSSM electroweak cross sections at NLO. Eur. Phys. J. C, 80(1), 12–9pp.
Abstract: We present a deep learning solution to the prediction of particle production cross sections over a complicated, high-dimensional parameter space. We demonstrate the applicability by providing state-of-the-art predictions for the production of charginos and neutralinos at the Large Hadron Collider (LHC) at the next-to-leading order in the phenomenological MSSM-19 and explicitly demonstrate the performance for pp ->(chi) over tilde (+)(1)(chi) over tilde (-)(1), (chi) over tilde (0)(2)(chi) over tilde (0)(2) and (chi) over tilde (0)(2)(chi) over tilde (+/-)(1) as a proof of concept which will be extended to all SUSY electroweak pairs. We obtain errors that are lower than the uncertainty from scale and parton distribution functions with mean absolute percentage errors of well below 0.5% allowing a safe inference at the next-to-leading order with inference times that improve the Monte Carlo integration procedures that have been available so far by a factor of O(10(7)) from O(min) to O(mu s) per evaluation.
|
Otten, S., Caron, S., de Swart, W., van Beekveld, M., Hendriks, L., van Leeuwen, C., et al. (2021). Event generation and statistical sampling for physics with deep generative models and a density information buffer. Nat. Commun., 12(1), 2985–16pp.
Abstract: Simulating nature and in particular processes in particle physics require expensive computations and sometimes would take much longer than scientists can afford. Here, we explore ways to a solution for this problem by investigating recent advances in generative modeling and present a study for the generation of events from a physical process with deep generative models. The simulation of physical processes requires not only the production of physical events, but to also ensure that these events occur with the correct frequencies. We investigate the feasibility of learning the event generation and the frequency of occurrence with several generative machine learning models to produce events like Monte Carlo generators. We study three processes: a simple two-body decay, the processes e(+)e(-)-> Z -> l(+)l(-) and pp -> tt<mml:mo><overbar></mml:mover> including the decay of the top quarks and a simulation of the detector response. By buffering density information of encoded Monte Carlo events given the encoder of a Variational Autoencoder we are able to construct a prior for the sampling of new events from the decoder that yields distributions that are in very good agreement with real Monte Carlo events and are generated several orders of magnitude faster. Applications of this work include generic density estimation and sampling, targeted event generation via a principal component analysis of encoded ground truth data, anomaly detection and more efficient importance sampling, e.g., for the phase space integration of matrix elements in quantum field theories. Here, the authors report buffered-density variational autoencoders for the generation of physical events. This method is computationally less expensive over other traditional methods and beyond accelerating the data generation process, it can help to steer the generation and to detect anomalies.
|
Caron, S., Ruiz de Austri, R., & Zhang, Z. Y. (2023). Mixture-of-Theories training: can we find new physics and anomalies better by mixing physical theories? J. High Energy Phys., 03(3), 004–37pp.
Abstract: Model-independent search strategies have been increasingly proposed in recent years because on the one hand there has been no clear signal for new physics and on the other hand there is a lack of a highly probable and parameter-free extension of the standard model. For these reasons, there is no simple search target so far. In this work, we try to take a new direction and ask the question: bearing in mind that we have a large number of new physics theories that go beyond the Standard Model and may contain a grain of truth, can we improve our search strategy for unknown signals by using them “in combination”? In particular, we show that a signal hypothesis based on a large, intermingled set of many different theoretical signal models can be a superior approach to find an unknown BSM signal. Applied to a recent data challenge, we show that “mixture-of-theories training” outperforms strategies that optimize signal regions with a single BSM model as well as most unsupervised strategies. Applications of this work include anomaly detection and the definition of signal regions in the search for signals of new physics.
|
Caron, S., Kim, J. S., Rolbiecki, K., Ruiz de Austri, R., & Stienen, B. (2017). The BSM-AI project: SUSY-AI-generalizing LHC limits on supersymmetry with machine learning. Eur. Phys. J. C, 77(4), 257–25pp.
Abstract: A key research question at the Large Hadron Collider is the test of models of new physics. Testing if a particular parameter set of such a model is excluded by LHC data is a challenge: it requires time consuming generation of scattering events, simulation of the detector response, event reconstruction, cross section calculations and analysis code to test against several hundred signal regions defined by the ATLAS and CMS experiments. In the BSM-AI project we approach this challenge with a new idea. A machine learning tool is devised to predict within a fraction of a millisecond if a model is excluded or not directly from the model parameters. A first example is SUSY-AI, trained on the phenomenological supersymmetric standard model (pMSSM). About 300,000 pMSSM model sets – each tested against 200 signal regions by ATLAS – have been used to train and validate SUSY-AI. The code is currently able to reproduce theATLAS exclusion regions in 19 dimensions with an accuracy of at least 93%. It has been validated further within the constrained MSSM and the minimal natural supersymmetric model, again showing high accuracy. SUSY-AI and its future BSM derivatives will help to solve the problem of recasting LHC results for any model of new physics. SUSY-AI can be downloaded from http://susyai.hepforge.org/. An on-line interface to the program for quick testing purposes can be found at http://www.susy-ai.org/.
|
Caron, S., Gomez-Vargas, G. A., Hendriks, L., & Ruiz de Austri, R. (2018). Analyzing gamma rays of the Galactic Center with deep learning. J. Cosmol. Astropart. Phys., 05(5), 058–24pp.
Abstract: We present the application of convolutional neural networks to a particular problem in gamma ray astronomy. Explicitly, we use this method to investigate the origin of an excess emission of GeV gamma rays in the direction of the Galactic Center, reported by several groups by analyzing Fermi-LAT data. Interpretations of this excess include gamma rays created by the annihilation of dark matter particles and gamma rays originating from a collection of unresolved point sources, such as millisecond pulsars. We train and test convolutional neural networks with simulated Fermi-LAT images based on point and diffuse emission models of the Galactic Center tuned to measured gamma ray data. Our new method allows precise measurements of the contribution and properties of an unresolved population of gamma ray point sources in the interstellar diffuse emission model. The current model predicts the fraction of unresolved point sources with an error of up to 10% and this is expected to decrease with future work.
|