|
Khosa, C. K., Sanz, V., & Soughton, M. (2022). A simple guide from machine learning outputs to statistical criteria in particle physics. SciPost Phys. Core, 5(4), 050–31pp.
Abstract: In this paper we propose ways to incorporate Machine Learning training outputs into a study of statistical significance. We describe these methods in supervised classification tasks using a CNN and a DNN output, and unsupervised learning based on a VAE. As use cases, we consider two physical situations where Machine Learning are often used: high-pT hadronic activity, and boosted Higgs in association with a massive vector boson.
|
|
|
Garcia Navarro, J. E., Fernandez-Prieto, L. M., Villaseñor, A., Sanz, V., Ammirati, J. B., Diaz Suarez, E. A., et al. (2022). Performance of Deep Learning Pickers in Routine Network Processing Applications. Seismol. Res. Lett., 93, 2529–2542.
Abstract: Picking arrival times of P and S phases is a fundamental and time‐consuming task for the routine processing of seismic data acquired by permanent and temporary networks. A large number of automatic pickers have been developed, but to perform well they often require the tuning of multiple parameters to adapt them to each dataset. Despite the great advance in techniques, some problems remain, such as the difficulty to accurately pick S waves and earthquake recordings with a low signal‐to‐noise ratio. Recently, phase pickers based on deep learning (DL) have shown great potential for event identification and arrival‐time picking. However, the general adoption of these methods for the routine processing of monitoring networks has been held back by factors such as the availability of well‐documented software, computational resources, and a gap in knowledge of these methods. In this study, we evaluate recent available DL pickers for earthquake data, comparing the performance of several neural network architectures. We test the selected pickers using three datasets with different characteristics. We found that the analyzed DL pickers (generalized phase detection, PhaseNet, and EQTransformer) perform well in the three tested cases. They are very efficient at ignoring large‐amplitude transient noise and at picking S waves, a task that is often difficult even for experienced analysts. Nevertheless, the performance of the analyzed DL pickers varies widely in terms of sensitivity and false discovery rate, with some pickers missing a significant percentage of true picks and others producing a large number of false positives. There are also variations in run time between DL pickers, with some of them requiring significant resources to process large datasets. In spite of these drawbacks, we show that DL pickers can be used efficiently to process large seismic datasets and obtain results comparable or better than current standard procedures.
|
|
|
Hirn, J., Garcia, J. E., Montesinos-Navarro, A., Sanchez-Martin, R., Sanz, V., & Verdu, M. (2022). A deep Generative Artificial Intelligence system to predict species coexistence patterns. Methods Ecol. Evol., 13, 1052–1061.
Abstract: Predicting coexistence patterns is a current challenge to understand diversity maintenance, especially in rich communities where these patterns' complexity is magnified through indirect interactions that prevent their approximation with classical experimental approaches. We explore cutting-edge Machine Learning techniques called Generative Artificial Intelligence (GenAI) to predict species coexistence patterns in vegetation patches, training generative adversarial networks (GAN) and variational AutoEncoders (VAE) that are then used to unravel some of the mechanisms behind community assemblage. The GAN accurately reproduces real patches' species composition and plant species' affinity to different soil types, and the VAE also reaches a high level of accuracy, above 99%. Using the artificially generated patches, we found that high-order interactions tend to suppress the positive effects of low-order interactions. Finally, by reconstructing successional trajectories, we could identify the pioneer species with larger potential to generate a high diversity of distinct patches in terms of species composition. Understanding the complexity of species coexistence patterns in diverse ecological communities requires new approaches beyond heuristic rules. Generative Artificial Intelligence can be a powerful tool to this end as it allows to overcome the inherent dimensionality of this challenge.
|
|
|
Cepedello, R., Esser, F., Hirsch, M., & Sanz, V. (2024). Fermionic UV models for neutral triple gauge boson vertices. J. High Energy Phys., 07(7), 275–28pp.
Abstract: Searches for anomalous neutral triple gauge boson couplings (NTGCs) provide important tests for the gauge structure of the standard model. In SMEFT (“standard model effective field theory”) NTGCs appear only at the level of dimension-8 operators. While the phenomenology of these operators has been discussed extensively in the literature, renormalizable UV models that can generate these operators are scarce. In this work, we study a variety of extensions of the SM with heavy fermions and calculate their matching to d = 8 NTGC operators. We point out that the complete matching of UV models requires four different CP-conserving d = 8 operators and that the single CPC d = 8 operator, most commonly used by the experimental collaborations, does not describe all possible NTGC form factors. Despite stringent experimental constraints on NTGCs, limits on the scale of UV models are relatively weak, because their contributions are doubly suppressed (being d = 8 and 1-loop). We suggest a series of benchmark UV scenarios suitable for interpreting searches for NTGCs in the upcoming LHC runs, obtain their current limits and provide estimates for the expected sensitivity of the high-luminosity LHC.
|
|
|
Conde, D., Castillo, F. L., Escobar, C., García, C., Garcia Navarro, J. E., Sanz, V., et al. (2023). Forecasting Geomagnetic Storm Disturbances and Their Uncertainties Using Deep Learning. Space Weather, 21(11), e2023SW003474–27pp.
Abstract: Severe space weather produced by disturbed conditions on the Sun results in harmful effects both for humans in space and in high-latitude flights, and for technological systems such as spacecraft or communications. Also, geomagnetically induced currents (GICs) flowing on long ground-based conductors, such as power networks, potentially threaten critical infrastructures on Earth. The first step in developing an alarm system against GICs is to forecast them. This is a challenging task given the highly non-linear dependencies of the response of the magnetosphere to these perturbations. In the last few years, modern machine-learning models have shown to be very good at predicting magnetic activity indices. However, such complex models are on the one hand difficult to tune, and on the other hand they are known to bring along potentially large prediction uncertainties which are generally difficult to estimate. In this work we aim at predicting the SYM-H index characterizing geomagnetic storms multiple-hour ahead, using public interplanetary magnetic field (IMF) data from the Sun-Earth L1 Lagrange point and SYM-H data. We implement a type of machine-learning model called long short-term memory (LSTM) network. Our scope is to estimate the prediction uncertainties coming from a deep-learning model in the context of forecasting the SYM-H index. These uncertainties will be essential to set reliable alarm thresholds. The resulting uncertainties turn out to be sizable at the critical stages of the geomagnetic storms. Our methodology includes as well an efficient optimization of important hyper-parameters of the LSTM network and robustness tests.
|
|