Stoppa, F., Bhattacharyya, S., Ruiz de Austri, R., Vreeswijk, P., Caron, S., Zaharijas, G., et al. (2023). AutoSourceID-Classifier Star-galaxy classification using a convolutional neural network with spatial information. Astron. Astrophys., 680, A109–16pp.
Abstract: Aims. Traditional star-galaxy classification techniques often rely on feature estimation from catalogs, a process susceptible to introducing inaccuracies, thereby potentially jeopardizing the classification's reliability. Certain galaxies, especially those not manifesting as extended sources, can be misclassified when their shape parameters and flux solely drive the inference. We aim to create a robust and accurate classification network for identifying stars and galaxies directly from astronomical images.Methods. The AutoSourceID-Classifier (ASID-C) algorithm developed for this work uses 32x32 pixel single filter band source cutouts generated by the previously developed AutoSourceID-Light (ASID-L) code. By leveraging convolutional neural networks (CNN) and additional information about the source position within the full-field image, ASID-C aims to accurately classify all stars and galaxies within a survey. Subsequently, we employed a modified Platt scaling calibration for the output of the CNN, ensuring that the derived probabilities were effectively calibrated, delivering precise and reliable results.Results. We show that ASID-C, trained on MeerLICHT telescope images and using the Dark Energy Camera Legacy Survey (DECaLS) morphological classification, is a robust classifier and outperforms similar codes such as SourceExtractor. To facilitate a rigorous comparison, we also trained an eXtreme Gradient Boosting (XGBoost) model on tabular features extracted by SourceExtractor. While this XGBoost model approaches ASID-C in performance metrics, it does not offer the computational efficiency and reduced error propagation inherent in ASID-C's direct image-based classification approach. ASID-C excels in low signal-to-noise ratio and crowded scenarios, potentially aiding in transient host identification and advancing deep-sky astronomy.
|
Beenakker, W., Caron, S., Kip, J., Ruiz de Austri, R., & Zhang, Z. (2023). New energy spectra in neutrino and photon detectors to reveal hidden dark matter signals. J. High Energy Phys., 11(11), 028–13pp.
Abstract: Neutral particles capable of travelling cosmic distances from a source to detectors on Earth are limited to photons and neutrinos. Examination of the Dark Matter annihilation/decay spectra for these particles reveals the presence of continuum spectra (e.g. due to fragmentation and W or Z decay) and peaks (due to direct annihilations/decays). However, when one explores extensions of the Standard Model (BSM), unexplored spectra emerge that differ significantly from those of the Standard Model (SM) for both neutrinos and photons. In this paper, we argue for the inclusion of important spectra that include peaks as well as previously largely unexplored entities such as boxes and combinations of box, peak and continuum decay spectra.
|
Caron, S., Dobreva, N., Ferrer Sanchez, A., Martin-Guerrero, J. D., Odyurt, U., Ruiz de Austri, R., et al. (2025). Trackformers: in search of transformer-based particle tracking for the high-luminosity LHC era. Eur. Phys. J. C, 85(4), 460–20pp.
Abstract: High-Energy Physics experiments are facing a multi-fold data increase with every new iteration. This is certainly the case for the upcoming High-Luminosity LHC upgrade. Such increased data processing requirements forces revisions to almost every step of the data processing pipeline. One such step in need of an overhaul is the task of particle track reconstruction, a.k.a., tracking. A Machine Learning-assisted solution is expected to provide significant improvements, since the most time-consuming step in tracking is the assignment of hits to particles or track candidates. This is the topic of this paper. We take inspiration from large language models. As such, we consider two approaches: the prediction of the next word in a sentence (next hit point in a track), as well as the one-shot prediction of all hits within an event. In an extensive design effort, we have experimented with three models based on the Transformer architecture and one model based on the U-Net architecture, performing track association predictions for collision event hit points. In our evaluation, we consider a spectrum of simple to complex representations of the problem, eliminating designs with lower metrics early on. We report extensive results, covering both prediction accuracy (score) and computational performance. We have made use of the REDVID simulation framework, as well as reductions applied to the TrackML data set, to compose five data sets from simple to complex, for our experiments. The results highlight distinct advantages among different designs in terms of prediction accuracy and computational performance, demonstrating the efficiency of our methodology. Most importantly, the results show the viability of a one-shot encoder-classifier based Transformer solution as a practical approach for the task of tracking.
|
Caron, S., Garcia Navarro, J. E., Moreno Llacer, M., Moskvitina, P., Rovers, M., Rubio Jimenez, A., et al. (2025). Universal anomaly detection at the LHC: transforming optimal classifiers and the DDD method. Eur. Phys. J. C, 85(4), 415–17pp.
Abstract: In this work, we present a novel approach to transform supervised classifiers into effective unsupervised anomaly detectors. The method we have developed, termed Discriminatory Detection of Distortions (DDD), enhances anomaly detection by training a discriminator model on both original and artificially modified datasets. We conducted a comprehensive evaluation of our models on the Dark Machines Anomaly Score Challenge channels and a search for 4-top quark events, demonstrating the effectiveness of our approach across various final states and beyond the Standard Model scenarios. We compare the performance of the DDD method with the Deep Robust One-Class Classification method (DROCC), which incorporates signals in the training process, and the Deep Support Vector Data Description (DeepSVDD) method, a well-established and well-performing method for anomaly detection. Results show that the effectiveness of each model varies by signal and channel, with DDD proving to be a very effective anomaly detector. We recommend the combined use of DeepSVDD and DDD for purely unsupervised applications, with the addition of flow models for improved performance when resources allow. Findings suggest that network architectures that excel in supervised contexts, such as the particle transformer with standard model interactions, also perform well as unsupervised anomaly detectors. We also show that with these methods, it is likely possible to recognize 4-top quark production as an anomaly without prior knowledge of the process. We argue that the Large Hadron Collider community can transform supervised classifiers into anomaly detectors to uncover potential new physical phenomena in each search.
|