|
Fernandez Casani, A., Orduña, J. M., Sanchez, J., & Gonzalez de la Hoz, S. (2021). A Reliable Large Distributed Object Store Based Platform for Collecting Event Metadata. J. Grid Comput., 19(3), 39–19pp.
Abstract: The Large Hadron Collider (LHC) is about to enter its third run at unprecedented energies. The experiments at the LHC face computational challenges with enormous data volumes that need to be analysed by thousands of physics users. The ATLAS EventIndex project, currently running in production, builds a complete catalogue of particle collisions, or events, for the ATLAS experiment at the LHC. The distributed nature of the experiment data model is exploited by running jobs at over one hundred Grid data centers worldwide. Millions of files with petabytes of data are indexed, extracting a small quantity of metadata per event, which is conveyed with a data collection system in real time to a central Hadoop instance at CERN. After a successful first implementation based on a messaging system, some issues suggested performance bottlenecks for the challenging higher rates in the next runs of the experiment. In this work we characterize the weaknesses of the previous messaging system, regarding complexity, scalability, performance and resource consumption. A new approach based on an object-based storage method was designed and implemented, taking into account the lessons learned and leveraging the ATLAS experience with this kind of system. We present the experiment that we ran for three months in the real production scenario worldwide, in order to evaluate the messaging and object store approaches. The results of the experiment show that the new object-based storage method can efficiently support large-scale data collection for big data environments like the next runs of the ATLAS experiment at the LHC.
|
|
|
Fernandez Casani, A., Garcia Montoro, C., Gonzalez de la Hoz, S., Salt, J., Sanchez, J., & Villaplana Perez, M. (2023). Big Data Analytics for the ATLAS EventIndex Project with Apache Spark. Comput. Math. Methods, 2023, 6900908–19pp.
Abstract: The ATLAS EventIndex was designed to provide a global event catalogue and limited event-level metadata for the ATLAS experiment at the Large Hadron Collider (LHC) and its analysis groups and users during Run 2 (2015-2018) and has been running in production since. LHC Run 3, which started in 2022, has seen increased data-taking and simulation production rates, with which the current infrastructure would still cope but may be stretched to its limits by the end of Run 3. A new core storage service is being developed in HBase/Phoenix, and there is work in progress to provide at least the same functionality as the current one for increased data ingestion and search rates and with increasing volumes of stored data. In addition, new tools are being developed for solving the needed access cases within the new storage. This paper describes a new tool using Spark and implemented in Scala for accessing the big data quantities of the EventIndex project stored in HBase/Phoenix. With this tool, we can offer data discovery capabilities at different granularities, providing Spark Dataframes that can be used or refined within the same framework. Data analytic cases of the EventIndex project are implemented, like the search for duplicates of events from the same or different datasets. An algorithm and implementation for the calculation of overlap matrices of events across different datasets are presented. Our approach can be used by other higher-level tools and users, to ease access to the data in a performant and standard way using Spark abstractions. The provided tools decouple data access from the actual data schema, which makes it convenient to hide complexity and possible changes in the backend storage.
|
|
|
Felix-Beltran, O., Gonzalez-Canales, F., Hernandez-Sanchez, J., Moretti, S., Noriega-Papaqui, R., & Rosado, A. (2015). Analysis of the quark sector in the 2HDM with a four-zero Yukawa texture using the most recent data on the CKM matrix. Phys. Lett. B, 742, 347–352.
Abstract: In this Letter we analyse, in the context of the general 2-Higgs Doublet Model, the structure of the Yukawa matrices, $\tilde{Y}^{q}_{1,2}$, by assuming a four-zero texture ansatz for their definition. In this framework, we obtain compact expressions for $\tilde{Y}^{q}_{1,2}$, which are reduced to the Cheng and Sher ansatz with the difference that they are obtained naturally as a direct consequence of the invariants of the fermion mass matrices. Furthermore, in order to avoid large flavour violating effects coming from charged Higgs exchange, we consider the main flavour constraints on the off-diagonal terms of Yukawa texture $(\tilde{\chi}^{q}_{j})_{kl}$ ($k \neq l$). We perform a $\chi^{2}$-fit based on current experimental data on the quark masses and the Cabibbo-Kobayashi-Maskawa mixing matrix $V_{\mathrm{CKM}}$. Hence, we obtain the allowed ranges for the parameters $\tilde{Y}^{q}_{1,2}$ at $1\sigma$ for several values of $\tan\beta$. The results are in complete agreement with the bounds obtained taking into account constraints on Flavour Changing Neutral Currents reported in the literature.
|
|
|
Bates, R. L. et al, Bernabeu Verdú, J., Civera, J. V., Gonzalez, F., Lacasta, C., & Sanchez, J. (2012). The ATLAS SCT grounding and shielding concept and implementation. J. Instrum., 7, P03005.
Abstract: This paper describes the design and implementation of the grounding and shielding system for the ATLAS SemiConductor Tracker (SCT). The mitigation of electromagnetic interference and noise pickup through power lines is the critical design goal as they have the potential to jeopardize the electrical performance. We accomplish this by adhering to the ATLAS grounding rules, by avoiding ground loops and isolating the different subdetectors. Noise sources are identified and design rules to protect the SCT against them are described. A rigorous implementation of the design was crucial to achieve the required performance. This paper highlights the location, connection and assembly of the different components that affect the grounding and shielding system: cables, filters, cooling pipes, shielding enclosure, power supplies and others. Special care is taken with the electrical properties of materials and joints. The monitoring of the grounding system during the installation period is also discussed. Finally, after connecting more than four thousand SCT modules to all of their services, electrical, mechanical and thermal within the wider ATLAS experimental environment, dedicated tests show that noise pickup is minimised.
|
|
|
Barberis, D. et al, Fernandez Casani, A., Garcia Montoro, C., Gonzalez de la Hoz, S., Salt, J., Sanchez, J., et al. (2023). The ATLAS EventIndex: A BigData Catalogue for All ATLAS Experiment Events. Comput. Softw. Big Sci., 7, 2–21pp.
Abstract: The ATLAS EventIndex system comprises the catalogue of all events collected, processed or generated by the ATLAS experiment at the CERN LHC accelerator, and all associated software tools to collect, store and query this information. ATLAS records several billion particle interactions every year of operation, processes them for analysis and generates even larger simulated data samples; a global catalogue is needed to keep track of the location of each event record and be able to search and retrieve specific events for in-depth investigations. Each EventIndex record includes summary information on the event itself and the pointers to the files containing the full event. Most components of the EventIndex system are implemented using BigData free and open-source software. This paper describes the architectural choices and their evolution in time, as well as the past, current and foreseen future implementations of all EventIndex components.
|
|