Data Analysis, Statistics and Probability
See recent articles
Showing new listings for Friday, 31 January 2025
- [1] arXiv:2501.17892 (cross-list from physics.ins-det) [pdf, html, other]
-
Title: Object Detection with Deep Learning for Rare Event Search in the GADGET II TPCTyler Wheeler, S. Ravishankar, C. Wrede, A. Andalib, A. Anthony, Y. Ayyad, B. Jain, A. Jaros, R. Mahajan, L. Schaedig, A. Adams, S. Ahn, J.M. Allmond, D. Bardayan, D. Bazin, K. Bosmpotinis, T. Budner, S.R. Carmichael, S.M. Cha, A. Chen, K.A. Chipps, J.M. Christie, I. Cox, J. Dopfer, M. Friedman, J. Garcia-Duarte, E. Good, T.J. Gray, A. Green, R. Grzywacz, K. Hahn, R. Jain, E. Jensen, T. King, S. Liddick, B. Longfellow, R. Lubna, C. Marshall, Y. Mishnayot, A.J. Mitchell, F. Montes, T.H. Ogunbeku, J. Owens-Fryar, S.D. Pain, J. Pereira, E. Pollacco, A.M. Rogers, M.Z. Serikow, K. Setoodehnia, L.J. Sun, J. Surbrook, A. Tsantiri, L.E. WeghornSubjects: Instrumentation and Detectors (physics.ins-det); Nuclear Experiment (nucl-ex); Data Analysis, Statistics and Probability (physics.data-an)
In the pursuit of identifying rare two-particle events within the GADGET II Time Projection Chamber (TPC), this paper presents a comprehensive approach for leveraging Convolutional Neural Networks (CNNs) and various data processing methods. To address the inherent complexities of 3D TPC track reconstructions, the data is expressed in 2D projections and 1D quantities. This approach capitalizes on the diverse data modalities of the TPC, allowing for the efficient representation of the distinct features of the 3D events, with no loss in topology uniqueness. Additionally, it leverages the computational efficiency of 2D CNNs and benefits from the extensive availability of pre-trained models. Given the scarcity of real training data for the rare events of interest, simulated events are used to train the models to detect real events. To account for potential distribution shifts when predominantly depending on simulations, significant perturbations are embedded within the simulations. This produces a broad parameter space that works to account for potential physics parameter and detector response variations and uncertainties. These parameter-varied simulations are used to train sensitive 2D CNN object detectors. When combined with 1D histogram peak detection algorithms, this multi-modal detection framework is highly adept at identifying rare, two-particle events in data taken during experiment 21072 at the Facility for Rare Isotope Beams (FRIB), demonstrating a 100% recall for events of interest. We present the methods and outcomes of our investigation and discuss the potential future applications of these techniques.
- [2] arXiv:2501.17936 (cross-list from astro-ph.CO) [pdf, html, other]
-
Title: Aspects of Spatially-Correlated Random Fields: Extreme-Value Statistics and Clustering PropertiesComments: 7 pages, 6 figuresSubjects: Cosmology and Nongalactic Astrophysics (astro-ph.CO); General Relativity and Quantum Cosmology (gr-qc); High Energy Physics - Phenomenology (hep-ph); Data Analysis, Statistics and Probability (physics.data-an)
Rare events of large-scale spatially-correlated exponential random fields are studied. The influence of spatial correlations on clustering and non-sphericity is investigated. The size of the performed simulations permits to study beyond-$7.5$-sigma events ($1$ in $10^{13}$). As an application, this allows to resolve individual Hubble patches which fulfill the condition for primordial black hole formation. It is argued that their mass spectrum is drastically altered due to co-collapse of clustered overdensities as well as the mutual threshold-lowering through the latter. Furthermore, the corresponding non-sphericities imply possibly large changes in the initial black hole spin distribution.
- [3] arXiv:2501.18423 (cross-list from gr-qc) [pdf, html, other]
-
Title: DeepExtractor: Time-domain reconstruction of signals and glitches in gravitational wave data with deep learningTom Dooney, Harsh Narola, Stefano Bromuri, R. Lyana Curier, Chris Van Den Broeck, Sarah Caudill, Daniel Stanley TanComments: 22 pages, 16 figures, 4 tablesSubjects: General Relativity and Quantum Cosmology (gr-qc); Instrumentation and Methods for Astrophysics (astro-ph.IM); Machine Learning (cs.LG); Data Analysis, Statistics and Probability (physics.data-an); Instrumentation and Detectors (physics.ins-det)
Gravitational wave (GW) interferometers, detect faint signals from distant astrophysical events, such as binary black hole mergers. However, their high sensitivity also makes them susceptible to background noise, which can obscure these signals. This noise often includes transient artifacts called "glitches" that can mimic astrophysical signals or mask their characteristics. Fast and accurate reconstruction of both signals and glitches is crucial for reliable scientific inference. In this study, we present DeepExtractor, a deep learning framework designed to reconstruct signals and glitches with power exceeding interferometer noise, regardless of their source. We design DeepExtractor to model the inherent noise distribution of GW interferometers, following conventional assumptions that the noise is Gaussian and stationary over short time scales. It operates by predicting and subtracting the noise component of the data, retaining only the clean reconstruction. Our approach achieves superior generalization capabilities for arbitrary signals and glitches compared to methods that directly map inputs to the clean training waveforms. We validate DeepExtractor's effectiveness through three experiments: (1) reconstructing simulated glitches injected into simulated detector noise, (2) comparing performance with the state-of-the-art BayesWave algorithm, and (3) analyzing real data from the Gravity Spy dataset to demonstrate effective glitch subtraction from LIGO strain data. DeepExtractor achieves a median mismatch of only 0.9% for simulated glitches, outperforming several deep learning baselines. Additionally, DeepExtractor surpasses BayesWave in glitch recovery, offering a dramatic computational speedup by reconstructing one glitch sample in approx. 0.1 seconds on a CPU, compared to BayesWave's processing time of approx. one hour per glitch.
Cross submissions (showing 3 of 3 entries)
- [4] arXiv:2406.01602 (replaced) [pdf, html, other]
-
Title: Effectiveness of denoising diffusion probabilistic models for fast and high-fidelity whole-event simulation in high-energy heavy-ion experimentsYeonju Go, Dmitrii Torbunov, Timothy Rinn, Yi Huang, Haiwang Yu, Brett Viren, Meifeng Lin, Yihui Ren, Jin HuangComments: 11 pages, 7 figuresJournal-ref: Phys.Rev.C 110 (2024) 3, 034912Subjects: Data Analysis, Statistics and Probability (physics.data-an); High Energy Physics - Experiment (hep-ex); Nuclear Experiment (nucl-ex)
Artificial intelligence (AI) generative models, such as generative adversarial networks (GANs), variational auto-encoders, and normalizing flows, have been widely used and studied as efficient alternatives for traditional scientific simulations. However, they have several drawbacks, including training instability and inability to cover the entire data distribution, especially for regions where data are rare. This is particularly challenging for whole-event, full-detector simulations in high-energy heavy-ion experiments, such as sPHENIX at the Relativistic Heavy Ion Collider and Large Hadron Collider experiments, where thousands of particles are produced per event and interact with the detector. This work investigates the effectiveness of Denoising Diffusion Probabilistic Models (DDPMs) as an AI-based generative surrogate model for the sPHENIX experiment that includes the heavy-ion event generation and response of the entire calorimeter stack. DDPM performance in sPHENIX simulation data is compared with a popular rival, GANs. Results show that both DDPMs and GANs can reproduce the data distribution where the examples are abundant (low-to-medium calorimeter energies). Nonetheless, DDPMs significantly outperform GANs, especially in high-energy regions where data are rare. Additionally, DDPMs exhibit superior stability compared to GANs. The results are consistent between both central and peripheral centrality heavy-ion collision events. Moreover, DDPMs offer a substantial speedup of approximately a factor of 100 compared to the traditional Geant4 simulation method.
- [5] arXiv:2412.13741 (replaced) [pdf, html, other]
-
Title: Data-driven assessment of optimal spatiotemporal resolutions for information extraction in noisy time series dataSubjects: Data Analysis, Statistics and Probability (physics.data-an)
In general, comprehension of any type of complex system depends on the resolution used to examine the phenomena occurring within it. However, identifying a priori, for example, the best time frequencies/scales to study a certain system over-time, or the spatial distances at which correlations, symmetries, and fluctuations are, most often non-trivial. Here we describe an unsupervised approach that, starting solely from the data of a system, allows learning the characteristic length scales of the dominant key events/processes and the optimal spatiotemporal resolutions to characterize them. We tested this approach on time series data obtained from simulation or experimental trajectories of various example many-body complex systems ranging from the atomic to the macroscopic scale and having diverse internal dynamic complexities. Our method automatically analyzes the system data by analyzing correlations at all relevant inter-particle distances and at all possible inter-frame intervals in which their time series can be subdivided, namely, at all space and time this http URL optimal spatiotemporal resolution for studying a certain system thus maximizes information extraction and classification from the system's data, which we prove to be related to the characteristic spatiotemporal length scales of the local/collective physical events dominating it. This approach is broadly applicable and can be used to optimize the study of different types of data (static distributions, time series, or signals). The concept of 'optimal resolution' has a general character and provides a robust basis for characterizing any type of system based on its data, as well as to guide data analysis in general.
- [6] arXiv:2411.02740 (replaced) [pdf, html, other]
-
Title: An information-matching approach to optimal experimental design and active learningYonatan Kurniawan (1), Tracianne B. Neilsen (1), Benjamin L. Francis (2), Alex M. Stankovic (3), Mingjian Wen (4), Ilia Nikiforov (5), Ellad B. Tadmor (5), Vasily V. Bulatov (6), Vincenzo Lordi (6), Mark K. Transtrum (1, 2, and 3) ((1) Brigham Young University, Provo, UT, USA, (2) Achilles Heel Technologies, Orem, UT, USA, (3) SLAC National Accelerator Laboratory, Menlo Park, CA, USA, (4) University of Houston, Houston, TX, USA, (5) University of Minnesota, Minneapolis, MN, USA, (6) Lawrence Livermore National Laboratory)Subjects: Machine Learning (cs.LG); Materials Science (cond-mat.mtrl-sci); Applied Physics (physics.app-ph); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
The efficacy of mathematical models heavily depends on the quality of the training data, yet collecting sufficient data is often expensive and challenging. Many modeling applications require inferring parameters only as a means to predict other quantities of interest (QoI). Because models often contain many unidentifiable (sloppy) parameters, QoIs often depend on a relatively small number of parameter combinations. Therefore, we introduce an information-matching criterion based on the Fisher Information Matrix to select the most informative training data from a candidate pool. This method ensures that the selected data contain sufficient information to learn only those parameters that are needed to constrain downstream QoIs. It is formulated as a convex optimization problem, making it scalable to large models and datasets. We demonstrate the effectiveness of this approach across various modeling problems in diverse scientific fields, including power systems and underwater acoustics. Finally, we use information-matching as a query function within an Active Learning loop for material science applications. In all these applications, we find that a relatively small set of optimal training data can provide the necessary information for achieving precise predictions. These results are encouraging for diverse future applications, particularly active learning in large machine learning models.
- [7] arXiv:2412.00405 (replaced) [pdf, html, other]
-
Title: Stochastic Dynamics and Probability Analysis for a Generalized Epidemic Model with Environmental NoiseSubjects: Populations and Evolution (q-bio.PE); Dynamical Systems (math.DS); Data Analysis, Statistics and Probability (physics.data-an); Neurons and Cognition (q-bio.NC)
In this paper we consider a stochastic SEIQR (susceptible-exposed-infected-quarantined-recovered) epidemic model with a generalized incidence function. Using the Lyapunov method, we establish the existence and uniqueness of a global positive solution to the model, ensuring that it remains well-defined over time. Through the application of Young's inequality and Chebyshev's inequality, we demonstrate the concepts of stochastic ultimate boundedness and stochastic permanence, providing insights into the long-term behavior of the epidemic dynamics under random perturbations. Furthermore, we derive conditions for stochastic extinction, which describe scenarios where the epidemic may eventually die out, and V-geometric ergodicity, which indicates the rate at which the system's state converges to its equilibrium. Finally, we perform numerical simulations to verify our theoretical results and assess the model's behavior under different parameters.
- [8] arXiv:2412.12312 (replaced) [pdf, html, other]
-
Title: Applications of machine learning in ion beam analysis of materialsComments: 9 pages, 3 figuresSubjects: Materials Science (cond-mat.mtrl-sci); Data Analysis, Statistics and Probability (physics.data-an)
Ion Beam Analysis (IBA) is an established tool for material characterization, providing precise information on elemental composition, depth profiles, and structural information in the region near the surface of materials. However, traditional data processing methods can be slow and computationally intensive, limiting the efficiency and speed of the analysis. This article explores the current landscape of applying Machine Learning Algorithms (MLA) in the field of IBA, demonstrating the immense potential to optimize and accelerate processes. We present how ML has been employed to extract valuable insights from large datasets, automate repetitive tasks, and enhance the interpretability of results, with practical examples of applications in various IBA techniques, such as RBS, PIXE, and others. Finally, perspectives on using MLA to approach open problems in IBA are also discussed.