Data Analysis, Statistics and Probability
See recent articles
Showing new listings for Wednesday, 29 January 2025
- [1] arXiv:2501.16814 [pdf, other]
-
Title: Dynamic Metadata Schemes in the Neutron and Photon Science Communities: A Case Study of X-Ray Photon Correlation SpectroscopyJournal-ref: Engineering and Technology International Journal of Computer and Information Engineering, Vol:18, No:5, 2024Subjects: Data Analysis, Statistics and Probability (physics.data-an)
Metadata is one of the most important aspects for advancing data management practices within all research communities. Definitions and schemes of metadata are inter alia of particular significance in the domain of neutron and photon scattering experiments covering a broad area of different scientific disciplines. The demand of describing continuously evolving highly nonstandardized experiments, including the resulting processed and published data, constitutes a considerable challenge for a static definition of metadata. Here, we present the concept of dynamic metadata for the neutron and photon scientific community, which enriches a static set of defined basic metadata. We explore the idea of dynamic metadata with the help of the use case of X-ray Photon Correlation Spectroscopy (XPCS), which is a synchrotron-based scattering technique that allows the investigation of nanoscale dynamic processes. It serves here as a demonstrator of how dynamic metadata can improve data acquisition, sharing, and analysis workflows. Our approach enables researchers to tailor metadata definitions dynamically and adapt them to the evolving demands of describing data and results from a diverse set of experiments. We demonstrate that dynamic metadata standards yield advantages that enhance data reproducibility, interoperability, and the dissemination of knowledge.
New submissions (showing 1 of 1 entries)
- [2] arXiv:2501.17061 (cross-list from quant-ph) [pdf, html, other]
-
Title: Two measurement bases are asymptotically informationally complete for any pure state tomographyTianfeng Feng, Tianqi Xiao, Yu Wang, Shengshi Pang, Farhan Hanif, Xiaoqi Zhou, Qi Zhao, M. S. Kim, Jinzhao SunComments: 28 pages, 8 figures, 1 tableSubjects: Quantum Physics (quant-ph); Mathematical Physics (math-ph); Data Analysis, Statistics and Probability (physics.data-an)
One of the fundamental questions in quantum information theory is to find how many measurement bases are required to obtain the full information of a quantum state. While a minimum of four measurement bases is typically required to determine an arbitrary pure state, we prove that for any states generated by finite-depth Clifford + T circuits, just two measurement bases are sufficient. More generally, we prove that two measurement bases are informationally complete for determining algebraic pure states whose state-vector elements represented in the computational basis are algebraic numbers. Since any pure state can be asymptotically approximated by a sequence of algebraic states with arbitrarily high precision, our scheme is referred to as asymptotically informationally complete for pure state tomography. Furthermore, existing works mostly construct the measurements using entangled bases. So far, the best result requires $O(n)$ local measurement bases for $n$-qubit pure-state tomography. Here, we show that two measurement bases that involve polynomial elementary gates are sufficient for uniquely determining sparse algebraic states. Moreover, we prove that two local measurement bases, involving single-qubit local operations only, are informationally complete for certain algebraic states, such as GHZ-like and W-like states. Besides, our two-measurement-bases scheme remains valid for mixed states with certain types of noises. We numerically test the uniqueness of the reconstructed states under two (local) measurement bases with and without measurement and depolarising types of noise. Our scheme provides a theoretical guarantee for pure state tomography in the fault-tolerant quantum computing regime.
- [3] arXiv:2501.17143 (cross-list from math.NA) [pdf, html, other]
-
Title: Numerical Approximation of High-Dimensional Gibbs Distributions Using the Functional Hierarchical TensorSubjects: Numerical Analysis (math.NA); Computational Physics (physics.comp-ph); Data Analysis, Statistics and Probability (physics.data-an)
The numerical representation of high-dimensional Gibbs distributions is challenging due to the curse of dimensionality manifesting through the intractable normalization constant calculations. This work addresses this challenge by performing a particle-based high-dimensional parametric density estimation subroutine, and the input to the subroutine is Gibbs samples generated by leveraging advanced sampling techniques. Specifically, to generate Gibbs samples, we employ ensemble-based annealed importance sampling, a population-based approach for sampling multimodal distributions. These samples are then processed using functional hierarchical tensor sketching, a tensor-network-based density estimation method for high-dimensional distributions, to obtain the numerical representation of the Gibbs distribution. We successfully apply the proposed approach to complex Ginzburg-Landau models with hundreds of variables. In particular, we show that the approach proposed is successful at addressing the metastability issue under difficult numerical cases.
Cross submissions (showing 2 of 2 entries)
- [4] arXiv:2406.05240 (replaced) [pdf, html, other]
-
Title: A General Track Fit based on TripletsComments: 24 pages, 12 figures, revised version submitted to Nuclear Instruments and Methods in Physics Research Section ASubjects: Instrumentation and Detectors (physics.ins-det); High Energy Physics - Experiment (hep-ex); Nuclear Experiment (nucl-ex); Data Analysis, Statistics and Probability (physics.data-an)
This paper presents a general three-dimensional track fit based on hit triplets.
The general track fit considers spatial hit and multiple Coulomb scattering uncertainties, and can also be extended to include energy losses.
Input to the fit are detector-specific triplet parameters, which contain information about
the triplet geometry (hit positions),
the radiation length of the material and the magnetic field.
Since the solution is given by an analytical closed-form, it is possible to use the same fitting code for all kind of tracking detectors.
Fitting formulas are given for the global track fit as well as for the local hit triplets.
The latter allows filtering out triplets with poor fit quality at an early stage of track reconstruction.
The construction and fit of local triplets is fully parallelizable, enabling accelerated computation with parallel hardware architectures.
Formulas for the detector-specific triplet parameters are derived for the two most commonly used field configuration for tracking detectors, namely a uniform solenoidal field and gap spectrometer dipole.
An algorithm to calculate the triplet parameters for an arbitrary magnetic field configuration is presented too.
This paper also includes a discussion of inherent track fit biases.
Furthermore, a new method is proposed to accelerate track fitting by classifying tracking regimes and using optimal fit formulas. - [5] arXiv:2410.19420 (replaced) [pdf, html, other]
-
Title: Doppler correlation-driven vetoes for the Frequency Hough analysis in continuous gravitational-wave searchesMatteo Di Giovanni, Paola Leaci, Pia Astone, Stefano Dal Pra, Sabrina D'Antonio, Luca D'Onofrio, Sergio Frasca, Federico Muciaccia, Cristiano Palomba, Lorenzo Pierini, Francesco Safai TehraniComments: 13 pages, 9 figures, 5 tablesSubjects: General Relativity and Quantum Cosmology (gr-qc); Instrumentation and Methods for Astrophysics (astro-ph.IM); Data Analysis, Statistics and Probability (physics.data-an)
We present an improved method for vetoing candidates of continuous gravitational-wave sources during all-sky searches utilizing the Frequency Hough pipeline. This approach leverages linear correlations between source parameters induced by the Earth Doppler effect, which can be effectively identified through the Hough Transform. Candidates that do not align with these patterns are considered spurious and can thus be vetoed, enhancing the depth and statistical significance of follow-up analyses. Additionally, we provide a comprehensive explanation of the method calibration, which intrinsically linked to the total duration of the observing run. On average, the procedure successfully vetoes $56\%$ of candidates. To assess the method performance, we conducted a Monte-Carlo simulation injecting fake continuous-wave signals into data from the third observing run of the LIGO detectors. This analysis allowed us to infer strain amplitude upper limits at a $90\%$ confidence level. We found that the optimal sensitivity is $h_0^{90\%} = 3.62^{+0.23}_{-0.22}\times 10^{-26}$ in the [128, 200] Hz band, which is within the most sensible frequency band of the LIGO detectors.
- [6] arXiv:2411.13402 (replaced) [pdf, html, other]
-
Title: Extraction of gravitational wave signals from LISA data in the presence of artifactsEleonora Castelli, Quentin Baghi, John G. Baker, Jacob Slutsky, Jérôme Bobin, Nikolaos Karnesis, Antoine Petiteau, Orion Sauter, Peter Wass, William J. WeberComments: 28 pages, 15 figuresSubjects: General Relativity and Quantum Cosmology (gr-qc); Instrumentation and Methods for Astrophysics (astro-ph.IM); Data Analysis, Statistics and Probability (physics.data-an)
The Laser Interferometer Space Antenna (LISA) mission is being developed by ESA with NASA participation. As it has recently passed the Mission Adoption milestone, models of the instruments and noise performance are becoming more detailed, and likewise prototype data analyses must as well. Assumptions such as Gaussianity, stationarity, and data continuity are unrealistic, and must be replaced with physically motivated data simulations, and data analysis methods adapted to accommodate such likely imperfections. To this end, the LISA Data Challenges have produced datasets featuring time-varying and unequal constellation armlength, and measurement artifacts including data interruptions and instrumental transients. In this work, we assess the impact of these data artifacts on the inference of Galactic Binary and Massive Black Hole properties. Our analysis shows that the treatment of noise transients and gaps is necessary for effective parameter estimation, as they substantially corrupt the analysis if unmitigated. We find that straightforward mitigation techniques can significantly if imperfectly suppress artifacts. For the Galactic Binaries, mitigation of glitches was essentially total, while mitigations of the data gaps increased parameter uncertainty by approximately 10%. For the Massive Black Hole binaries the particularly pernicious glitches resulted in a 30% uncertainty increase after mitigations, while the data gaps can increase parameter uncertainty by up to several times. Critically, this underlines the importance of early detection of transient gravitational waves to ensure they are protected from planned data interruptions.