NUMOSIM: A Synthetic Mobility Dataset with Anomaly Detection Benchmarks

Stanford, Chris; Adari, Suman; Liao, Xishun; He, Yueshuai; Jiang, Qinhua; Kuai, Chenchen; Ma, Jiaqi; Tung, Emmanuel; Qian, Yinlong; Zhao, Lingyi; Zhou, Zihao; Rasheed, Zeeshan; Shafique, Khurram

Computer Science > Machine Learning

arXiv:2409.03024 (cs)

[Submitted on 4 Sep 2024 (v1), last revised 6 Sep 2024 (this version, v2)]

Abstract: Collecting real-world mobility data is challenging. It is often fraught with privacy concerns, logistical difficulties, and inherent biases. Moreover, accurately annotating anomalies in large-scale data is nearly impossible, as it demands meticulous effort to distinguish subtle and complex patterns. These challenges significantly impede progress in geospatial anomaly detection research by restricting access to reliable data and complicating the rigorous evaluation, comparison, and benchmarking of methodologies. To address these limitations, we introduce a synthetic mobility dataset, NUMOSIM, that provides a controlled, ethical, and diverse environment for benchmarking anomaly detection techniques. NUMOSIM simulates a wide array of realistic mobility scenarios, encompassing both typical and anomalous behaviors, generated through advanced deep learning models trained on real mobility data. This approach allows NUMOSIM to accurately replicate the complexities of real-world movement patterns while strategically injecting anomalies to challenge and evaluate detection algorithms based on how effectively they capture the interplay between demographic, geospatial, and temporal factors. Our goal is to advance geospatial mobility analysis by offering a realistic benchmark for improving anomaly detection and mobility modeling techniques. To support this, we provide open access to the NUMOSIM dataset, along with comprehensive documentation, evaluation metrics, and benchmark results.

Subjects: Machine Learning (cs.LG)
Cite as: arXiv:2409.03024 [cs.LG]
(or arXiv:2409.03024v2 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2409.03024

Submission history

From: Chris Stanford
[v1] Wed, 4 Sep 2024 18:31:24 UTC (3,141 KB)
[v2] Fri, 6 Sep 2024 16:55:26 UTC (3,141 KB)
