OSPtrack: A Labeled Dataset Targeting Simulated Execution of Open-Source Software

Tan, Zhuoran; Anagnosstopoulos, Christos; Singer, Jeremy

arXiv:2411.14829 (cs)

[Submitted on 22 Nov 2024 (v1), last revised 28 Nov 2024 (this version, v2)]

Abstract: Open-source software (OSS) serves as a foundation for the internet and the cyber supply chain, but its exploitation is becoming increasingly prevalent. While advances in vulnerability detection for OSS have been significant, prior research has largely focused on static code analysis, often neglecting runtime indicators. To address this shortfall, we created a comprehensive dataset spanning five ecosystems, capturing features generated during the execution of packages and libraries in isolated environments. The dataset includes 9,461 package reports, of which 1,962 are identified as malicious, and encompasses both static and dynamic features such as files, sockets, commands, and DNS records. Each report is labeled with verified information and detailed sub-labels for attack types, facilitating the identification of malicious indicators when source code is unavailable. This dataset supports runtime detection, enhances detection model training, and enables efficient comparative analysis across ecosystems, contributing to the strengthening of supply chain security.
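The report counts in the abstract imply a noticeable class imbalance, which matters when the dataset is used for detection model training. The following is a minimal sketch, not the authors' method: only the two counts (9,461 and 1,962) come from the abstract, while the function name `class_balance`, the treatment of all remaining reports as a single "not malicious" class, and the inverse-frequency weighting scheme are assumptions made for illustration.

```python
# Counts taken from the abstract; everything else is an illustrative assumption.
TOTAL_REPORTS = 9461
MALICIOUS_REPORTS = 1962


def class_balance(total: int, malicious: int) -> dict:
    """Return class counts, the malicious share, and inverse-frequency weights.

    Assumes a two-class setup in which every report that is not labeled
    malicious falls into one "not malicious" class (an assumption; the
    abstract does not describe the remaining reports).
    """
    not_malicious = total - malicious
    n_classes = 2
    return {
        "not_malicious": not_malicious,
        "malicious": malicious,
        "malicious_share": malicious / total,
        # Inverse-frequency weight: total / (n_classes * class_count)
        "weight_not_malicious": total / (n_classes * not_malicious),
        "weight_malicious": total / (n_classes * malicious),
    }


balance = class_balance(TOTAL_REPORTS, MALICIOUS_REPORTS)
print(f"malicious share: {balance['malicious_share']:.1%}")
print(f"weight (malicious): {balance['weight_malicious']:.2f}")
print(f"weight (not malicious): {balance['weight_not_malicious']:.2f}")
```

Under these assumptions, roughly one report in five is malicious, so a classifier trained without reweighting or resampling could favor the majority class; the weights above are one common way to compensate.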

Subjects:	Cryptography and Security (cs.CR)
Cite as:	arXiv:2411.14829 [cs.CR]
	(or arXiv:2411.14829v2 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2411.14829

Submission history

From: Zhuoran Tan
[v1] Fri, 22 Nov 2024 10:07:42 UTC (256 KB)
[v2] Thu, 28 Nov 2024 10:17:05 UTC (256 KB)
