Back to Search Start Over

OSPtrack: A Labeled Dataset Targeting Simulated Open-Source Package Execution

Authors :
Tan, Zhuoran
Anagnosstopoulos, Christos
Singer, Jeremy
Publication Year :
2024

Abstract

Open-source software is a fundamental part of the internet and the cyber supply chain, but its exploitation has become more frequent. While vulnerability detection in OSS has advanced, previous work mainly focuses on static code analysis, neglecting runtime indicators. To address this, we created a dataset spanning multiple ecosystems, capturing features generated during the execution of packages and libraries in isolated environments. The dataset includes 9,461 package reports (1,962 malicious), with static and dynamic features such as files, sockets, commands, and DNS records. Labeled with verified information and detailed sub-labels for attack types, this dataset helps identify malicious indicators, especially when source code access is limited, and supports efficient detection methods during runtime.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2411.14829
Document Type :
Working Paper