Variables are a Curse in Software Vulnerability Prediction

Groppe, Jinghua; Groppe, Sven; Möller, Ralf

doi:10.1007/978-3-031-39847-6_41

arXiv:2407.02509 (cs)

[Submitted on 18 Jun 2024]

Abstract: Deep learning-based approaches to software vulnerability prediction currently rely mainly on the original text of software code as the feature of nodes in the code graph. They may therefore learn a representation that is specific to the code text rather than one that captures the 'intrinsic' functionality of a program hidden in that text. One curse that causes this problem is the infinite number of ways to name a variable. To lift the curse, this work introduces a new type of edge called name dependence, a type of abstract syntax graph based on name dependence, and an efficient node representation method named the 3-property encoding scheme. These techniques allow concrete variable names to be removed from code and help deep learning models learn the functionality of software hidden in diverse code expressions. The experimental results show that deep learning models built on these techniques outperform those based on existing approaches, not only in vulnerability prediction but also in memory requirements. Memory usage can be reduced by a factor of up to the order of 30,000 in comparison to existing approaches.
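The idea of removing concrete variable names while keeping track of which occurrences belong together can be illustrated with a small sketch. The abstract does not define name dependence or the 3-property encoding scheme, so the code below is not the authors' method. It assumes a simplified stand-in: every variable occurrence is reduced to its syntactic role, and an edge links each later occurrence to the first occurrence of the same name. The function `name_free_graph` and the role labels are hypothetical names chosen for this illustration, and Python's standard `ast` module is used only as a convenient parser.

```python
import ast


def name_free_graph(source: str):
    """Return (labels, edges) describing variable occurrences without names.

    Assumption for illustration only: an edge (i, j) links occurrence j to the
    first occurrence i of the same identifier. This is a simplified stand-in,
    not the name dependence edge defined in the paper.
    """
    tree = ast.parse(source)

    # Collect every variable occurrence with its position and syntactic role.
    occurrences = []
    for node in ast.walk(tree):
        if isinstance(node, ast.arg):
            occurrences.append((node.lineno, node.col_offset, node.arg, "Param"))
        elif isinstance(node, ast.Name):
            role = type(node.ctx).__name__  # "Load", "Store" or "Del"
            occurrences.append((node.lineno, node.col_offset, node.id, role))
    occurrences.sort()  # restore source order, since ast.walk is breadth-first

    labels = []      # role of each occurrence; the identifier itself is dropped
    edges = []       # (index of first occurrence, index of later occurrence)
    first_seen = {}  # identifier -> index of its first occurrence
    for index, (_, _, name, role) in enumerate(occurrences):
        labels.append(role)
        if name in first_seen:
            edges.append((first_seen[name], index))
        else:
            first_seen[name] = index
    return labels, edges


# Two functions that differ only in how their variables are named.
SNIPPET_A = "def f(x, y):\n    z = x + y\n    return z * x\n"
SNIPPET_B = "def g(left, right):\n    total = left + right\n    return total * left\n"

if __name__ == "__main__":
    print(name_free_graph(SNIPPET_A) == name_free_graph(SNIPPET_B))
```

Under these assumptions the two snippets yield the same labels and edges, because the identifiers no longer appear in the representation. A snippet that uses a different variable in the final expression would produce a different edge list.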

Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
ACM classes:	I.2.0; D.2.m
Cite as:	arXiv:2407.02509 [cs.SE]
	(or arXiv:2407.02509v1 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2407.02509
Journal reference:	In Database and Expert Systems Applications: 34th International Conference, DEXA 2023, Penang, Malaysia, August 28-30, 2023, Proceedings, Part I. Springer-Verlag, Berlin, Heidelberg, 516-521
Related DOI:	https://doi.org/10.1007/978-3-031-39847-6_41

Submission history

From: Jinghua Groppe
[v1] Tue, 18 Jun 2024 16:02:29 UTC (74 KB)
