Enhancing AI-based Generation of Software Exploits with Contextual Information

Liguori, Pietro; Improta, Cristina; Natella, Roberto; Cukic, Bojan; Cotroneo, Domenico

arXiv:2408.02402 (cs)

[Submitted on 5 Aug 2024 (v1), last revised 6 Sep 2024 (this version, v3)]

Abstract: This practical experience report explores Neural Machine Translation (NMT) models' capability to generate offensive security code from natural language (NL) descriptions, highlighting the significance of contextual understanding and its impact on model performance. Our study employs a dataset comprising real shellcodes to evaluate the models across various scenarios, including missing information, necessary context, and unnecessary context. The experiments are designed to assess the models' resilience against incomplete descriptions, their proficiency in leveraging context for enhanced accuracy, and their ability to discern irrelevant information. The findings reveal that the introduction of contextual data significantly improves performance. However, the benefits of additional context diminish beyond a certain point, indicating an optimal level of contextual information for model training. Moreover, the models demonstrate an ability to filter out unnecessary context, maintaining high levels of accuracy in the generation of offensive security code. This study paves the way for future research on optimizing context use in AI-driven code generation, particularly for applications requiring a high degree of technical precision such as the generation of offensive code.

Comments:	Accepted for publication at The 35th IEEE International Symposium on Software Reliability Engineering
Subjects:	Software Engineering (cs.SE); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.02402 [cs.SE]
	(or arXiv:2408.02402v3 [cs.SE] for this version)
	https://doi.org/10.48550/arXiv.2408.02402

Submission history

From: Pietro Liguori
[v1] Mon, 5 Aug 2024 11:52:34 UTC (1,047 KB)
[v2] Tue, 6 Aug 2024 10:19:26 UTC (1,044 KB)
[v3] Fri, 6 Sep 2024 12:51:35 UTC (362 KB)
