Towards Neural Decompilation

Katz, Omer; Olshaker, Yuval; Goldberg, Yoav; Yahav, Eran

arXiv:1905.08325 (cs)

[Submitted on 20 May 2019]


Abstract: We address the problem of automatic decompilation: converting a program in a low-level representation back to a higher-level, human-readable programming language. Decompilation is extremely important for security researchers, because finding vulnerabilities and understanding how malware operates are much easier when done over source code.
The importance of decompilation has motivated the construction of hand-crafted, rule-based decompilers. Such decompilers are designed by experts to detect specific control-flow structures and idioms in low-level code and lift them to source level. The cost of supporting additional languages or new language features in these decompilers is very high.
We present a novel approach to decompilation based on neural machine translation. The main idea is to automatically learn a decompiler from a given compiler. Given a compiler from a source language S to a target language T, our approach automatically trains a decompiler that can translate (decompile) T back to S. We used our framework to decompile both LLVM IR and x86 assembly to C code with high success rates. Using our LLVM and x86 instantiations, we successfully decompiled over 97% and 88% of our benchmarks, respectively.
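
The idea of learning a decompiler from a given compiler can be illustrated with a minimal sketch. The abstract does not describe the training pipeline, so everything below is an assumption made for illustration: the toy source language (parenthesised arithmetic), the toy target language (a postfix stack-machine listing), and the helper names `random_expr`, `to_source`, `compile_expr`, and `build_parallel_corpus` are all hypothetical. The sketch only shows how a compiler from S to T can serve as an oracle that produces (T, S) pairs, which is the kind of parallel corpus a neural machine translation model is trained on.

```python
import random

# Toy stand-ins (assumptions, not the C / LLVM IR / x86 setting of the paper):
#   S = fully parenthesised arithmetic over single-letter variables
#   T = a postfix stack-machine listing

OPCODES = {"+": "ADD", "-": "SUB", "*": "MUL"}


def random_expr(rng, depth=2):
    """Sample a random expression tree: a variable name or (op, lhs, rhs)."""
    if depth == 0 or rng.random() < 0.3:
        return rng.choice("abcd")
    op = rng.choice("+-*")
    return (op, random_expr(rng, depth - 1), random_expr(rng, depth - 1))


def to_source(expr):
    """Render an expression tree as text in the source language S."""
    if isinstance(expr, str):
        return expr
    op, lhs, rhs = expr
    return f"({to_source(lhs)} {op} {to_source(rhs)})"


def compile_expr(expr):
    """The 'given compiler': translate an expression tree into T instructions."""
    if isinstance(expr, str):
        return [f"PUSH {expr}"]
    op, lhs, rhs = expr
    return compile_expr(lhs) + compile_expr(rhs) + [OPCODES[op]]


def build_parallel_corpus(n, seed=0):
    """Return n (low-level, high-level) pairs: model input is T, label is S."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        expr = random_expr(rng)
        pairs.append((" ; ".join(compile_expr(expr)), to_source(expr)))
    return pairs


corpus = build_parallel_corpus(5)
for low, high in corpus:
    print(f"{low!r} -> {high!r}")
```

Because the compiler generates the low-level side of every pair, supporting a different language in this sketch would mean swapping in a different program generator and compiler rather than hand-writing new lifting rules.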

Subjects:	Programming Languages (cs.PL); Machine Learning (cs.LG)
Cite as:	arXiv:1905.08325 [cs.PL]
	(or arXiv:1905.08325v1 [cs.PL] for this version)
	https://doi.org/10.48550/arXiv.1905.08325

Submission history

From: Omer Katz
[v1] Mon, 20 May 2019 20:02:53 UTC (227 KB)
