GREEN-CODE: Optimizing Energy Efficiency in Large Language Models for Code Generation

Ilager, Shashikant; Briem, Lukas Florian; Brandic, Ivona

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2501.11006 (cs)

[Submitted on 19 Jan 2025]

Title:GREEN-CODE: Optimizing Energy Efficiency in Large Language Models for Code Generation

Authors:Shashikant Ilager, Lukas Florian Briem, Ivona Brandic

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) are becoming integral to daily life, showcasing their vast potential across various Natural Language Processing (NLP) tasks. Beyond NLP, LLMs are increasingly used in software development tasks, such as code completion, modification, bug fixing, and code translation. Software engineers widely use tools like GitHub Copilot and Amazon Q, streamlining workflows and automating tasks with high accuracy. While the resource and energy intensity of LLM training is often highlighted, inference can be even more resource-intensive over time, as it's a continuous process with a high number of invocations. Therefore, developing resource-efficient alternatives for LLM inference is crucial for sustainability. This work proposes GREEN-CODE, a framework for energy-aware code generation in LLMs. GREEN-CODE performs dynamic early exit during LLM inference. We train a Reinforcement Learning (RL) agent that learns to balance the trade-offs between accuracy, latency, and energy consumption. Our approach is evaluated on two open-source LLMs, Llama 3.2 3B and OPT 2.7B, using the JavaCorpus and PY150 datasets. Results show that our method reduces the energy consumption between 23-50 % on average for code generation tasks without significantly affecting accuracy.

Comments:	Under submission in ACM/IEEE conference, 11 pages
Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Artificial Intelligence (cs.AI); Performance (cs.PF); Software Engineering (cs.SE)
ACM classes:	C4, D.0, E4, I7
Cite as:	arXiv:2501.11006 [cs.DC]
	(or arXiv:2501.11006v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2501.11006

Submission history

From: Shashikant Ilager Mr [view email]
[v1] Sun, 19 Jan 2025 10:44:03 UTC (3,608 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:GREEN-CODE: Optimizing Energy Efficiency in Large Language Models for Code Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:GREEN-CODE: Optimizing Energy Efficiency in Large Language Models for Code Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators