A Water Efficiency Dataset for African Data Centers

Shumba, Noah; Tshekiso, Opelo; Li, Pengfei; Fanti, Giulia; Ren, Shaolei

Computer Science > Machine Learning

arXiv:2412.03716 (cs)

[Submitted on 4 Dec 2024 (v1), last revised 6 Dec 2024 (this version, v2)]

Title:A Water Efficiency Dataset for African Data Centers

Authors:Noah Shumba, Opelo Tshekiso, Pengfei Li, Giulia Fanti, Shaolei Ren

View PDF HTML (experimental)

Abstract:AI computing and data centers consume a large amount of freshwater, both directly for cooling and indirectly for electricity generation. While most attention has been paid to developed countries such as the U.S., this paper presents the first-of-its-kind dataset that combines nation-level weather and electricity generation data to estimate water usage efficiency for data centers in 41 African countries across five different climate regions. We also use our dataset to evaluate and estimate the water consumption of inference on two large language models (i.e., Llama-3-70B and GPT-4) in 11 selected African countries. Our findings show that writing a 10-page report using Llama-3-70B could consume about \textbf{0.7 liters} of water, while the water consumption by GPT-4 for the same task may go up to about 60 liters. For writing a medium-length email of 120-200 words, Llama-3-70B and GPT-4 could consume about \textbf{0.13 liters} and 3 liters of water, respectively. Interestingly, given the same AI model, 8 out of the 11 selected African countries consume less water than the global average, mainly because of lower water intensities for electricity generation. However, water consumption can be substantially higher in some African countries with a steppe climate than the U.S. and global averages, prompting more attention when deploying AI computing in these countries. Our dataset is publicly available on \href{this https URL}{Hugging Face}.

Comments:	Accepted by NeurIPS 2024 Workshop on Tackling Climate Change with Machine Learning
Subjects:	Machine Learning (cs.LG); Computers and Society (cs.CY)
Cite as:	arXiv:2412.03716 [cs.LG]
	(or arXiv:2412.03716v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.03716

Submission history

From: Pengfei Li [view email]
[v1] Wed, 4 Dec 2024 21:09:45 UTC (51 KB)
[v2] Fri, 6 Dec 2024 04:40:40 UTC (51 KB)

Computer Science > Machine Learning

Title:A Water Efficiency Dataset for African Data Centers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:A Water Efficiency Dataset for African Data Centers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators