Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sotaro Kaneda

LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Sep 25, 2023

Ahmad Faiz, Sotaro Kaneda, Ruhan Wang, Rita Osi, Parteek Sharma, Fan Chen, Lei Jiang

Figure 1 for LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Figure 2 for LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Figure 3 for LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Figure 4 for LLMCarbon: Modeling the end-to-end Carbon Footprint of Large Language Models

Abstract:The carbon footprint associated with large language models (LLMs) is a significant concern, encompassing emissions from their training, inference, experimentation, and storage processes, including operational and embodied carbon emissions. An essential aspect is accurately estimating the carbon impact of emerging LLMs even before their training, which heavily relies on GPU usage. Existing studies have reported the carbon footprint of LLM training, but only one tool, mlco2, can predict the carbon footprint of new neural networks prior to physical training. However, mlco2 has several serious limitations. It cannot extend its estimation to dense or mixture-of-experts (MoE) LLMs, disregards critical architectural parameters, focuses solely on GPUs, and cannot model embodied carbon footprints. Addressing these gaps, we introduce \textit{LLMCarbon}, an end-to-end carbon footprint projection model designed for both dense and MoE LLMs. Compared to mlco2, LLMCarbon significantly enhances the accuracy of carbon footprint estimations for various LLMs.

Via

Access Paper or Ask Questions