Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Speed and Conversational Large Language Models: Not All Is About Tokens per Second

Feb 23, 2025

Javier Conde, Miguel González, Pedro Reviriego, Zhen Gao, Shanshan Liu, Fabrizio Lombardi

Share this with someone who'll enjoy it:

Abstract:The speed of open-weights large language models (LLMs) and its dependency on the task at hand, when run on GPUs, is studied to present a comparative analysis of the speed of the most popular open LLMs.

* Computer (Volume: 57, Issue: 8, August 2024)

View paper on

Share this with someone who'll enjoy it:

Title:Speed and Conversational Large Language Models: Not All Is About Tokens per Second

Paper and Code