Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alfredo Garrachón Ruiz

On the Temporal Question-Answering Capabilities of Large Language Models Over Anonymized Data

Apr 10, 2025

Alfredo Garrachón Ruiz, Tomás de la Rosa, Daniel Borrajo

Abstract:The applicability of Large Language Models (LLMs) in temporal reasoning tasks over data that is not present during training is still a field that remains to be explored. In this paper we work on this topic, focusing on structured and semi-structured anonymized data. We not only develop a direct LLM pipeline, but also compare various methodologies and conduct an in-depth analysis. We identified and examined seventeen common temporal reasoning tasks in natural language, focusing on their algorithmic components. To assess LLM performance, we created the \textit{Reasoning and Answering Temporal Ability} dataset (RATA), featuring semi-structured anonymized data to ensure reliance on reasoning rather than on prior knowledge. We compared several methodologies, involving SoTA techniques such as Tree-of-Thought, self-reflexion and code execution, tuned specifically for this scenario. Our results suggest that achieving scalable and reliable solutions requires more than just standalone LLMs, highlighting the need for integrated approaches.

* 18 pages, 7 tables, 5 figures

Via

Access Paper or Ask Questions

TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

Dec 10, 2024

Alfredo Garrachón Ruiz, Tomás de la Rosa, Daniel Borrajo

Figure 1 for TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

Figure 2 for TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

Figure 3 for TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

Figure 4 for TRIM: Token Reduction and Inference Modeling for Cost-Effective Language Generation

Abstract:The inference cost of Large Language Models (LLMs) is a significant challenge due to their computational demands, specially on tasks requiring long outputs. However, natural language often contains redundancy, which presents an opportunity for optimization. We have observed that LLMs can generate distilled language-concise outputs that retain essential meaning, when prompted appropriately. We propose a framework for saving computational cost, in which a shorter distilled output from the LLM is reconstructed into a full narrative by a smaller model with lower inference costs. Our experiments show promising results, particularly in general knowledge domains with 20.58% saved tokens on average with tiny decrease in evaluation metrics, hinting that this approach can effectively balance efficiency and accuracy in language processing tasks.

* 12 pages

Via

Access Paper or Ask Questions