Alfonso Amayuelas

Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation

Apr 09, 2025

Self-Resource Allocation in Multi-Agent LLM Systems

Apr 02, 2025

Grounding LLM Reasoning with Knowledge Graphs

Feb 18, 2025

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Nov 29, 2024

Game-theoretic LLM: Agent Workflow for Negotiation Games

Nov 12, 2024

Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data

Jul 20, 2024

DebUnc: Mitigating Hallucinations in Large Language Model Agent Communication with Uncertainty Estimations

Jul 08, 2024

MultiAgent Collaboration Attack: Investigating Adversarial Attacks in Large Language Model Collaborations via Debate

Jun 26, 2024

DistiLRR: Transferring Code Repair for Low-Resource Programming Languages

Jun 21, 2024

Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation

Feb 05, 2024