Ignacio Iacobacci

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Nov 05, 2024
Via arXiv

Code-Optimise: Self-Generated Preference Data for Correctness and Efficiency

Jun 18, 2024
Via arXiv

HumanRankEval: Automatic Evaluation of LMs as Conversational Assistants

May 15, 2024
Via arXiv

MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation

Apr 03, 2024
Via arXiv

Findings of the First Workshop on Simulating Conversational Intelligence in Chat

Feb 09, 2024
Via arXiv

Automatic Unit Test Data Generation and Actor-Critic Reinforcement Learning for Code Synthesis

Oct 20, 2023
Via arXiv

A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems

Oct 19, 2023
Via arXiv

The Regular Expression Inference Challenge

Aug 15, 2023
Via arXiv

Multi3WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems

Jul 26, 2023
Via arXiv

Topic-Aware Response Generation in Task-Oriented Dialogue with Unstructured Knowledge Access

Dec 10, 2022
Via arXiv