Picture for Rocktim Jyoti Das

Rocktim Jyoti Das

MALT: Improving Reasoning with Multi-Agent LLM Training

Add code
Dec 02, 2024
Figure 1 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 2 for MALT: Improving Reasoning with Multi-Agent LLM Training
Figure 3 for MALT: Improving Reasoning with Multi-Agent LLM Training
Viaarxiv icon

MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation

Add code
Nov 26, 2024
Figure 1 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Figure 2 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Figure 3 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Figure 4 for MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation
Viaarxiv icon

MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations

Add code
Oct 18, 2024
Figure 1 for MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations
Figure 2 for MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations
Figure 3 for MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations
Figure 4 for MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations
Viaarxiv icon

Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems

Add code
May 24, 2024
Viaarxiv icon

Can a Multichoice Dataset be Repurposed for Extractive Question Answering?

Add code
Apr 26, 2024
Figure 1 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Figure 2 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Figure 3 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Figure 4 for Can a Multichoice Dataset be Repurposed for Extractive Question Answering?
Viaarxiv icon

EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models

Add code
Mar 15, 2024
Figure 1 for EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models
Figure 2 for EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models
Figure 3 for EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models
Figure 4 for EXAMS-V: A Multi-Discipline Multilingual Multimodal Exam Benchmark for Evaluating Vision Language Models
Viaarxiv icon

Factuality of Large Language Models in the Year 2024

Add code
Feb 09, 2024
Figure 1 for Factuality of Large Language Models in the Year 2024
Figure 2 for Factuality of Large Language Models in the Year 2024
Figure 3 for Factuality of Large Language Models in the Year 2024
Viaarxiv icon

Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models

Add code
Nov 08, 2023
Viaarxiv icon

DKAF: KB Arbitration for Learning Task-Oriented Dialog Systems with Dialog-KB Inconsistencies

Add code
May 26, 2023
Viaarxiv icon

Exploring Distributional Shifts in Large Language Models for Code Analysis

Add code
Mar 16, 2023
Viaarxiv icon