Ivan Laptev

WILLOW, LIENS

RoomTour3D: Geometry-Aware Video-Instruction Tuning for Embodied Navigation

Dec 11, 2024
Via arXiv

MALT: Improving Reasoning with Multi-Agent LLM Training

Dec 02, 2024
Via arXiv

ShowHowTo: Generating Scene-Conditioned Step-by-Step Visual Instructions

Dec 02, 2024
Via arXiv

MALMM: Multi-Agent Large Language Models for Zero-Shot Robotics Manipulation

Nov 26, 2024
Via arXiv

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

Nov 25, 2024
Via arXiv

Mitigating Object Hallucination via Concentric Causal Attention

Oct 21, 2024
Via arXiv

Learning Feasible Transitions for Efficient Contact Planning

Jul 16, 2024
Via arXiv

Short Film Dataset: A Benchmark for Story-Level Video Understanding

Jun 14, 2024
Via arXiv

MirrorCheck: Efficient Adversarial Defense for Vision-Language Models

Jun 13, 2024
Via arXiv

ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos

Apr 24, 2024
Via arXiv