Stefan Lee

Calibrating MLLM-as-a-judge via Multimodal Bayesian Prompt Ensembles

Sep 10, 2025
Via arXiv

Towards Scalable Schema Mapping using Large Language Models

May 30, 2025
Via arXiv

Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Mar 20, 2025
Via arXiv

GABAR: Graph Attention-Based Action Ranking for Relational Policy Learning

Dec 06, 2024
Via arXiv

Hijacking Vision-and-Language Navigation Agents with Adversarial Environmental Attacks

Dec 03, 2024
Via arXiv

You Never Know: Quantization Induces Inconsistent Biases in Vision-Language Foundation Models

Oct 26, 2024
Via arXiv

Language-Informed Beam Search Decoding for Multilingual Machine Translation

Aug 11, 2024
Via arXiv

Point Cloud Models Improve Visual Robustness in Robotic Learners

Apr 29, 2024
Via arXiv

FairDeDup: Detecting and Mitigating Vision-Language Fairness Disparities in Semantic Dataset Deduplication

Apr 24, 2024
Via arXiv

VLSlice: Interactive Vision-and-Language Slice Discovery

Sep 13, 2023
Via arXiv