Picture for Serena Yeung-Levy

Serena Yeung-Levy

iSight: Towards expert-AI co-assessment for improved immunohistochemistry staining interpretation

Add code
Feb 03, 2026
Viaarxiv icon

Seeing Is Believing? A Benchmark for Multimodal Large Language Models on Visual Illusions and Anomalies

Add code
Feb 02, 2026
Viaarxiv icon

Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions

Add code
Jan 29, 2026
Viaarxiv icon

PaperSearchQA: Learning to Search and Reason over Scientific Papers with RLVR

Add code
Jan 26, 2026
Viaarxiv icon

RadDiff: Describing Differences in Radiology Image Sets with Natural Language

Add code
Jan 07, 2026
Viaarxiv icon

Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning

Add code
Dec 24, 2025
Figure 1 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Figure 2 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Figure 3 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Figure 4 for Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning
Viaarxiv icon

AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond

Add code
Sep 30, 2025
Viaarxiv icon

Can Large Language Models Match the Conclusions of Systematic Reviews?

Add code
May 28, 2025
Viaarxiv icon

NegVQA: Can Vision Language Models Understand Negation?

Add code
May 28, 2025
Viaarxiv icon

Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence

Add code
Apr 03, 2025
Viaarxiv icon