Picture for Serena Yeung-Levy

Serena Yeung-Levy

Zero-shot Action Localization via the Confidence of Large Vision-Language Models

Add code
Oct 18, 2024
Viaarxiv icon

How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities

Add code
Sep 18, 2024
Figure 1 for How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities
Figure 2 for How to Build the Virtual Cell with Artificial Intelligence: Priorities and Opportunities
Viaarxiv icon

Continuous Perception Benchmark

Add code
Aug 15, 2024
Viaarxiv icon

Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision

Add code
Jul 08, 2024
Viaarxiv icon

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding

Add code
Jul 01, 2024
Viaarxiv icon

Why are Visually-Grounded Language Models Bad at Image Classification?

Add code
May 28, 2024
Viaarxiv icon

Training Small Multimodal Models to Bridge Biomedical Competency Gap: A Case Study in Radiology Imaging

Add code
Mar 20, 2024
Viaarxiv icon

Depth-guided NeRF Training via Earth Mover's Distance

Add code
Mar 19, 2024
Viaarxiv icon

Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models

Add code
Mar 19, 2024
Viaarxiv icon

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Add code
Mar 15, 2024
Viaarxiv icon