Picture for Yexin Liu

Yexin Liu

VideoGen-of-Thought: A Collaborative Framework for Multi-Shot Video Generation

Add code
Dec 03, 2024
Viaarxiv icon

Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

Add code
Jun 15, 2024
Viaarxiv icon

EyeFound: A Multimodal Generalist Foundation Model for Ophthalmic Imaging

Add code
May 22, 2024
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Viaarxiv icon

Evaluating large language models in medical applications: a survey

Add code
May 13, 2024
Viaarxiv icon

Learning High-Quality Navigation and Zooming on Omnidirectional Images in Virtual Reality

Add code
May 01, 2024
Viaarxiv icon

Unsupervised Visible-Infrared ReID via Pseudo-label Correction and Modality-level Alignment

Add code
Apr 10, 2024
Viaarxiv icon

GoodSAM: Bridging Domain and Capacity Gaps via Segment Anything Model for Distortion-aware Panoramic Semantic Segmentation

Add code
Mar 25, 2024
Viaarxiv icon

SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model

Add code
Mar 05, 2024
Figure 1 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Figure 2 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Figure 3 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Figure 4 for SynArtifact: Classifying and Alleviating Artifacts in Synthetic Images via Vision-Language Model
Viaarxiv icon

Efficient Multimodal Learning from Data-centric Perspective

Add code
Feb 18, 2024
Viaarxiv icon