Xiaohan Wang

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Jan 14, 2025

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Jan 06, 2025

DeepSeek-V3 Technical Report

Dec 27, 2024

Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration

Dec 17, 2024

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Dec 13, 2024

Targeted Learning for Variable Importance

Nov 04, 2024

Zero-shot Action Localization via the Confidence of Large Vision-Language Models

Oct 18, 2024

Innovative Thinking, Infinite Humor: Humor Research of Large Language Models through Structured Thought Leaps

Oct 14, 2024

RuleAlign: Making Large Language Models Better Physicians with Diagnostic Rule Alignment

Aug 22, 2024

MetaTool: Facilitating Large Language Models to Master Tools with Meta-task Augmentation

Jul 15, 2024