Picture for Yuhui Zhang

Yuhui Zhang

EquiBench: Benchmarking Code Reasoning Capabilities of Large Language Models via Equivalence Checking

Add code
Feb 18, 2025
Viaarxiv icon

Temporal Preference Optimization for Long-Form Video Understanding

Add code
Jan 23, 2025
Viaarxiv icon

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Add code
Jan 14, 2025
Viaarxiv icon

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Add code
Jan 06, 2025
Figure 1 for Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Figure 2 for Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Figure 3 for Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Figure 4 for Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation
Viaarxiv icon

DataLab: A Unified Platform for LLM-Powered Business Intelligence

Add code
Dec 04, 2024
Figure 1 for DataLab: A Unified Platform for LLM-Powered Business Intelligence
Figure 2 for DataLab: A Unified Platform for LLM-Powered Business Intelligence
Figure 3 for DataLab: A Unified Platform for LLM-Powered Business Intelligence
Figure 4 for DataLab: A Unified Platform for LLM-Powered Business Intelligence
Viaarxiv icon

DataLab: A Unifed Platform for LLM-Powered Business Intelligence

Add code
Dec 03, 2024
Figure 1 for DataLab: A Unifed Platform for LLM-Powered Business Intelligence
Figure 2 for DataLab: A Unifed Platform for LLM-Powered Business Intelligence
Figure 3 for DataLab: A Unifed Platform for LLM-Powered Business Intelligence
Figure 4 for DataLab: A Unifed Platform for LLM-Powered Business Intelligence
Viaarxiv icon

Compact SPICE model for TeraFET resonant detectors

Add code
Jul 27, 2024
Viaarxiv icon

Robust VAEs via Generating Process of Noise Augmented Data

Add code
Jul 26, 2024
Viaarxiv icon

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding

Add code
Jul 01, 2024
Figure 1 for μ-Bench: A Vision-Language Benchmark for Microscopy Understanding
Figure 2 for μ-Bench: A Vision-Language Benchmark for Microscopy Understanding
Figure 3 for μ-Bench: A Vision-Language Benchmark for Microscopy Understanding
Figure 4 for μ-Bench: A Vision-Language Benchmark for Microscopy Understanding
Viaarxiv icon

Why are Visually-Grounded Language Models Bad at Image Classification?

Add code
May 28, 2024
Viaarxiv icon