Yuhui Zhang

BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature

Jan 14, 2025, via arXiv

Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation

Jan 06, 2025, via arXiv

DataLab: A Unified Platform for LLM-Powered Business Intelligence

Dec 04, 2024, via arXiv

DataLab: A Unified Platform for LLM-Powered Business Intelligence

Dec 03, 2024, via arXiv

Compact SPICE model for TeraFET resonant detectors

Jul 27, 2024, via arXiv

Robust VAEs via Generating Process of Noise Augmented Data

Jul 26, 2024, via arXiv

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding

Jul 01, 2024, via arXiv

Why are Visually-Grounded Language Models Bad at Image Classification?

May 28, 2024, via arXiv

A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data

Mar 24, 2024, via arXiv

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Mar 15, 2024, via arXiv