Yuhui Zhang

Compact SPICE model for TeraFET resonant detectors

Jul 27, 2024

Robust VAEs via Generating Process of Noise Augmented Data

Jul 26, 2024

μ-Bench: A Vision-Language Benchmark for Microscopy Understanding

Jul 01, 2024

Why are Visually-Grounded Language Models Bad at Image Classification?

May 28, 2024

A General and Efficient Federated Split Learning with Pre-trained Image Transformers for Heterogeneous Data

Mar 24, 2024

VideoAgent: Long-form Video Understanding with Large Language Model as Agent

Mar 15, 2024

Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data

Jan 16, 2024

Describing Differences in Image Sets with Natural Language

Dec 05, 2023

Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation

Nov 27, 2023

MoCa: Measuring Human-Language Model Alignment on Causal and Moral Judgment Tasks

Oct 31, 2023