Picture for Huan Zhang

Huan Zhang

The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination

Add code
Mar 20, 2025
Viaarxiv icon

Stealthy Backdoor Attack in Self-Supervised Learning Vision Encoders for Large Vision Language Models

Add code
Feb 25, 2025
Viaarxiv icon

Rethinking Diverse Human Preference Learning through Principal Component Analysis

Add code
Feb 18, 2025
Viaarxiv icon

MIH-TCCT: Mitigating Inconsistent Hallucinations in LLMs via Event-Driven Text-Code Cyclic Training

Add code
Feb 13, 2025
Viaarxiv icon

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Add code
Feb 13, 2025
Figure 1 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Figure 2 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Figure 3 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Figure 4 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Viaarxiv icon

RenderBox: Expressive Performance Rendering with Text Control

Add code
Feb 11, 2025
Viaarxiv icon

Compensation based Dictionary Transfer for Similar Multispectral Image Spectral Super-resolution

Add code
Jan 27, 2025
Viaarxiv icon

Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

Add code
Dec 31, 2024
Viaarxiv icon

Causal Composition Diffusion Model for Closed-loop Traffic Generation

Add code
Dec 23, 2024
Viaarxiv icon

BaB-ND: Long-Horizon Motion Planning with Branch-and-Bound and Neural Dynamics

Add code
Dec 12, 2024
Viaarxiv icon