Cheng Zhang

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs

Mar 10, 2026
Via arXiv

From Blurry to Believable: Enhancing Low-quality Talking Heads with 3D Generative Priors

Feb 05, 2026
Via arXiv

Joint Optimization of Latency and Accuracy for Split Federated Learning in User-Centric Cell-Free MIMO Networks

Feb 05, 2026
Via arXiv

CL-bench: A Benchmark for Context Learning

Feb 03, 2026
Via arXiv

Adaptive Dual-Weighting Framework for Federated Learning via Out-of-Distribution Detection

Feb 01, 2026
Via arXiv

Importance Weighted Variational Inference without the Reparameterization Trick

Feb 01, 2026
Via arXiv

A Kernel Approach for Semi-implicit Variational Inference

Jan 17, 2026
Via arXiv

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Jan 15, 2026
Via arXiv

CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

Jan 14, 2026
Via arXiv

HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation

Dec 29, 2025
Via arXiv