Picture for Yuxuan Zhang

Yuxuan Zhang

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Add code
Mar 13, 2025
Viaarxiv icon

Symplectic Optimization for Cross Subcarrier Precoder Design with Channel Smoothing in Massive MIMO-OFDM System

Add code
Mar 10, 2025
Viaarxiv icon

EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer

Add code
Mar 10, 2025
Viaarxiv icon

Difix3D+: Improving 3D Reconstructions with Single-Step Diffusion Models

Add code
Mar 03, 2025
Viaarxiv icon

SECURA: Sigmoid-Enhanced CUR Decomposition with Uninterrupted Retention and Low-Rank Adaptation in Large Language Models

Add code
Feb 26, 2025
Viaarxiv icon

PhotoDoodle: Learning Artistic Image Editing from Few-Shot Pairwise Data

Add code
Feb 23, 2025
Viaarxiv icon

ClipRover: Zero-shot Vision-Language Exploration and Target Discovery by Mobile Robots

Add code
Feb 12, 2025
Viaarxiv icon

Survey of Quantization Techniques for On-Device Vision-based Crack Detection

Add code
Feb 04, 2025
Viaarxiv icon

Cosmos World Foundation Model Platform for Physical AI

Add code
Jan 07, 2025
Figure 1 for Cosmos World Foundation Model Platform for Physical AI
Figure 2 for Cosmos World Foundation Model Platform for Physical AI
Figure 3 for Cosmos World Foundation Model Platform for Physical AI
Figure 4 for Cosmos World Foundation Model Platform for Physical AI
Viaarxiv icon

Beyond Words: AuralLLM and SignMST-C for Precise Sign Language Production and Bidirectional Accessibility

Add code
Jan 01, 2025
Viaarxiv icon