Picture for Xiaohong Liu

Xiaohong Liu

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Add code
Dec 22, 2025
Viaarxiv icon

Embodied Image Compression

Add code
Dec 12, 2025
Viaarxiv icon

Using GUI Agent for Electronic Design Automation

Add code
Dec 12, 2025
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Add code
Nov 17, 2025
Viaarxiv icon

MULTIBENCH++: A Unified and Comprehensive Multimodal Fusion Benchmarking Across Specialized Domains

Add code
Nov 14, 2025
Viaarxiv icon

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing

Add code
Sep 26, 2025
Viaarxiv icon

Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising

Add code
Sep 19, 2025
Viaarxiv icon

AU-IQA: A Benchmark Dataset for Perceptual Quality Assessment of AI-Enhanced User-Generated Content

Add code
Aug 07, 2025
Viaarxiv icon

Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning

Add code
Aug 06, 2025
Figure 1 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Figure 2 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Figure 3 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Figure 4 for Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning
Viaarxiv icon