Xiaohong Liu

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Dec 22, 2025
Via arXiv

Embodied Image Compression

Dec 12, 2025
Via arXiv

Using GUI Agent for Electronic Design Automation

Dec 12, 2025
Via arXiv

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Nov 18, 2025
Via arXiv

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Nov 17, 2025
Via arXiv

MULTIBENCH++: A Unified and Comprehensive Multimodal Fusion Benchmarking Across Specialized Domains

Nov 14, 2025
Via arXiv

FlashEdit: Decoupling Speed, Structure, and Semantics for Precise Image Editing

Sep 26, 2025
Via arXiv

Blind-Spot Guided Diffusion for Self-supervised Real-World Denoising

Sep 19, 2025
Via arXiv

AU-IQA: A Benchmark Dataset for Perceptual Quality Assessment of AI-Enhanced User-Generated Content

Aug 07, 2025
Via arXiv

Audio-Assisted Face Video Restoration with Temporal and Identity Complementary Learning

Aug 06, 2025
Via arXiv