Picture for Chunyi Li

Chunyi Li

Automated Safety Benchmarking: A Multi-agent Pipeline for LVLMs

Add code
Jan 27, 2026
Viaarxiv icon

Embodied Image Compression

Add code
Dec 12, 2025
Figure 1 for Embodied Image Compression
Figure 2 for Embodied Image Compression
Figure 3 for Embodied Image Compression
Figure 4 for Embodied Image Compression
Viaarxiv icon

Using GUI Agent for Electronic Design Automation

Add code
Dec 12, 2025
Figure 1 for Using GUI Agent for Electronic Design Automation
Figure 2 for Using GUI Agent for Electronic Design Automation
Figure 3 for Using GUI Agent for Electronic Design Automation
Figure 4 for Using GUI Agent for Electronic Design Automation
Viaarxiv icon

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Add code
Nov 17, 2025
Viaarxiv icon

Data Assessment for Embodied Intelligence

Add code
Nov 12, 2025
Viaarxiv icon

AU-IQA: A Benchmark Dataset for Perceptual Quality Assessment of AI-Enhanced User-Generated Content

Add code
Aug 07, 2025
Viaarxiv icon

The Ever-Evolving Science Exam

Add code
Jul 22, 2025
Figure 1 for The Ever-Evolving Science Exam
Figure 2 for The Ever-Evolving Science Exam
Figure 3 for The Ever-Evolving Science Exam
Figure 4 for The Ever-Evolving Science Exam
Viaarxiv icon

Scaling-up Perceptual Video Quality Assessment

Add code
May 28, 2025
Viaarxiv icon

Perceptual Quality Assessment for Embodied AI

Add code
May 22, 2025
Figure 1 for Perceptual Quality Assessment for Embodied AI
Figure 2 for Perceptual Quality Assessment for Embodied AI
Figure 3 for Perceptual Quality Assessment for Embodied AI
Figure 4 for Perceptual Quality Assessment for Embodied AI
Viaarxiv icon

PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving

Add code
Apr 15, 2025
Viaarxiv icon