Picture for Guangtao Zhai

Guangtao Zhai

Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs

Add code
Dec 19, 2025
Viaarxiv icon

Embodied Image Compression

Add code
Dec 12, 2025
Viaarxiv icon

Using GUI Agent for Electronic Design Automation

Add code
Dec 12, 2025
Viaarxiv icon

ManipShield: A Unified Framework for Image Manipulation Detection, Localization and Explanation

Add code
Nov 18, 2025
Viaarxiv icon

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Add code
Nov 18, 2025
Viaarxiv icon

GeoX-Bench: Benchmarking Cross-View Geo-Localization and Pose Estimation Capabilities of Large Multimodal Models

Add code
Nov 17, 2025
Viaarxiv icon

Data Assessment for Embodied Intelligence

Add code
Nov 12, 2025
Viaarxiv icon

MACEval: A Multi-Agent Continual Evaluation Network for Large Models

Add code
Nov 12, 2025
Viaarxiv icon

Evaluating from Benign to Dynamic Adversarial: A Squid Game for Large Language Models

Add code
Nov 12, 2025
Viaarxiv icon

One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework

Add code
Nov 05, 2025
Viaarxiv icon