Weijia Li

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Apr 03, 2025
Via arXiv

Scene4U: Hierarchical Layered 3D Scene Reconstruction from Single Panoramic Image for Your Immerse Exploration

Apr 01, 2025
Via arXiv

LEGION: Learning to Ground and Explain for Synthetic Image Detection

Mar 19, 2025
Via arXiv

Spot the Fake: Large Multimodal Model-Based Synthetic Image Detection with Artifact Explanation

Mar 19, 2025
Via arXiv

Token Pruning in Multimodal Large Language Models: Are We Solving the Right Problem?

Feb 17, 2025
Via arXiv

Stop Looking for Important Tokens in Multimodal Language Models: Duplication Matters More

Feb 17, 2025
Via arXiv

Where am I? Cross-View Geo-localization with Natural Language Descriptions

Dec 22, 2024
Via arXiv

Beyond Static Assumptions: the Predictive Justified Perspective Model for Epistemic Planning

Dec 10, 2024
Via arXiv

LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models

Oct 13, 2024
Via arXiv

UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios

Aug 30, 2024
Via arXiv