Picture for Xiongkuo Min

Xiongkuo Min

Omni$^2$: Unifying Omnidirectional Image Generation and Editing in an Omni Model

Add code
Apr 15, 2025
Viaarxiv icon

PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving

Add code
Apr 15, 2025
Viaarxiv icon

Towards Explainable Partial-AIGC Image Quality Assessment

Add code
Apr 12, 2025
Viaarxiv icon

LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs

Add code
Apr 11, 2025
Viaarxiv icon

Q-Agent: Quality-Driven Chain-of-Thought Image Restoration Agent through Robust Multimodal Large Language Model

Add code
Apr 09, 2025
Viaarxiv icon

Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes

Add code
Apr 02, 2025
Viaarxiv icon

Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy

Add code
Mar 27, 2025
Viaarxiv icon

Image Quality Assessment: From Human to Machine Preference

Add code
Mar 13, 2025
Viaarxiv icon

Information Density Principle for MLLM Benchmarks

Add code
Mar 13, 2025
Viaarxiv icon

Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content

Add code
Mar 05, 2025
Viaarxiv icon