Title: Dog-IQA: Standard-guided Zero-shot MLLM for Mix-grained Image Quality Assessment

Oct 03, 2024

Kai Liu, Ziqing Zhang, Wenbo Li, Renjing Pei, Fenglong Song, Xiaohong Liu, Linghe Kong, Yulun Zhang

Abstract: Image quality assessment (IQA) serves as the gold standard for evaluating model performance in nearly all computer vision fields. However, it still suffers from poor out-of-distribution generalization and expensive training costs. To address these problems, we propose Dog-IQA, a standard-guided zero-shot mix-grained IQA method, which is training-free and utilizes the exceptional prior knowledge of multimodal large language models (MLLMs). To obtain accurate IQA scores, namely scores consistent with human judgments, we design an MLLM-based inference pipeline that imitates human experts. In detail, Dog-IQA applies two techniques. First, Dog-IQA scores objectively against specific standards that exploit the MLLM's behavior pattern and minimize the influence of subjective factors. Second, Dog-IQA takes both local semantic objects and the whole image as input and aggregates their scores, leveraging local and global information. Dog-IQA achieves state-of-the-art (SOTA) performance compared with training-free methods, and competitive performance compared with training-based methods in cross-dataset scenarios. Our code and models will be available at https://github.com/Kai-Liu001/Dog-IQA.
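
The second technique, aggregating per-object scores with a whole-image score, can be illustrated with a small sketch. The abstract does not state the aggregation rule, so everything here is an assumption made for illustration: the function name `aggregate_quality`, the area-proportional weighting of local scores, and the `global_weight` blending parameter are hypothetical and are not taken from the paper or its repository. The scores themselves are assumed to come from an MLLM queried elsewhere.

```python
from typing import Sequence


def aggregate_quality(
    global_score: float,
    local_scores: Sequence[float],
    local_areas: Sequence[float],
    global_weight: float = 0.5,
) -> float:
    """Blend a whole-image score with per-object scores.

    Assumption (not from the paper): local scores are averaged with
    weights proportional to each object's area, then mixed with the
    global score using `global_weight`.
    """
    if len(local_scores) != len(local_areas):
        raise ValueError("local_scores and local_areas must have equal length")
    if not 0.0 <= global_weight <= 1.0:
        raise ValueError("global_weight must be in [0, 1]")

    total_area = sum(local_areas)
    if not local_scores or total_area <= 0:
        # No usable objects: fall back to the global score alone.
        return global_score

    # Area-weighted mean of the per-object scores.
    local_mean = sum(s * a for s, a in zip(local_scores, local_areas)) / total_area
    return global_weight * global_score + (1.0 - global_weight) * local_mean


if __name__ == "__main__":
    # Hypothetical scores on a discrete quality scale, as an MLLM might return.
    print(aggregate_quality(4.0, [2.0, 6.0], [1.0, 3.0]))
```

Under these assumptions, a large degraded object pulls the final score down more than a small one, while the global term keeps the result anchored to the overall impression of the image.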

* 10 pages, 5 figures. The code and models will be available at https://github.com/Kai-Liu001/Dog-IQA
