Picture for Chunfeng Yuan

Chunfeng Yuan

SA-GNAS: Seed Architecture Expansion for Efficient Large-scale Graph Neural Architecture Search

Add code
Dec 03, 2024
Viaarxiv icon

mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA

Add code
Nov 22, 2024
Figure 1 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 2 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 3 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Figure 4 for mR$^2$AG: Multimodal Retrieval-Reflection-Augmented Generation for Knowledge-Based VQA
Viaarxiv icon

SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning

Add code
Nov 15, 2024
Figure 1 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 2 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 3 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Figure 4 for SEAGULL: No-reference Image Quality Assessment for Regions of Interest via Vision-Language Instruction Tuning
Viaarxiv icon

Granularity Matters in Long-Tail Learning

Add code
Oct 21, 2024
Figure 1 for Granularity Matters in Long-Tail Learning
Figure 2 for Granularity Matters in Long-Tail Learning
Figure 3 for Granularity Matters in Long-Tail Learning
Figure 4 for Granularity Matters in Long-Tail Learning
Viaarxiv icon

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

Add code
Jul 21, 2024
Figure 1 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 2 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 3 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Figure 4 for MIBench: Evaluating Multimodal Large Language Models over Multiple Images
Viaarxiv icon

How to Make Cross Encoder a Good Teacher for Efficient Image-Text Retrieval?

Add code
Jul 10, 2024
Viaarxiv icon

EA-VTR: Event-Aware Video-Text Retrieval

Add code
Jul 10, 2024
Viaarxiv icon

PromptIQA: Boosting the Performance and Generalization for No-Reference Image Quality Assessment via Prompts

Add code
Mar 08, 2024
Viaarxiv icon

Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training

Add code
Mar 01, 2024
Viaarxiv icon

Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval

Add code
Feb 26, 2024
Viaarxiv icon