Chenhang Cui

Dual-Optimized Adaptive Graph Reconstruction for Multi-View Graph Clustering

Oct 30, 2024
Via arXiv

Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment

Oct 18, 2024
Via arXiv

MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models

Oct 14, 2024
Via arXiv

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Jul 05, 2024
Via arXiv

Calibrated Self-Rewarding Vision Language Models

May 23, 2024
Via arXiv

Aligning Modalities in Vision Large Language Models via Preference Fine-tuning

Feb 18, 2024
Via arXiv

How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs

Nov 27, 2023
Via arXiv

Holistic Analysis of Hallucination in GPT-4V: Bias and Interference Challenges

Nov 07, 2023
Via arXiv

Analyzing and Mitigating Object Hallucination in Large Vision-Language Models

Oct 01, 2023
Via arXiv

A Novel Approach for Effective Multi-View Clustering with Information-Theoretic Perspective

Sep 25, 2023
Via arXiv