Chen-Wei Xie

Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models

Mar 20, 2025
Via arXiv

Rethinking Video Tokenization: A Conditioned Diffusion-based Approach

Mar 05, 2025
Via arXiv

What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Coverage of MLLMs

Feb 19, 2025
Via arXiv

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Jul 03, 2024
Via arXiv

Understanding the Multi-modal Prompts of the Pre-trained Vision-Language Model

Dec 18, 2023
Via arXiv

MomentDiff: Generative Video Moment Retrieval from Random to Real

Jul 06, 2023
Via arXiv

Vortex Pooling: Improving Context Representation in Semantic Segmentation

Apr 22, 2018
Via arXiv

Deep Label Distribution Learning with Label Ambiguity

May 10, 2017
Via arXiv

Deep Descriptor Transforming for Image Co-Localization

May 08, 2017
Via arXiv

Dense CNN Learning with Equivalent Mappings

May 24, 2016
Via arXiv