Picture for Chaoya Jiang

Chaoya Jiang

MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model

Add code
Aug 26, 2024
Viaarxiv icon

MIBench: Evaluating Multimodal Large Language Models over Multiple Images

Add code
Jul 21, 2024
Viaarxiv icon

Enhancing In-Context Learning via Implicit Demonstration Augmentation

Add code
Jun 27, 2024
Figure 1 for Enhancing In-Context Learning via Implicit Demonstration Augmentation
Figure 2 for Enhancing In-Context Learning via Implicit Demonstration Augmentation
Figure 3 for Enhancing In-Context Learning via Implicit Demonstration Augmentation
Figure 4 for Enhancing In-Context Learning via Implicit Demonstration Augmentation
Viaarxiv icon

Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

Add code
Feb 24, 2024
Viaarxiv icon

TiMix: Text-aware Image Mixing for Effective Vision-Language Pre-training

Add code
Dec 14, 2023
Viaarxiv icon

Hallucination Augmented Contrastive Learning for Multimodal Large Language Model

Add code
Dec 13, 2023
Viaarxiv icon

BUS:Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization

Add code
Jul 17, 2023
Viaarxiv icon

PandaLM: An Automatic Evaluation Benchmark for LLM Instruction Tuning Optimization

Add code
Jun 08, 2023
Viaarxiv icon

Exploiting Pseudo Image Captions for Multimodal Summarization

Add code
May 09, 2023
Viaarxiv icon

Vision Langauge Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation

Add code
May 09, 2023
Viaarxiv icon