Picture for Kezhen Chen

Kezhen Chen

SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image Reasoning

Add code
Jan 07, 2025
Viaarxiv icon

Reinforcing Thinking through Reasoning-Enhanced Reward Models

Add code
Dec 31, 2024
Figure 1 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Figure 2 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Figure 3 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Figure 4 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Viaarxiv icon

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Hybrid Primal Sketch: Combining Analogy, Qualitative Representations, and Computer Vision for Scene Understanding

Add code
Jul 05, 2024
Viaarxiv icon

Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model

Add code
Jun 03, 2024
Figure 1 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 2 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 3 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 4 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Viaarxiv icon

IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

Add code
May 15, 2024
Figure 1 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 2 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 3 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 4 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Viaarxiv icon

Higher Layers Need More LoRA Experts

Add code
Feb 13, 2024
Viaarxiv icon

Evaluation and Mitigation of Agnosia in Multimodal Large Language Models

Add code
Sep 07, 2023
Figure 1 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Figure 2 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Figure 3 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Figure 4 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Viaarxiv icon

Tackling Vision Language Tasks Through Learning Inner Monologues

Add code
Aug 19, 2023
Viaarxiv icon

LOWA: Localize Objects in the Wild with Attributes

Add code
May 31, 2023
Viaarxiv icon