Picture for Kezhen Chen

Kezhen Chen

SMIR: Efficient Synthetic Data Pipeline To Improve Multi-Image Reasoning

Add code
Jan 07, 2025
Viaarxiv icon

Reinforcing Thinking through Reasoning-Enhanced Reward Models

Add code
Dec 31, 2024
Figure 1 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Figure 2 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Figure 3 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Figure 4 for Reinforcing Thinking through Reasoning-Enhanced Reward Models
Viaarxiv icon

RedPajama: an Open Dataset for Training Large Language Models

Add code
Nov 19, 2024
Viaarxiv icon

Hybrid Primal Sketch: Combining Analogy, Qualitative Representations, and Computer Vision for Scene Understanding

Add code
Jul 05, 2024
Viaarxiv icon

Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model

Add code
Jun 03, 2024
Figure 1 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 2 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 3 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Figure 4 for Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model
Viaarxiv icon

IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

Add code
May 15, 2024
Figure 1 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 2 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 3 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 4 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Viaarxiv icon

Higher Layers Need More LoRA Experts

Add code
Feb 13, 2024
Figure 1 for Higher Layers Need More LoRA Experts
Figure 2 for Higher Layers Need More LoRA Experts
Figure 3 for Higher Layers Need More LoRA Experts
Figure 4 for Higher Layers Need More LoRA Experts
Viaarxiv icon

Evaluation and Mitigation of Agnosia in Multimodal Large Language Models

Add code
Sep 07, 2023
Figure 1 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Figure 2 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Figure 3 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Figure 4 for Evaluation and Mitigation of Agnosia in Multimodal Large Language Models
Viaarxiv icon

Tackling Vision Language Tasks Through Learning Inner Monologues

Add code
Aug 19, 2023
Viaarxiv icon

LOWA: Localize Objects in the Wild with Attributes

Add code
May 31, 2023
Viaarxiv icon