Picture for Kezhen Chen

Kezhen Chen

Hybrid Primal Sketch: Combining Analogy, Qualitative Representations, and Computer Vision for Scene Understanding

Add code
Jul 05, 2024
Viaarxiv icon

Dragonfly: Multi-Resolution Zoom Supercharges Large Visual-Language Model

Add code
Jun 03, 2024
Viaarxiv icon

IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

Add code
May 15, 2024
Figure 1 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 2 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 3 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Figure 4 for IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues
Viaarxiv icon

Higher Layers Need More LoRA Experts

Add code
Feb 13, 2024
Viaarxiv icon

Evaluation and Mitigation of Agnosia in Multimodal Large Language Models

Add code
Sep 07, 2023
Viaarxiv icon

Tackling Vision Language Tasks Through Learning Inner Monologues

Add code
Aug 19, 2023
Viaarxiv icon

LOWA: Localize Objects in the Wild with Attributes

Add code
May 31, 2023
Viaarxiv icon

Natural- to formal-language generation using Tensor Product Representations

Add code
Oct 05, 2019
Figure 1 for Natural- to formal-language generation using Tensor Product Representations
Figure 2 for Natural- to formal-language generation using Tensor Product Representations
Figure 3 for Natural- to formal-language generation using Tensor Product Representations
Figure 4 for Natural- to formal-language generation using Tensor Product Representations
Viaarxiv icon

Who are the Devils Wearing Prada in New York City?

Add code
Aug 19, 2015
Figure 1 for Who are the Devils Wearing Prada in New York City?
Figure 2 for Who are the Devils Wearing Prada in New York City?
Figure 3 for Who are the Devils Wearing Prada in New York City?
Figure 4 for Who are the Devils Wearing Prada in New York City?
Viaarxiv icon