Diji Yang

Right this way: Can VLMs Guide Us to See More to Answer Questions?

Nov 01, 2024
Via arXiv

Dual-Model Distillation for Efficient Action Classification with Hybrid Edge-Cloud Solution

Oct 16, 2024
Via arXiv

IM-RAG: Multi-Round Retrieval-Augmented Generation Through Learning Inner Monologues

May 15, 2024
Via arXiv

Tackling Vision Language Tasks Through Learning Inner Monologues

Aug 19, 2023
Via arXiv

CPL: Counterfactual Prompt Learning for Vision and Language Models

Oct 19, 2022
Via arXiv