Picture for Jingyuan Qi

Jingyuan Qi

RoRA-VLM: Robust Retrieval-Augmented Vision Language Models

Add code
Oct 11, 2024
Figure 1 for RoRA-VLM: Robust Retrieval-Augmented Vision Language Models
Figure 2 for RoRA-VLM: Robust Retrieval-Augmented Vision Language Models
Figure 3 for RoRA-VLM: Robust Retrieval-Augmented Vision Language Models
Figure 4 for RoRA-VLM: Robust Retrieval-Augmented Vision Language Models
Viaarxiv icon

MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks

Add code
Oct 08, 2023
Viaarxiv icon

The Art of SOCRATIC QUESTIONING: Zero-shot Multimodal Reasoning with Recursive Thinking and Self-Questioning

Add code
May 24, 2023
Viaarxiv icon

A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy

Add code
Dec 16, 2021
Figure 1 for A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy
Figure 2 for A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy
Figure 3 for A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy
Figure 4 for A Deep-Learning Framework for Improving COVID-19 CT Image Quality and Diagnostic Accuracy
Viaarxiv icon