Picture for Qingyi Si

Qingyi Si

A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles

Add code
Nov 04, 2024
Viaarxiv icon

Towards Flexible Evaluation for Generative Visual Question Answering

Add code
Aug 01, 2024
Figure 1 for Towards Flexible Evaluation for Generative Visual Question Answering
Figure 2 for Towards Flexible Evaluation for Generative Visual Question Answering
Figure 3 for Towards Flexible Evaluation for Generative Visual Question Answering
Figure 4 for Towards Flexible Evaluation for Generative Visual Question Answering
Viaarxiv icon

Multimodal Table Understanding

Add code
Jun 12, 2024
Figure 1 for Multimodal Table Understanding
Figure 2 for Multimodal Table Understanding
Figure 3 for Multimodal Table Understanding
Figure 4 for Multimodal Table Understanding
Viaarxiv icon

Think out Loud: Emotion Deducing Explanation in Dialogues

Add code
Jun 07, 2024
Viaarxiv icon

Are Large Language Models Table-based Fact-Checkers?

Add code
Feb 04, 2024
Viaarxiv icon

Towards Unified Interactive Visual Grounding in The Wild

Add code
Jan 30, 2024
Viaarxiv icon

An Empirical Study of Instruction-tuning Large Language Models in Chinese

Add code
Oct 20, 2023
Viaarxiv icon

Combo of Thinking and Observing for Outside-Knowledge VQA

Add code
May 10, 2023
Viaarxiv icon

Compressing And Debiasing Vision-Language Pre-Trained Models for Visual Question Answering

Add code
Oct 26, 2022
Viaarxiv icon

Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA

Add code
Oct 10, 2022
Figure 1 for Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Figure 2 for Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Figure 3 for Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Figure 4 for Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA
Viaarxiv icon