Picture for Ning Shi

Ning Shi

Cross-Modal Consistency in Multimodal Large Language Models

Add code
Nov 14, 2024
Viaarxiv icon

MIO: A Foundation Model on Multimodal Tokens

Add code
Sep 26, 2024
Figure 1 for MIO: A Foundation Model on Multimodal Tokens
Figure 2 for MIO: A Foundation Model on Multimodal Tokens
Figure 3 for MIO: A Foundation Model on Multimodal Tokens
Figure 4 for MIO: A Foundation Model on Multimodal Tokens
Viaarxiv icon

Action Controlled Paraphrasing

Add code
May 18, 2024
Figure 1 for Action Controlled Paraphrasing
Figure 2 for Action Controlled Paraphrasing
Figure 3 for Action Controlled Paraphrasing
Figure 4 for Action Controlled Paraphrasing
Viaarxiv icon

Optimizing Negative Prompts for Enhanced Aesthetics and Fidelity in Text-To-Image Generation

Add code
Mar 12, 2024
Viaarxiv icon

Lost in Translation: When GPT-4V Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond

Add code
Oct 19, 2023
Viaarxiv icon

UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation

Add code
Jun 24, 2023
Viaarxiv icon

From Adversarial Arms Race to Model-centric Evaluation: Motivating a Unified Automatic Robustness Evaluation Framework

Add code
May 29, 2023
Viaarxiv icon

Don't Trust GPT When Your Question Is Not In English

Add code
May 24, 2023
Viaarxiv icon

Interactive Natural Language Processing

Add code
May 22, 2023
Figure 1 for Interactive Natural Language Processing
Figure 2 for Interactive Natural Language Processing
Figure 3 for Interactive Natural Language Processing
Figure 4 for Interactive Natural Language Processing
Viaarxiv icon

RoChBert: Towards Robust BERT Fine-tuning for Chinese

Add code
Oct 28, 2022
Viaarxiv icon