Picture for Dianqi Li

Dianqi Li

LangBridge: Interpreting Image as a Combination of Language Embeddings

Add code
Mar 26, 2025
Viaarxiv icon

ImageGen-CoT: Enhancing Text-to-Image In-context Learning with Chain-of-Thought Reasoning

Add code
Mar 25, 2025
Viaarxiv icon

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Add code
Dec 05, 2024
Figure 1 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 2 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 3 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Figure 4 for Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion
Viaarxiv icon

Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation

Add code
Oct 07, 2024
Viaarxiv icon

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Add code
Sep 24, 2024
Figure 1 for Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Figure 2 for Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Figure 3 for Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Figure 4 for Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts
Viaarxiv icon

RuleR: Improving LLM Controllability by Rule-based Data Recycling

Add code
Jun 22, 2024
Viaarxiv icon

AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models

Add code
Jun 16, 2024
Figure 1 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Figure 2 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Figure 3 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Figure 4 for AUTOHALLUSION: Automatic Generation of Hallucination Benchmarks for Vision-Language Models
Viaarxiv icon

TASA: Deceiving Question Answering Models by Twin Answer Sentences Attack

Add code
Oct 27, 2022
Viaarxiv icon

Phrase-level Textual Adversarial Attack with Label Preservation

Add code
May 24, 2022
Figure 1 for Phrase-level Textual Adversarial Attack with Label Preservation
Figure 2 for Phrase-level Textual Adversarial Attack with Label Preservation
Figure 3 for Phrase-level Textual Adversarial Attack with Label Preservation
Figure 4 for Phrase-level Textual Adversarial Attack with Label Preservation
Viaarxiv icon

Contextualized Perturbation for Textual Adversarial Attack

Add code
Sep 16, 2020
Figure 1 for Contextualized Perturbation for Textual Adversarial Attack
Figure 2 for Contextualized Perturbation for Textual Adversarial Attack
Figure 3 for Contextualized Perturbation for Textual Adversarial Attack
Figure 4 for Contextualized Perturbation for Textual Adversarial Attack
Viaarxiv icon