Picture for Dongdong Chen

Dongdong Chen

LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation

Add code
Nov 07, 2024
Figure 1 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 2 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 3 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Figure 4 for LLM2CLIP: Powerful Language Model Unlock Richer Visual Representation
Viaarxiv icon

ToolBridge: An Open-Source Dataset to Equip LLMs with External Tool Capabilities

Add code
Oct 08, 2024
Viaarxiv icon

SynChart: Synthesizing Charts from Language Models

Add code
Sep 25, 2024
Viaarxiv icon

Pluralistic Salient Object Detection

Add code
Sep 04, 2024
Viaarxiv icon

Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM

Add code
Jul 31, 2024
Viaarxiv icon

Rethinking Visual Prompting for Multimodal Large Language Models with External Knowledge

Add code
Jul 05, 2024
Viaarxiv icon

Transformer based Pluralistic Image Completion with Reduced Information Loss

Add code
Apr 15, 2024
Viaarxiv icon

OmniVid: A Generative Framework for Universal Video Understanding

Add code
Mar 26, 2024
Viaarxiv icon

Generative Enhancement for 3D Medical Images

Add code
Mar 19, 2024
Viaarxiv icon

Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation

Add code
Mar 18, 2024
Viaarxiv icon