Picture for Yibing Song

Yibing Song

ATPrompt: Textual Prompt Learning with Embedded Attributes

Add code
Dec 12, 2024
Viaarxiv icon

A Stitch in Time Saves Nine: Small VLM is a Precise Guidance for Accelerating Large VLMs

Add code
Dec 05, 2024
Viaarxiv icon

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

Add code
Nov 25, 2024
Viaarxiv icon

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Add code
Nov 15, 2024
Viaarxiv icon

Aligning Audio-Visual Joint Representations with an Agentic Workflow

Add code
Oct 31, 2024
Viaarxiv icon

LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization

Add code
Oct 22, 2024
Figure 1 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Figure 2 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Figure 3 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Figure 4 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Viaarxiv icon

Dynamic Diffusion Transformer

Add code
Oct 04, 2024
Viaarxiv icon

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Add code
Mar 18, 2024
Viaarxiv icon

A Causal Inspired Early-Branching Structure for Domain Generalization

Add code
Mar 13, 2024
Viaarxiv icon

HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation

Add code
Dec 12, 2023
Viaarxiv icon