Picture for Yibing Song

Yibing Song

Aligning Audio-Visual Joint Representations with an Agentic Workflow

Add code
Oct 31, 2024
Viaarxiv icon

LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization

Add code
Oct 22, 2024
Figure 1 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Figure 2 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Figure 3 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Figure 4 for LFME: A Simple Framework for Learning from Multiple Experts in Domain Generalization
Viaarxiv icon

Dynamic Diffusion Transformer

Add code
Oct 04, 2024
Viaarxiv icon

Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation

Add code
Mar 18, 2024
Viaarxiv icon

A Causal Inspired Early-Branching Structure for Domain Generalization

Add code
Mar 13, 2024
Viaarxiv icon

HeadArtist: Text-conditioned 3D Head Generation with Self Score Distillation

Add code
Dec 12, 2023
Viaarxiv icon

Advancing Vision Transformers with Group-Mix Attention

Add code
Nov 26, 2023
Viaarxiv icon

InstructDET: Diversifying Referring Object Detection with Generalized Instructions

Add code
Oct 17, 2023
Viaarxiv icon

Speed Co-Augmentation for Unsupervised Audio-Visual Pre-training

Add code
Sep 25, 2023
Viaarxiv icon

Domain Generalization via Rationale Invariance

Add code
Aug 22, 2023
Viaarxiv icon