Picture for Xiangtai Li

Xiangtai Li

RelationBooth: Towards Relation-Aware Customized Object Generation

Add code
Oct 30, 2024
Viaarxiv icon

Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image

Add code
Oct 20, 2024
Viaarxiv icon

Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation

Add code
Oct 14, 2024
Figure 1 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 2 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 3 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Figure 4 for Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation
Viaarxiv icon

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Add code
Oct 10, 2024
Figure 1 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Figure 2 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Figure 3 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Figure 4 for Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
Viaarxiv icon

PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners

Add code
Oct 07, 2024
Viaarxiv icon

PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model

Add code
Aug 24, 2024
Figure 1 for PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model
Figure 2 for PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model
Figure 3 for PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model
Figure 4 for PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model
Viaarxiv icon

You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning

Add code
Aug 01, 2024
Viaarxiv icon

LLAVADI: What Matters For Multimodal Large Language Models Distillation

Add code
Jul 28, 2024
Viaarxiv icon

Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language

Add code
Jun 28, 2024
Viaarxiv icon

Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model

Add code
Jun 27, 2024
Viaarxiv icon