Picture for Pu Wang

Pu Wang

ControlMM: Controllable Masked Motion Generation

Add code
Oct 14, 2024
Viaarxiv icon

MLLM-FL: Multimodal Large Language Model Assisted Federated Learning on Heterogeneous and Long-tailed Data

Add code
Sep 09, 2024
Viaarxiv icon

Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses

Add code
Jun 16, 2024
Viaarxiv icon

LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

Add code
Jun 13, 2024
Viaarxiv icon

Complex Image-Generative Diffusion Transformer for Audio Denoising

Add code
Jun 13, 2024
Viaarxiv icon

Diffusion Gaussian Mixture Audio Denoise

Add code
Jun 13, 2024
Viaarxiv icon

BAMM: Bidirectional Autoregressive Motion Model

Add code
Apr 01, 2024
Viaarxiv icon

MMM: Generative Masked Motion Model

Add code
Dec 06, 2023
Viaarxiv icon

DCHT: Deep Complex Hybrid Transformer for Speech Enhancement

Add code
Oct 30, 2023
Viaarxiv icon

DPATD: Dual-Phase Audio Transformer for Denoising

Add code
Oct 30, 2023
Viaarxiv icon