Pu Wang

Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation

Nov 26, 2024
Via arXiv

ControlMM: Controllable Masked Motion Generation

Oct 14, 2024
Via arXiv

MLLM-FL: Multimodal Large Language Model Assisted Federated Learning on Heterogeneous and Long-tailed Data

Sep 09, 2024
Via arXiv

Learning Traffic Crashes as Language: Datasets, Benchmarks, and What-if Causal Analyses

Jun 16, 2024
Via arXiv

Complex Image-Generative Diffusion Transformer for Audio Denoising

Jun 13, 2024
Via arXiv

LLAVIDAL: Benchmarking Large Language Vision Models for Daily Activities of Living

Jun 13, 2024
Via arXiv

Diffusion Gaussian Mixture Audio Denoise

Jun 13, 2024
Via arXiv

BAMM: Bidirectional Autoregressive Motion Model

Apr 01, 2024
Via arXiv

MMM: Generative Masked Motion Model

Dec 06, 2023
Via arXiv

DCHT: Deep Complex Hybrid Transformer for Speech Enhancement

Oct 30, 2023
Via arXiv