Picture for Xinhao Mei

Xinhao Mei

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

Add code
Oct 22, 2023
Viaarxiv icon

FoleyGen: Visually-Guided Audio Generation

Add code
Sep 19, 2023
Viaarxiv icon

Enhance audio generation controllability through representation similarity regularization

Add code
Sep 15, 2023
Viaarxiv icon

AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining

Add code
Aug 10, 2023
Viaarxiv icon

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning

Add code
May 30, 2023
Viaarxiv icon

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Add code
Mar 30, 2023
Viaarxiv icon

AudioLDM: Text-to-Audio Generation with Latent Diffusion Models

Add code
Feb 16, 2023
Viaarxiv icon

Towards Generating Diverse Audio Captions via Adversarial Training

Add code
Dec 05, 2022
Viaarxiv icon

Ontology-aware Learning and Evaluation for Audio Tagging

Add code
Nov 22, 2022
Viaarxiv icon

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention

Add code
Oct 28, 2022
Viaarxiv icon