Picture for Wenwu Wang

Wenwu Wang

AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models

Add code
Nov 28, 2024
Viaarxiv icon

PSELDNets: Pre-trained Neural Networks on Large-scale Synthetic Datasets for Sound Event Localization and Detection

Add code
Nov 10, 2024
Viaarxiv icon

Differentiable Interacting Multiple Model Particle Filtering

Add code
Oct 01, 2024
Figure 1 for Differentiable Interacting Multiple Model Particle Filtering
Figure 2 for Differentiable Interacting Multiple Model Particle Filtering
Figure 3 for Differentiable Interacting Multiple Model Particle Filtering
Viaarxiv icon

FlowSep: Language-Queried Sound Separation with Rectified Flow Matching

Add code
Sep 11, 2024
Viaarxiv icon

Efficient Audio Captioning with Encoder-Level Knowledge Distillation

Add code
Jul 19, 2024
Viaarxiv icon

Universal Sound Separation with Self-Supervised Audio Masked Autoencoder

Add code
Jul 16, 2024
Viaarxiv icon

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

Add code
Jul 06, 2024
Viaarxiv icon

Improving Audio Generation with Visual Enhanced Caption

Add code
Jul 05, 2024
Viaarxiv icon

Learning Retrieval Augmentation for Personalized Dialogue Generation

Add code
Jun 27, 2024
Viaarxiv icon

Selective Prompting Tuning for Personalized Conversations with LLMs

Add code
Jun 26, 2024
Viaarxiv icon