Jianyuan Sun

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

Jul 06, 2024
Via arXiv

Audio-Visual Speaker Tracking: Progress, Challenges, and Future Directions

Oct 23, 2023
Via arXiv

Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning

May 30, 2023
Via arXiv

Towards Generating Diverse Audio Captions via Adversarial Training

Dec 05, 2022
Via arXiv

Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention

Oct 28, 2022
Via arXiv

Automated Audio Captioning via Fusion of Low- and High- Dimensional Features

Oct 10, 2022
Via arXiv

On Metric Learning for Audio-Text Cross-Modal Retrieval

Apr 13, 2022
Via arXiv

Leveraging Pre-trained BERT for Audio Captioning

Mar 27, 2022
Via arXiv

Deep Neural Decision Forest for Acoustic Scene Classification

Mar 07, 2022
Via arXiv

Diverse Audio Captioning via Adversarial Training

Oct 13, 2021
Via arXiv