Picture for Huang Xie

Huang Xie

Text-based Audio Retrieval by Learning from Similarities between Audio Captions

Add code
Dec 02, 2024
Viaarxiv icon

Multi-label Zero-Shot Audio Classification with Temporal Attention

Add code
Aug 31, 2024
Viaarxiv icon

Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning

Add code
Aug 27, 2024
Figure 1 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Figure 2 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Figure 3 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Figure 4 for Integrating Continuous and Binary Relevances in Audio-Text Relevance Learning
Viaarxiv icon

Crowdsourcing and Evaluating Text-Based Audio Retrieval Relevances

Add code
Jun 16, 2023
Viaarxiv icon

On Negative Sampling for Contrastive Audio-Text Retrieval

Add code
Nov 08, 2022
Viaarxiv icon

Language-based Audio Retrieval Task in DCASE 2022 Challenge

Add code
Oct 04, 2022
Figure 1 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Figure 2 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Figure 3 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Figure 4 for Language-based Audio Retrieval Task in DCASE 2022 Challenge
Viaarxiv icon

DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval

Add code
Jun 15, 2022
Figure 1 for DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval
Figure 2 for DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval
Figure 3 for DCASE 2022 Challenge Task 6B: Language-Based Audio Retrieval
Viaarxiv icon

Zero-Shot Audio Classification using Image Embeddings

Add code
Jun 10, 2022
Figure 1 for Zero-Shot Audio Classification using Image Embeddings
Figure 2 for Zero-Shot Audio Classification using Image Embeddings
Figure 3 for Zero-Shot Audio Classification using Image Embeddings
Figure 4 for Zero-Shot Audio Classification using Image Embeddings
Viaarxiv icon

Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases

Add code
Oct 06, 2021
Figure 1 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Figure 2 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Figure 3 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Figure 4 for Unsupervised Audio-Caption Aligning Learns Correspondences between Individual Sound Events and Textual Phrases
Viaarxiv icon

Zero-Shot Audio Classification Based on Class Label Embeddings

Add code
May 06, 2019
Figure 1 for Zero-Shot Audio Classification Based on Class Label Embeddings
Figure 2 for Zero-Shot Audio Classification Based on Class Label Embeddings
Figure 3 for Zero-Shot Audio Classification Based on Class Label Embeddings
Figure 4 for Zero-Shot Audio Classification Based on Class Label Embeddings
Viaarxiv icon