Picture for Yasunori Ohishi

Yasunori Ohishi

M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation

Add code
Jun 04, 2024
Viaarxiv icon

Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection

Add code
Apr 26, 2024
Viaarxiv icon

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis

Add code
Apr 12, 2024
Viaarxiv icon

Masked Modeling Duo: Towards a Universal Audio Pre-training Framework

Add code
Apr 09, 2024
Viaarxiv icon

Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval

Add code
Mar 16, 2024
Figure 1 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Figure 2 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Figure 3 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Figure 4 for Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval
Viaarxiv icon

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement

Add code
Aug 23, 2023
Viaarxiv icon

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation

Add code
May 23, 2023
Viaarxiv icon

First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline

Add code
Mar 01, 2023
Viaarxiv icon

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input

Add code
Oct 26, 2022
Viaarxiv icon

ConceptBeam: Concept Driven Target Speech Extraction

Add code
Jul 25, 2022
Figure 1 for ConceptBeam: Concept Driven Target Speech Extraction
Figure 2 for ConceptBeam: Concept Driven Target Speech Extraction
Figure 3 for ConceptBeam: Concept Driven Target Speech Extraction
Figure 4 for ConceptBeam: Concept Driven Target Speech Extraction
Viaarxiv icon