Picture for Nobutaka Ono

Nobutaka Ono

What Do Neurons Listen To? A Neuron-level Dissection of a General-purpose Audio Model

Add code
Feb 17, 2026
Viaarxiv icon

Fast Swap-Based Element Selection for Multiplication-Free Dimension Reduction

Add code
Feb 14, 2026
Viaarxiv icon

Incremental Averaging Method to Improve Graph-Based Time-Difference-of-Arrival Estimation

Add code
Jul 09, 2025
Viaarxiv icon

Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes

Add code
Jun 12, 2025
Figure 1 for Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
Figure 2 for Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
Figure 3 for Description and Discussion on DCASE 2025 Challenge Task 4: Spatial Semantic Segmentation of Sound Scenes
Viaarxiv icon

Mel-Spectrogram Inversion via Alternating Direction Method of Multipliers

Add code
Jan 09, 2025
Viaarxiv icon

Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis

Add code
Apr 12, 2024
Figure 1 for Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Figure 2 for Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Figure 3 for Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Figure 4 for Guided Masked Self-Distillation Modeling for Distributed Multimedia Sensor Event Analysis
Viaarxiv icon

Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase

Add code
Jul 23, 2023
Viaarxiv icon

Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation

Add code
Jul 23, 2023
Figure 1 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Figure 2 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Figure 3 for Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation
Viaarxiv icon

Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge

Add code
Feb 15, 2023
Viaarxiv icon

End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation

Add code
Oct 19, 2022
Figure 1 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 2 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 3 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Figure 4 for End-to-End Integration of Speech Recognition, Dereverberation, Beamforming, and Self-Supervised Learning Representation
Viaarxiv icon