Picture for Katsutoshi Itoyama

Katsutoshi Itoyama

Department of Systems and Control Engineering, School of Engineering, Tokyo Institute of Technology, Tokyo, Japan, Honda Research Institute Japan Co., Ltd., Saitama, Japan

Bird Vocalization Embedding Extraction Using Self-Supervised Disentangled Representation Learning

Add code
Dec 28, 2024
Viaarxiv icon

UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios

Add code
Aug 09, 2024
Viaarxiv icon

Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?

Add code
Jul 22, 2024
Viaarxiv icon

SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

Add code
May 30, 2024
Figure 1 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Figure 2 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Figure 3 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Figure 4 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Viaarxiv icon

From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution

Add code
Jan 26, 2024
Viaarxiv icon

Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers

Add code
Sep 21, 2023
Viaarxiv icon

Metric-based multimodal meta-learning for human movement identification via footstep recognition

Add code
Nov 15, 2021
Figure 1 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 2 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 3 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 4 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Viaarxiv icon

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Add code
Mar 31, 2019
Figure 1 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 2 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 3 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 4 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Viaarxiv icon

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

Add code
Mar 19, 2018
Figure 1 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 2 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 3 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 4 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Viaarxiv icon

Generative Statistical Models with Self-Emergent Grammar of Chord Sequences

Add code
Mar 02, 2018
Figure 1 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 2 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 3 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 4 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Viaarxiv icon