Picture for Katsutoshi Itoyama

Katsutoshi Itoyama

Department of Systems and Control Engineering, School of Engineering, Tokyo Institute of Technology, Tokyo, Japan, Honda Research Institute Japan Co., Ltd., Saitama, Japan

UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios

Add code
Aug 09, 2024
Viaarxiv icon

Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?

Add code
Jul 22, 2024
Viaarxiv icon

SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

Add code
May 30, 2024
Viaarxiv icon

From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution

Add code
Jan 26, 2024
Viaarxiv icon

Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers

Add code
Sep 21, 2023
Viaarxiv icon

Metric-based multimodal meta-learning for human movement identification via footstep recognition

Add code
Nov 15, 2021
Figure 1 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 2 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 3 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Figure 4 for Metric-based multimodal meta-learning for human movement identification via footstep recognition
Viaarxiv icon

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Add code
Mar 31, 2019
Figure 1 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 2 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 3 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 4 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Viaarxiv icon

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

Add code
Mar 19, 2018
Figure 1 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 2 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 3 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 4 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Viaarxiv icon

Generative Statistical Models with Self-Emergent Grammar of Chord Sequences

Add code
Mar 02, 2018
Figure 1 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 2 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 3 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Figure 4 for Generative Statistical Models with Self-Emergent Grammar of Chord Sequences
Viaarxiv icon