Picture for Kazuhiro Nakadai

Kazuhiro Nakadai

Honda Research Institute Japan Co., Ltd., Saitama, Japan

Distance Based Single-Channel Target Speech Extraction

Add code
Dec 28, 2024
Viaarxiv icon

Bird Vocalization Embedding Extraction Using Self-Supervised Disentangled Representation Learning

Add code
Dec 28, 2024
Viaarxiv icon

Improvement in Sign Language Translation Using Text CTC Alignment

Add code
Dec 12, 2024
Viaarxiv icon

UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios

Add code
Aug 09, 2024
Viaarxiv icon

Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?

Add code
Jul 22, 2024
Viaarxiv icon

SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

Add code
May 30, 2024
Figure 1 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Figure 2 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Figure 3 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Figure 4 for SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization
Viaarxiv icon

From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution

Add code
Jan 26, 2024
Viaarxiv icon

Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers

Add code
Sep 21, 2023
Viaarxiv icon

Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation

Add code
May 29, 2023
Viaarxiv icon

Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization

Add code
Oct 11, 2022
Figure 1 for Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization
Figure 2 for Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization
Figure 3 for Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization
Figure 4 for Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization
Viaarxiv icon