Kazuhiro Nakadai

Honda Research Institute Japan Co., Ltd., Saitama, Japan

Improvement in Sign Language Translation Using Text CTC Alignment

Dec 12, 2024

UAV-Enhanced Combination to Application: Comprehensive Analysis and Benchmarking of a Human Detection Dataset for Disaster Scenarios

Aug 09, 2024

Can all variations within the unified mask-based beamformer framework achieve identical peak extraction performance?

Jul 22, 2024

SLAM-based Joint Calibration of Multiple Asynchronous Microphone Arrays and Sound Source Localization

May 30, 2024

From Blurry to Brilliant Detection: YOLOv5-Based Aerial Object Detection with Super Resolution

Jan 26, 2024

Is the Ideal Ratio Mask Really the Best? -- Exploring the Best Extraction Performance and Optimal Mask of Mask-based Beamformers

Sep 21, 2023

Retraining-free Customized ASR for Enharmonic Words Based on a Named-Entity-Aware Model and Phoneme Similarity Estimation

May 29, 2023

Observability Analysis of Graph SLAM-Based Joint Calibration of Multiple Microphone Arrays and Sound Source Localization

Oct 11, 2022

Metric-based multimodal meta-learning for human movement identification via footstep recognition

Nov 15, 2021

CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments

Nov 07, 2018