Picture for Roger Hsiao

Roger Hsiao

3DGS$^2$-TR: Scalable Second-Order Trust-Region Method for 3D Gaussian Splatting

Add code
Jan 30, 2026
Viaarxiv icon

Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval

Add code
Nov 04, 2024
Viaarxiv icon

Optimizing Byte-level Representation for End-to-end ASR

Add code
Jun 14, 2024
Viaarxiv icon

Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers

Add code
May 23, 2023
Figure 1 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Figure 2 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Figure 3 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Figure 4 for Cross-lingual Knowledge Transfer and Iterative Pseudo-labeling for Low-Resource Speech Recognition with Transducers
Viaarxiv icon

Neural Transducer Training: Reduced Memory Consumption with Sample-wise Computation

Add code
Nov 29, 2022
Viaarxiv icon

Variable Attention Masking for Configurable Transformer Transducer Speech Recognition

Add code
Nov 02, 2022
Figure 1 for Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Figure 2 for Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Figure 3 for Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Figure 4 for Variable Attention Masking for Configurable Transformer Transducer Speech Recognition
Viaarxiv icon

Bilingual End-to-End ASR with Byte-Level Subwords

Add code
May 01, 2022
Figure 1 for Bilingual End-to-End ASR with Byte-Level Subwords
Figure 2 for Bilingual End-to-End ASR with Byte-Level Subwords
Figure 3 for Bilingual End-to-End ASR with Byte-Level Subwords
Figure 4 for Bilingual End-to-End ASR with Byte-Level Subwords
Viaarxiv icon

Online Automatic Speech Recognition with Listen, Attend and Spell Model

Add code
Aug 12, 2020
Figure 1 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Figure 2 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Figure 3 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Figure 4 for Online Automatic Speech Recognition with Listen, Attend and Spell Model
Viaarxiv icon

Improving Language Identification for Multilingual Speakers

Add code
Jan 29, 2020
Figure 1 for Improving Language Identification for Multilingual Speakers
Figure 2 for Improving Language Identification for Multilingual Speakers
Figure 3 for Improving Language Identification for Multilingual Speakers
Figure 4 for Improving Language Identification for Multilingual Speakers
Viaarxiv icon

Migrating Monarch Butterfly Localization Using Multi-Sensor Fusion Neural Networks

Add code
Dec 14, 2019
Figure 1 for Migrating Monarch Butterfly Localization Using Multi-Sensor Fusion Neural Networks
Figure 2 for Migrating Monarch Butterfly Localization Using Multi-Sensor Fusion Neural Networks
Figure 3 for Migrating Monarch Butterfly Localization Using Multi-Sensor Fusion Neural Networks
Figure 4 for Migrating Monarch Butterfly Localization Using Multi-Sensor Fusion Neural Networks
Viaarxiv icon